MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone (Python, updated Jun 12, 2024)
Start building LLM-empowered multi-agent applications more easily.
Transformer Based Model Paper Review and PyTorch Code
Generative Model Paper Review and PyTorch Code
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
GPT4V-level open-source multi-modal model based on Llama3-8B
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal dialogue model approaching GPT-4V performance.
Here we track the latest AI multimodal models, including multimodal foundation models, LLMs, agents, and audio, image, video, music, and 3D content. 🔥
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷
VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.
ModelScope: bring the notion of Model-as-a-Service to life.
Represent, send, store and search multimodal data
Open-source evaluation toolkit for large vision-language models (LVLMs); supports GPT-4V, Gemini, QwenVLPlus, 50+ Hugging Face models, and 20+ benchmarks.
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
Build high-performance AI models with modular building blocks
Open Source Routing Engine for OpenStreetMap
Basic implementation code for multimodal models and some applications or fine-tuning tasks based on them.
[ICLR 2024 Spotlight] This is the official code for the paper "SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training"
Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language models (LMs), with a particular focus on large language models (LLMs)