A high-performance inference system for large language models, designed for production environments.
Updated Jun 12, 2024 - C++
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
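As a hedged sketch of what that "single line of code" swap means (Xinference exposes an OpenAI-compatible REST API; the URL, port, and model name below are illustrative assumptions, not values from the Xinference docs):

```python
# Hedged sketch: Xinference serves an OpenAI-compatible API, so an app
# switches providers by pointing its client at the local server.
# The endpoint http://localhost:9997/v1 and the key strings are
# illustrative placeholders only.

def client_config(use_xinference: bool) -> dict:
    """Return the settings an OpenAI-style client would be built with."""
    if use_xinference:
        # The one changed line: a local Xinference endpoint instead of OpenAI.
        return {"base_url": "http://localhost:9997/v1", "api_key": "local-key"}
    return {"base_url": "https://api.openai.com/v1", "api_key": "real-key"}

# Everything else in the app (chat-completion calls, prompts, response
# parsing) stays the same; only the base URL and key differ.
print(client_config(True)["base_url"])
```

The rest of the application code is unchanged because both endpoints speak the same wire protocol.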
An efficient, flexible, and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Unify Efficient Fine-Tuning of 100+ LLMs
Telegram bot with AI powered by Llama 2 and Llama 3
Fluent CLI is an advanced command-line interface designed to interact seamlessly with multiple workflow systems like FlowiseAI, Langflow, Make, and Zapier. Tailored for developers and IT professionals, Fluent CLI facilitates robust automation, simplifies complex interactions, and enhances productivity through a powerful command suite.
Python scraper based on AI
LlamaIndex is a data framework for your LLM applications
Chat locally using leading open models built by the community, optimized and accelerated by NVIDIA's enterprise-ready inference runtime
Foundation model benchmarking tool. Run any model on Amazon SageMaker and benchmark for performance across instance type and serving stack options.
🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven, and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers, and many other model architectures. Generates text, audio, video, and images, with voice cloning capabilities.
V.I.S.O.R., my in-development assistant
On-device LLM Inference Powered by X-Bit Quantization