Modern C++ REST Client library
Inference code for CodeLlama models
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster
A gradio web UI for running Large Language Models like LLaMA
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
LLM training code for MosaicML foundation models
An elegent pytorch implement of transformers
Open source alternative to ChatGPT that runs 100% offline
Open-source, high-performance AI model with advanced reasoning
Chat with private and local large language models
Towards Human-Sounding Speech
Run Local LLMs on Any Device. Open-source
Deep learning framework
Distribute and run LLMs with a single file
Self-hosted, community-driven, local OpenAI compatible API
LLM Frontend for Power Users
Go ahead and axolotl questions
Linux port of FAR v2
Opiniated RAG for integrating GenAI in your apps
A Pythonic framework to simplify AI service building
Operating LLMs in production
The framework for building scalable agentic applications
Easiest and laziest way for building multi-agent LLMs applications
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere