Modern C++ REST Client library
Inference code for CodeLlama models
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
LLM training code for MosaicML foundation models
A gradio web UI for running Large Language Models like LLaMA
An elegent pytorch implement of transformers
Open source alternative to ChatGPT that runs 100% offline
Open-source, high-performance AI model with advanced reasoning
Towards Human-Sounding Speech
Deep learning framework
Run Local LLMs on Any Device. Open-source
A powerful, lighweight and cross-platform C/C++ IDE
Distribute and run LLMs with a single file
Go ahead and axolotl questions
Self-hosted, community-driven, local OpenAI compatible API
A Pythonic framework to simplify AI service building
LLM Frontend for Power Users
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Opiniated RAG for integrating GenAI in your apps
The framework for building scalable agentic applications
Operating LLMs in production
Voxel game engine in C++ with OpenGL
Linux port of FAR v2