C++ IPC Library: A high-performance inter-process communication
Inference code for CodeLlama models
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster
Modern C++ REST Client library
A gradio web UI for running Large Language Models like LLaMA
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
LLM training code for MosaicML foundation models
An elegent pytorch implement of transformers
Open source alternative to ChatGPT that runs 100% offline
Autonomous agents for everyone
Open-source, high-performance AI model with advanced reasoning
Chat with private and local large language models
Towards Human-Sounding Speech
Deep learning framework
Run Local LLMs on Any Device. Open-source
A powerful, lighweight and cross-platform C/C++ IDE
Distribute and run LLMs with a single file
Intended to make Gradle C++ working more comfortable.
Go ahead and axolotl questions
Self-hosted, community-driven, local OpenAI compatible API
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Opiniated RAG for integrating GenAI in your apps
The framework for building scalable agentic applications
LLM Frontend for Power Users