C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Open source alternative to ChatGPT that runs 100% offline
Autonomous agents for everyone
Open-source, high-performance AI model with advanced reasoning
Chat with private and local large language models
Towards Human-Sounding Speech
Run Local LLMs on Any Device. Open-source
Distribute and run LLMs with a single file
Go ahead and axolotl questions
LLM Frontend for Power Users
Self-hosted, community-driven, local OpenAI compatible API
Easiest and laziest way to build multi-agent LLM applications
A Pythonic framework to simplify AI service building
SGLang is a fast serving framework for large language models
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser
Evaluate and compare LLM outputs, catch regressions, improve prompts
Operating LLMs in production
Private chat with local GPT with documents, images, video, etc.
Pruna is a model optimization framework built for developers
PyTorch library of curated Transformer models and their components
Agents-Flex is an elegant LLM Application Framework like LangChain
Speech-AI-Forge is a project developed around TTS generation models
Low-code tool to rapidly build and coordinate multi-agent teams
An MCP client for Neovim that seamlessly integrates MCP servers