An elegent pytorch implement of transformers
Open source alternative to ChatGPT that runs 100% offline
Autonomous agents for everyone
Open-source, high-performance AI model with advanced reasoning
Towards Human-Sounding Speech
Run Local LLMs on Any Device. Open-source
Distribute and run LLMs with a single file
Self-hosted, community-driven, local OpenAI compatible API
Go ahead and axolotl questions
LLM Frontend for Power Users
A Pythonic framework to simplify AI service building
Operating LLMs in production
Easiest and laziest way for building multi-agent LLMs applications
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Evaluate and compare LLM outputs, catch regressions, improve prompts
SGLang is a fast serving framework for large language models
Private chat with local GPT with document, images, video, etc.
Pruna is a model optimization framework built for developers
PyTorch library of curated Transformer models and their components
Agents-Flex is an elegant LLM Application Framework like LangChain
Low code tool to rapidly build and coordinate multi-agent teams
Speech-AI-Forge is a project developed around TTS generation model
An MCP client for Neovim that seamlessly integrates MCP servers
A Conversational Speech Generation Model