An elegant PyTorch implementation of transformers
Open source alternative to ChatGPT that runs 100% offline
Open-source, high-performance AI model with advanced reasoning
Chat with private and local large language models
Towards Human-Sounding Speech
Run Local LLMs on Any Device. Open-source
Distribute and run LLMs with a single file
Go ahead and axolotl questions
Self-hosted, community-driven, local OpenAI compatible API
LLM Frontend for Power Users
Run any Llama 2 model locally with a Gradio UI on GPU or CPU from anywhere
The easiest and laziest way to build multi-agent LLM applications
A Pythonic framework to simplify AI service building
SGLang is a fast serving framework for large language models
Operating LLMs in production
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser
Evaluate and compare LLM outputs, catch regressions, improve prompts
Pruna is a model optimization framework built for developers
PyTorch library of curated Transformer models and their components
Private chat with a local GPT over documents, images, video, etc.
Agents-Flex is an elegant LLM Application Framework like LangChain
Speech-AI-Forge is a project developed around TTS generation models
An MCP client for Neovim that seamlessly integrates MCP servers
Project showcasing Llama 3.3 70B HTML codegen abilities
A Conversational Speech Generation Model