An elegent pytorch implement of transformers
Open source alternative to ChatGPT that runs 100% offline
Open-source, high-performance AI model with advanced reasoning
Towards Human-Sounding Speech
Run Local LLMs on Any Device. Open-source
Distribute and run LLMs with a single file
Self-hosted, community-driven, local OpenAI compatible API
LLM Frontend for Power Users
Go ahead and axolotl questions
Easiest and laziest way for building multi-agent LLMs applications
A Pythonic framework to simplify AI service building
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
SGLang is a fast serving framework for large language models
Operating LLMs in production
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Evaluate and compare LLM outputs, catch regressions, improve prompts
Private chat with local GPT with document, images, video, etc.
Pruna is a model optimization framework built for developers
PyTorch library of curated Transformer models and their components
Agents-Flex is an elegant LLM Application Framework like LangChain
Speech-AI-Forge is a project developed around TTS generation model
An MCP client for Neovim that seamlessly integrates MCP servers
GPT4V-level open-source multi-modal model based on Llama3-8B
Project showcasing Llama 3.3 70B HTML codegen abilities
A Conversational Speech Generation Model