Get up and running with Llama 2 and other large language models
Advanced language and coding AI model
LLM Frontend for Power Users
Port of Facebook's LLaMA model in C/C++
Run Local LLMs on Any Device. Open-source
The all-in-one Desktop & Docker AI application with full RAG and AI
Agentic, Reasoning, and Coding (ARC) foundation models
Open-source, high-performance AI model with advanced reasoning
Kimi K2 is the large language model series developed by Moonshot AI
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen3 is the large language model series developed by the Qwen team
A high-throughput and memory-efficient inference and serving engine
Powerful AI language model (MoE) optimized for efficiency/performance
Desktop app for prototyping and debugging LangGraph applications
Qwen3-Coder is the code version of Qwen3
Self-hosted, community-driven, local OpenAI compatible API
The official repo of Qwen chat & pretrained large language model
Low-code app builder for RAG and multi-agent AI applications
MiniMax M2.1, a SOTA model for real-world dev & agents
Distribute and run LLMs with a single file
Interact with your documents using the power of GPT
Open source LLM engineering platform: LLM Observability, metrics, etc.
Dramatron uses large language models to generate coherent scripts
Python bindings for llama.cpp
⚡ Building applications with LLMs through composability ⚡