Get up and running with Llama 2 and other large language models
Advanced language and coding AI model
LLM Frontend for Power Users
Run Local LLMs on Any Device. Open-source
Port of Facebook's LLaMA model in C/C++
Agentic, Reasoning, and Coding (ARC) foundation models
The all-in-one Desktop & Docker AI application with full RAG and AI
Kimi K2 is the large language model series developed by Moonshot AI
Open-source, high-performance AI model with advanced reasoning
Qwen3 is the large language model series developed by the Qwen team
Powerful AI language model (MoE) optimized for efficiency/performance
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A high-throughput and memory-efficient inference and serving engine
Desktop app for prototyping and debugging LangGraph applications
Self-hosted, community-driven, local OpenAI compatible API
Qwen3-Coder is the code version of Qwen3
Distribute and run LLMs with a single file
MiniMax M2.1, a SOTA model for real-world dev & agents
Dramatron uses large language models to generate coherent scripts
Interact with your documents using the power of GPT
Low-code app builder for RAG and multi-agent AI applications
One API for plugins and datasets, one interface for prompt engineering
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Open source LLM engineering platform: LLM Observability, metrics, etc.
SimpleMem: Efficient Lifelong Memory for LLM Agents