Get up and running with Llama 2 and other large language models
Advanced language and coding AI model
LLM Frontend for Power Users
Run Local LLMs on Any Device. Open-source
Port of Facebook's LLaMA model in C/C++
Agentic, Reasoning, and Coding (ARC) foundation models
The all-in-one Desktop & Docker AI application with full RAG and AI
Kimi K2 is the large language model series developed by Moonshot AI
Qwen3 is the large language model series developed by Qwen team
Open-source, high-performance AI model with advanced reasoning
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Powerful AI language model (MoE) optimized for efficiency/performance
A high-throughput and memory-efficient inference and serving engine
Desktop app for prototyping and debugging LangGraph applications
Self-hosted, community-driven, local OpenAI compatible API
Qwen3-Coder is the code version of Qwen3
Distribute and run LLMs with a single file
MiniMax M2.1, a SOTA model for real-world dev & agents.
Dramatron uses large language models to generate coherent scripts
Interact with your documents using the power of GPT
Low-code app builder for RAG and multi-agent AI applications
One API for plugins and datasets, one interface for prompt engineering
SimpleMem: Efficient Lifelong Memory for LLM Agents
The official repo of Qwen chat & pretrained large language model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)