Python bindings for llama.cpp
The official repo of Qwen chat & pretrained large language model
DeepSeek Coder: Let the Code Write Itself
Capable of understanding text, audio, vision, video
State-of-the-art (SoTA) text-to-video pre-trained model
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Qwen3-TTS is an open-source series of TTS models
Open-source multi-speaker long-form text-to-speech model
Qwen2.5-VL is the multimodal large language model series
A SoTA open-source image editing model
Open Source Speech Language Model
Ling is a MoE LLM provided and open-sourced by InclusionAI
Large-language-model & vision-language-model based on Linear Attention
Multi-modal large language model designed for audio understanding
Agentic, Reasoning, and Coding (ARC) foundation models
Official inference repo for FLUX.1 models
Contexts Optical Compression
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A theoretical reconstruction of the Claude Mythos architecture
Ultra-Efficient LLMs on End Device
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
OCR expert VLM powered by Hunyuan's native multimodal architecture
Open-source framework for intelligent speech interaction
Long-form streaming TTS system for multi-speaker dialogue generation
General-purpose image editing model that delivers high-fidelity