Python bindings for llama.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
DeepSeek Coder: Let the Code Write Itself
The official repo of Qwen chat & pretrained large language model
Capable of understanding text, audio, vision, video
From Vibe Coding to Agentic Engineering
State-of-the-art (SoTA) text-to-video pre-trained model
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Qwen3-TTS is an open-source series of TTS models
A SOTA open-source image editing model
Open-source multi-speaker long-form text-to-speech model
Qwen2.5-VL is the multimodal large language model series
Open Source Speech Language Model
Ling is a MoE LLM provided and open-sourced by InclusionAI
Multi-modal large language model designed for audio understanding
Large-language-model & vision-language-model based on Linear Attention
Agentic, Reasoning, and Coding (ARC) foundation models
Official inference repo for FLUX.1 models
Qwen3-omni is a natively end-to-end, omni-modal LLM
Contexts Optical Compression
Ultra-Efficient LLMs on End Device
A theoretical reconstruction of the Claude Mythos architecture
Multimodal model achieving SOTA performance
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
OCR expert VLM powered by Hunyuan's native multimodal architecture