Python bindings for llama.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
DeepSeek Coder: Let the Code Write Itself
From Vibe Coding to Agentic Engineering
Capable of understanding text, audio, vision, video
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Open-source multi-speaker long-form text-to-speech model
Qwen2.5-VL is the multimodal large language model series
Qwen3-TTS is an open-source series of TTS models
Open Source Speech Language Model
Ling is a MoE LLM provided and open-sourced by InclusionAI
Agentic, Reasoning, and Coding (ARC) foundation models
Official inference repo for FLUX.1 models
Large language model & vision-language model based on Linear Attention
Qwen3-Omni is a natively end-to-end, omni-modal LLM
Contexts Optical Compression
A theoretical reconstruction of the Claude Mythos architecture
Ultra-Efficient LLMs on End Device
Multimodal model achieving SOTA performance
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Long-form streaming TTS system for multi-speaker dialogue generation
General-purpose image editing model that delivers high-fidelity
Generate Any 3D Scene in Seconds
Controllable & emotion-expressive zero-shot TTS
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning