Qwen3-TTS is an open-source series of TTS models
A Family of Open Sourced Music Foundation Models
Wan2.2: Open and Advanced Large-Scale Video Generative Model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
High-Resolution Image Synthesis with Latent Diffusion Models
Official inference repo for FLUX.1 models
Open-source, high-performance AI model with advanced reasoning
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A Systematic Framework for Interactive World Modeling
Tiny vision language model
Powerful AI language model (MoE) optimized for efficiency and performance
DeepSeek Coder: Let the Code Write Itself
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
The most powerful local music generation model
Open Source Speech Language Model
Official inference repo for FLUX.2 models
Agentic, Reasoning, and Coding (ARC) foundation models
Python inference and LoRA trainer package for the LTX-2 audio–video
ChatGPT interface with better UI
Official Python inference and LoRA trainer package
Advanced language and coding AI model
Awesome multilingual OCR toolkits based on PaddlePaddle
Recovering the Visual Space from Any Views
Easy Docker setup for Stable Diffusion with user-friendly UI
Advancing Open-source World Models