Qwen3-TTS is an open-source series of TTS models
Official inference repo for FLUX.1 models
DeepSeek Coder: Let the Code Write Itself
Revolutionizing Database Interactions with Private LLM Technology
A Family of Open Sourced Music Foundation Models
Text and image to video generation: CogVideoX and CogVideo
High-Resolution Image Synthesis with Latent Diffusion Models
Recovering the Visual Space from Any Views
Easy Docker setup for Stable Diffusion with user-friendly UI
Z80-μLM is a 2-bit quantized language model
A Systematic Framework for Interactive World Modeling
Fast-stable-diffusion + DreamBooth
Advancing Open-source World Models
ChatGPT interface with better UI
Tiny vision language model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Repo for SeedVR2 & SeedVR
Generate Any 3D Scene in Seconds
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Block Diffusion for Ultra-Fast Speculative Decoding
Ling is a MoE LLM provided and open-sourced by InclusionAI
Open Source Speech Language Model
Open-source industrial-grade ASR models
Hunyuan Translation Model Version 1.5
Implementation of "MobileCLIP" CVPR 2024