Z80-μLM is a 2-bit quantized language model
A Powerful Native Multimodal Model for Image Generation
From Images to High-Fidelity 3D Assets
Easy Docker setup for Stable Diffusion with user-friendly UI
HY-Motion model for 3D character animation generation
Qwen3-TTS is an open-source series of TTS models
LTX-Video Support for ComfyUI
Official repository for LTX-Video
Multimodal Diffusion with Representation Alignment
Pokee Deep Research Model Open Source Repo
Advanced language and coding AI model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
A Customizable Image-to-Video Model based on HunyuanVideo
Qwen-Image is a powerful image generation foundation model
State-of-the-art TTS model under 25MB
Powerful AI language model (MoE) optimized for efficiency/performance
The official repo of Qwen chat & pretrained large language model
Contexts Optical Compression
Qwen3-Coder is the code version of Qwen3
RGBD video generation model conditioned on camera input
Advancing Open-source World Models
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
GLM-4-Voice | End-to-End Chinese-English Conversational Model
AlphaFold 3 inference pipeline
Lets make video diffusion practical