High-Resolution Image Synthesis with Latent Diffusion Models
Visual Causal Flow
Research code artifacts for Code World Model (CWM)
Contexts Optical Compression
Block Diffusion for Ultra-Fast Speculative Decoding
CLIP, Predict the most relevant text snippet given an image
Release for Improved Denoising Diffusion Probabilistic Models
Qwen3-TTS is an open-source series of TTS models
Lets make video diffusion practical
Reference PyTorch implementation and models for DINOv3
Official implementation of DreamCraft3D
A Customizable Image-to-Video Model based on HunyuanVideo
Ling is a MoE LLM provided and open-sourced by InclusionAI
Personalize Any Characters with a Scalable Diffusion Transformer
Official DeiT repository
HY-Motion model for 3D character animation generation
Code for running inference with the SAM 3D Body Model 3DB
DeepSeek Coder: Let the Code Write Itself
A Powerful Native Multimodal Model for Image Generation
Long-form streaming TTS system for multi-speaker dialogue generation
Large Multimodal Models for Video Understanding and Editing
Official inference repo for FLUX.1 models
Python SDK for Claude Agent
Generating Immersive, Explorable, and Interactive 3D Worlds
Multimodal Diffusion with Representation Alignment