Visual Causal Flow
Research code artifacts for Code World Model (CWM)
Contexts Optical Compression
A Customizable Image-to-Video Model based on HunyuanVideo
Release for Improved Denoising Diffusion Probabilistic Models
Lets make video diffusion practical
Reference PyTorch implementation and models for DINOv3
Qwen3-TTS is an open-source series of TTS models
Block Diffusion for Ultra-Fast Speculative Decoding
CLIP, Predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
Personalize Any Characters with a Scalable Diffusion Transformer
Official inference repo for FLUX.1 models
Official implementation of DreamCraft3D
DeepSeek Coder: Let the Code Write Itself
Python SDK for Claude Agent
Official DeiT repository
Code for running inference with the SAM 3D Body Model 3DB
HY-Motion model for 3D character animation generation
Large Multimodal Models for Video Understanding and Editing
A Powerful Native Multimodal Model for Image Generation
Tongyi Deep Research, the Leading Open-source Deep Research Agent
PyTorch code and models for the DINOv2 self-supervised learning
Multimodal Diffusion with Representation Alignment
Generating Immersive, Explorable, and Interactive 3D Worlds