DeepSeek Coder: Let the Code Write Itself
Reference PyTorch implementation and models for DINOv3
The official repo of the Qwen chat and pretrained large language models
Generate Any 3D Scene in Seconds
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
A Customizable Image-to-Video Model based on HunyuanVideo
Qwen-Image is a powerful image generation foundation model
The official PyTorch implementation of Google's Gemma models
Multimodal Diffusion with Representation Alignment
Pushing the Limits of Mathematical Reasoning in Open Language Models
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Foundation Models for Time Series
Hackable and optimized Transformers building blocks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
High-Resolution Image Synthesis with Latent Diffusion Models
Qwen3-Coder is the code version of Qwen3
Towards Real-World Vision-Language Understanding
Release for Improved Denoising Diffusion Probabilistic Models
A Powerful Native Multimodal Model for Image Generation
Uncommon Objects in 3D dataset
Fast and Universal 3D reconstruction model for versatile tasks
Let's make video diffusion practical
VMZ: Model Zoo for Video Modeling
CLIP: predict the most relevant text snippet given an image
AlphaFold 3 inference pipeline