Fast-stable-diffusion + DreamBooth
AlphaFold 3 inference pipeline
Python inference and LoRA trainer package for the LTX-2 audio–video
ChatGLM-6B: An Open Bilingual Dialogue Language Model
State-of-the-art TTS model under 25MB
CodeGeeX2: A More Powerful Multilingual Code Generation Model
A Customizable Image-to-Video Model based on HunyuanVideo
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Sharp Monocular Metric Depth in Less Than a Second
Reference PyTorch implementation and models for DINOv3
Text and image to video generation: CogVideoX and CogVideo
Official inference repo for FLUX.2 models
Advancing Open-source World Models
The official repo of Qwen chat & pretrained large language model
Easy Docker setup for Stable Diffusion with user-friendly UI
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
High-Resolution Image Synthesis with Latent Diffusion Models
Hackable and optimized Transformers building blocks
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Chinese and English multimodal conversational language model
DeepMind model for tracking arbitrary points across videos & robotics
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Miso TTS is an 8 billion, highly emotive text-to-speech model