AlphaFold 3 inference pipeline
Python inference and LoRA trainer package for the LTX-2 audio–video
ChatGLM-6B: An Open Bilingual Dialogue Language Model
State-of-the-art TTS model under 25MB
Wan2.2: Open and Advanced Large-Scale Video Generative Model
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A Customizable Image-to-Video Model based on HunyuanVideo
Official inference repo for FLUX.2 models
Reference PyTorch implementation and models for DINOv3
Sharp Monocular Metric Depth in Less Than a Second
The official repo of Qwen chat & pretrained large language model
High-Resolution Image Synthesis with Latent Diffusion Models
Qwen-Image is a powerful image generation foundation model
RGBD video generation model conditioned on camera input
gpt-oss-120b and gpt-oss-20b are two open-weight language models
DeepMind model for tracking arbitrary points across videos & robotics
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
FAIR Sequence Modeling Toolkit 2
Hackable and optimized Transformers building blocks
Memory-efficient and performant finetuning of Mistral's models
Diffusion Transformer with Fine-Grained Chinese Understanding
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project