A Family of Open Sourced Music Foundation Models
Official inference repo for FLUX.2 models
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
High-Resolution Image Synthesis with Latent Diffusion Models
Python inference and LoRA trainer package for the LTX-2 audio–video
gpt-oss-120b and gpt-oss-20b are two open-weight language models
From Images to High-Fidelity 3D Assets
Reference PyTorch implementation and models for DINOv3
Official DeiT repository
Long-form streaming TTS system for multi-speaker dialogue generation
CogView4, CogView3-Plus and CogView3(ECCV 2024)
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
A SOTA open-source image editing model
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Industrial-level controllable zero-shot text-to-speech system
VMZ: Model Zoo for Video Modeling
Generate Any 3D Scene in Seconds
The official PyTorch implementation of Google's Gemma models
Inference script for Oasis 500M
Fast and Universal 3D reconstruction model for versatile tasks
High-Fidelity and Controllable Generation of Textured 3D Assets
A Pragmatic VLA Foundation Model
Multimodal embedding and reranking models built on Qwen3-VL
High-resolution models for human tasks