The most powerful local music generation model
A Family of Open Sourced Music Foundation Models
Official Python inference and LoRA trainer package
Multimodal Diffusion with Representation Alignment
Wan2.1: Open and Advanced Large-Scale Video Generative Model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Wan2.2: Open and Advanced Large-Scale Video Generative Model
A Unified Framework for Text-to-3D and Image-to-3D Generation
From Images to High-Fidelity 3D Assets
A Powerful Native Multimodal Model for Image Generation
Fast stable diffusion on CPU and AI PC
CodeGeeX2: A More Powerful Multilingual Code Generation Model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
RGBD video generation model conditioned on camera input
Qwen2.5-VL is the multimodal large language model series
Official inference repo for FLUX.2 models
Generating Immersive, Explorable, and Interactive 3D Worlds
Official implementation of DreamCraft3D
A Customizable Image-to-Video Model based on HunyuanVideo
Foundation model for image generation
State-of-the-art (SoTA) text-to-video pre-trained model
Unified Multimodal Understanding and Generation Models
Long-form streaming TTS system for multi-speaker dialogue generation
Advancing Open-source World Models
Inference script for Oasis 500M