Open-source, high-performance AI model with advanced reasoning
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Official Python inference and LoRA trainer package
From Images to High-Fidelity 3D Assets
Agentic, Reasoning, and Coding (ARC) foundation models
The most powerful local music generation model
Advanced language and coding AI model
Awesome multilingual OCR toolkits based on PaddlePaddle
Fast stable diffusion on CPU and AI PC
AlphaFold 3 inference pipeline
A Family of Open Sourced Music Foundation Models
An experimental version of DeepSeek model
Tool for exploring and debugging transformer model behaviors
State-of-the-art TTS model under 25MB
Open-source multi-speaker long-form text-to-speech model
LTX-Video Support for ComfyUI
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Official inference repo for FLUX.1 models
Industrial-level controllable zero-shot text-to-speech system
ChatGPT interface with better UI
State-of-the-art (SoTA) text-to-video pre-trained model
Easy Docker setup for Stable Diffusion with user-friendly UI
Multimodal-Driven Architecture for Customized Video Generation
Generating Immersive, Explorable, and Interactive 3D Worlds