Video Object and Interaction Deletion
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Official Python inference and LoRA trainer package
Open-source, high-performance AI model with advanced reasoning
From Images to High-Fidelity 3D Assets
The most powerful local music generation model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Awesome multilingual OCR toolkits based on PaddlePaddle
Agentic, Reasoning, and Coding (ARC) foundation models
Advanced language and coding AI model
Fast stable diffusion on CPU and AI PC
AlphaFold 3 inference pipeline
Official inference repo for FLUX.1 models
An experimental version of DeepSeek model
Generating Immersive, Explorable, and Interactive 3D Worlds
Open-source multi-speaker long-form text-to-speech model
A Family of Open Sourced Music Foundation Models
tiktoken is a fast BPE tokeniser for use with OpenAI's models
State-of-the-art TTS model under 25MB
Towards Real-World Vision-Language Understanding
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Easy Docker setup for Stable Diffusion with user-friendly UI
Controllable & emotion-expressive zero-shot TTS
Industrial-level controllable zero-shot text-to-speech system
State-of-the-art (SoTA) text-to-video pre-trained model