Datasets, transforms and models specific to Computer Vision
A Simple and Universal Swarm Intelligence Engine
State-of-the-art 2D and 3D Face Analysis Project
A simple, high-quality voice conversion tool focused on ease of use
Industry leading face manipulation platform
Agent harness for AI coding agents
Official Python inference and LoRA trainer package
AI video generator optimized for low-VRAM and older GPUs
TTS with Kokoro and ONNX Runtime
Run Local LLMs on Any Device. Open-source
The most powerful and modular diffusion model GUI, API and backend
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Awesome multilingual OCR toolkits based on PaddlePaddle
As little as 1 minute of voice data can be used to train a good TTS model
3D reconstruction software
Wan2.2: Open and Advanced Large-Scale Video Generative Model
The most powerful local music generation model
Open-source, high-performance AI model with advanced reasoning
Claude Code skill for generating production-quality SVG+PNG technical
From Images to High-Fidelity 3D Assets
Improve your Baduk skills by training with KataGo
Generate short videos with one click using an LLM
InvokeAI is a leading creative engine for Stable Diffusion models
Advanced language and coding AI model
Generating Immersive, Explorable, and Interactive 3D Worlds