GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
Industry leading face manipulation platform
State-of-the-art 2D and 3D Face Analysis Project
Comprehensive Gradio WebUI for audio processing
Stable Diffusion web UI
Focus on prompting and generating
A Simple and Universal Swarm Intelligence Engine
Run Local LLMs on Any Device. Open-source
The most powerful and modular diffusion model GUI, api and backend
Personal AI, On Personal Devices
A simple, high-quality voice conversion tool focused on ease of use
Deep Research framework, combining language models with tools
The agent that grows with you
TTS with kokoro and onnx runtime
Official Python inference and LoRA trainer package
The most powerful local music generation model
OCRmyPDF adds an OCR text layer to scanned PDF files
Public repository for Agent Skills
Awesome multilingual OCR toolkits based on PaddlePaddle
3D reconstruction software
Robust Speech Recognition via Large-Scale Weak Supervision
1 min voice data can also be used to train a good TTS model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Powerful AI language model (MoE) optimized for efficiency/performance