GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
A Simple and Universal Swarm Intelligence Engine
Stable Diffusion web UI
Industry leading face manipulation platform
Focus on prompting and generating
Run Local LLMs on Any Device. Open-source
A simple, high-quality voice conversion tool focused on ease of use
The most powerful and modular diffusion model GUI, api and backend
Awesome multilingual OCR toolkits based on PaddlePaddle
Official Python inference and LoRA trainer package
Deep Research framework, combining language models with tools
Robust Speech Recognition via Large-Scale Weak Supervision
Public repository for Agent Skills
3D reconstruction software
Personal AI, On Personal Devices
OCRmyPDF adds an OCR text layer to scanned PDF files
The agent that grows with you
1 min voice data can also be used to train a good TTS model
TTS with kokoro and onnx runtime
Agentic, Reasoning, and Coding (ARC) foundation models
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Powerful AI language model (MoE) optimized for efficiency/performance
AI video generator optimized for low VRAM and older GPUs use