Browse the web directly from Cursor and similar tools
LinkedIn Automation Tool
GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Industry leading face manipulation platform
A Simple and Universal Swarm Intelligence Engine
Stable Diffusion web UI
Focus on prompting and generating
Comprehensive Gradio WebUI for audio processing
The most powerful and modular diffusion model GUI, API, and backend
Personal AI, On Personal Devices
Run Local LLMs on Any Device. Open-source
Official Python inference and LoRA trainer package
A simple, high-quality voice conversion tool focused on ease of use
Deep Research framework, combining language models with tools
Public repository for Agent Skills
The agent that grows with you
OCRmyPDF adds an OCR text layer to scanned PDF files
Robust Speech Recognition via Large-Scale Weak Supervision
3D reconstruction software
The most powerful local music generation model
TTS with Kokoro and ONNX Runtime
Powerful AI language model (MoE) optimized for efficiency/performance
Awesome multilingual OCR toolkits based on PaddlePaddle