TTS with Kokoro and ONNX Runtime
Public repository for Agent Skills
A Simple and Universal Swarm Intelligence Engine
Real-time face swap and one-click video deepfake
A simple, high-quality voice conversion tool focused on ease of use
State-of-the-art 2D and 3D Face Analysis Project
The most powerful and modular diffusion model GUI, API and backend
Comprehensive Gradio WebUI for audio processing
Create UIs for your machine learning model in Python in 3 minutes
OCR software, free and offline
Agent Zero AI framework
AI tool that removes hardcoded subtitles and text from videos locally
Deep Research framework, combining language models with tools
AI Fully Automated Short Video Engine
Use Microsoft Edge's online text-to-speech service from Python
OCRmyPDF adds an OCR text layer to scanned PDF files
Run Local LLMs on Any Device. Open-source
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Unified web UI for training and running open models locally
Ready-to-use OCR with 80+ supported languages
Image polygonal annotation with Python
Specification and documentation for Agent Skills
An AI personal assistant for your digital brain
The agent that grows with you
Open-source autonomous AI software engineer