GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Focus on prompting and generating
Run Local LLMs on Any Device. Open-source
Agentic, Reasoning, and Coding (ARC) foundation models
The most powerful and modular diffusion model GUI, api and backend
Stable Diffusion web UI
Wan2.2: Open and Advanced Large-Scale Video Generative Model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
The most powerful local music generation model
3D reconstruction software
OCRmyPDF adds an OCR text layer to scanned PDF files
A simple, high-quality voice conversion tool focused on ease of use
YOLOv5 is the world's most loved vision AI
Open-source, high-performance AI model with advanced reasoning
Advanced language and coding AI model
Robust Speech Recognition via Large-Scale Weak Supervision
Image inpainting tool powered by SOTA AI Model
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OCR software, free and offline
Core ML tools contain supporting tools for Core ML model conversion
1 min voice data can also be used to train a good TTS model
Synchronized Translation for Videos
TTS with kokoro and onnx runtime