GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Focus on prompting and generating
Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞
Stable Diffusion web UI
Run Local LLMs on Any Device. Open-source
Video-based AI memory library. Store millions of text chunks in MP4
Industry leading face manipulation platform
A Simple and Universal Swarm Intelligence Engine
The most powerful and modular diffusion model GUI, api and backend
OCRmyPDF adds an OCR text layer to scanned PDF files
A simple, high-quality voice conversion tool focused on ease of use
Wan2.2: Open and Advanced Large-Scale Video Generative Model
3D reconstruction software
OCR software, free and offline
Robust Speech Recognition via Large-Scale Weak Supervision
Awesome multilingual OCR toolkits based on PaddlePaddle
Code for running inference and finetuning with SAM 3 model
Open-source, high-performance AI model with advanced reasoning
Official Python inference and LoRA trainer package
The most powerful local music generation model
Agentic, Reasoning, and Coding (ARC) foundation models
1 min voice data can also be used to train a good TTS model
NVR with realtime local object detection for IP cameras