GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Video-based AI memory library. Store millions of text chunks in MP4
Focus on prompting and generating
Stable Diffusion web UI
A Lightweight Face Recognition and Facial Attribute Analysis
Industry leading face manipulation platform
The most powerful and modular diffusion model GUI, api and backend
Deep Research framework, combining language models with tools
Run Local LLMs on Any Device. Open-source
A Simple and Universal Swarm Intelligence Engine
OCRmyPDF adds an OCR text layer to scanned PDF files
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Robust Speech Recognition via Large-Scale Weak Supervision
A simple, high-quality voice conversion tool focused on ease of use
Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞
Code for running inference and finetuning with SAM 3 model
Official Python inference and LoRA trainer package
Awesome multilingual OCR toolkits based on PaddlePaddle
1 min voice data can also be used to train a good TTS model
Agentic, Reasoning, and Coding (ARC) foundation models
Open-source, high-performance AI model with advanced reasoning
The most powerful local music generation model
Wan2.1: Open and Advanced Large-Scale Video Generative Model