GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Focus on prompting and generating
Video-based AI memory library. Store millions of text chunks in MP4
Stable Diffusion web UI
Run Local LLMs on Any Device. Open-source
Industry leading face manipulation platform
The most powerful and modular diffusion model GUI, api and backend
Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞
A Simple and Universal Swarm Intelligence Engine
Deep Research framework, combining language models with tools
OCRmyPDF adds an OCR text layer to scanned PDF files
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Code for running inference and finetuning with SAM 3 model
Robust Speech Recognition via Large-Scale Weak Supervision
A simple, high-quality voice conversion tool focused on ease of use
Open-source, high-performance AI model with advanced reasoning
Official Python inference and LoRA trainer package
Awesome multilingual OCR toolkits based on PaddlePaddle
Agentic, Reasoning, and Coding (ARC) foundation models
1 min voice data can also be used to train a good TTS model
The most powerful local music generation model
A Lightweight Face Recognition and Facial Attribute Analysis
Public repository for Agent Skills