GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Run Local LLMs on Any Device. Open-source
The most powerful and modular diffusion model GUI, api and backend
Stable Diffusion web UI
Focus on prompting and generating
Industry leading face manipulation platform
OCRmyPDF adds an OCR text layer to scanned PDF files
Wan2.2: Open and Advanced Large-Scale Video Generative Model
3D reconstruction software
Code for running inference and finetuning with SAM 3 model
A simple, high-quality voice conversion tool focused on ease of use
Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful local music generation model
Advanced language and coding AI model
Open-source, high-performance AI model with advanced reasoning
A high-throughput and memory-efficient inference and serving engine
Powerful AI language model (MoE) optimized for efficiency/performance
A Lightweight Face Recognition and Facial Attribute Analysis
OCR software, free and offline
Official Python inference and LoRA trainer package
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Agentic, Reasoning, and Coding (ARC) foundation models
Qwen3 is the large language model series developed by Qwen team