GUI for a Vocal Remover that uses Deep Neural Networks
Real time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Stable Diffusion web UI
Advanced language and coding AI model
Focus on prompting and generating
Run Local LLMs on Any Device. Open-source
Wan2.2: Open and Advanced Large-Scale Video Generative Model
The most powerful and modular diffusion model GUI, api and backend
3D reconstruction software
OCRmyPDF adds an OCR text layer to scanned PDF files
Agentic, Reasoning, and Coding (ARC) foundation models
Code for running inference and finetuning with SAM 3 model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen3 is the large language model series developed by Qwen team
Open-source, high-performance AI model with advanced reasoning
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
A simple, high-quality voice conversion tool focused on ease of use
1 min voice data can also be used to train a good TTS model
Image inpainting tool powered by SOTA AI Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
TensorFlow is an open source library for machine learning
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
NVR with realtime local object detection for IP cameras