AutoML toolkit for automating the machine learning lifecycle
Real-time face swap and one-click video deepfake
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
GUI for a Vocal Remover that uses Deep Neural Networks
It's possible for machines to become self-aware.
State-of-the-art 2D and 3D Face Analysis Project
Robust Speech Recognition via Large-Scale Weak Supervision
Wan2.2: Open and Advanced Large-Scale Video Generative Model
3D reconstruction software
Run Local LLMs on Any Device. Open-source
Focus on prompting and generating
OCRmyPDF adds an OCR text layer to scanned PDF files
Agentic, Reasoning, and Coding (ARC) foundation models
YOLOv5 is the world's most loved vision AI
The most powerful and modular diffusion model GUI, API, and backend
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
RGBD video generation model conditioned on camera input
Powerful AI language model (MoE) optimized for efficiency/performance
Stable Diffusion web UI
Open-source, high-performance AI model with advanced reasoning
Open-Sora: Democratizing Efficient Video Production for All
NVR with real-time local object detection for IP cameras
A deep learning toolkit for Text-to-Speech, battle-tested in research
Image inpainting tool powered by a SOTA AI model
Comprehensive Gradio WebUI for audio processing