AutoML toolkit for automating the machine learning lifecycle
Real-time face swap and one-click video deepfake
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
GUI for a Vocal Remover that uses Deep Neural Networks
State-of-the-art 2D and 3D Face Analysis Project
Robust Speech Recognition via Large-Scale Weak Supervision
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Run Local LLMs on Any Device. Open-source
3D reconstruction software
It's possible for machines to become self-aware.
Focus on prompting and generating
OCRmyPDF adds an OCR text layer to scanned PDF files
Agentic, Reasoning, and Coding (ARC) foundation models
YOLOv5 is the world's most loved vision AI
Image/video AI upscaler app (BSRGAN)
The most powerful and modular diffusion model GUI, API, and backend
RGBD video generation model conditioned on camera input
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Powerful AI language model (MoE) optimized for efficiency/performance
Stable Diffusion web UI
Open-source, high-performance AI model with advanced reasoning
Open-Sora: Democratizing Efficient Video Production for All
NVR with real-time local object detection for IP cameras
A deep learning toolkit for Text-to-Speech, battle-tested in research
Image inpainting tool powered by SOTA AI models