World's first open-source, agentic video production system
Stable Diffusion web UI
Powerful mixture-of-experts (MoE) AI language model optimized for efficiency and performance
AI tool that removes hardcoded subtitles and text from videos locally
OCR software, free and offline
OCRmyPDF adds an OCR text layer to scanned PDF files
Fast and memory-efficient exact attention
Python tool for converting files and office documents to Markdown
Wan2.2: Open and Advanced Large-Scale Video Generative Model
A simple, high-quality voice conversion tool focused on ease of use
Open-source, high-performance AI model with advanced reasoning
Effortless data labeling with AI support from Segment Anything
Official Python inference and LoRA trainer package
Deep Research framework, combining language models with tools
Robust Speech Recognition via Large-Scale Weak Supervision
Improve your Baduk skills by training with KataGo
3D reconstruction software
Official inference repo for FLUX.1 models
Comprehensive Gradio WebUI for audio processing
NVR with realtime local object detection for IP cameras
Wan2.1: Open and Advanced Large-Scale Video Generative Model
The most powerful local music generation model
OBLITERATE THE CHAINS THAT BIND YOU
Native and Compact Structured Latents for 3D Generation
Agentic, Reasoning, and Coding (ARC) foundation models