Deep Research framework, combining language models with tools
Image inpainting tool powered by a state-of-the-art AI model
Powerful AI language model (MoE) optimized for efficiency/performance
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Official Python inference and LoRA trainer package
Robust Speech Recognition via Large-Scale Weak Supervision
3D reconstruction software
As little as 1 minute of voice data can be used to train a good TTS model
OCRmyPDF adds an OCR text layer to scanned PDF files
Open-source, high-performance AI model with advanced reasoning
TTS with Kokoro and ONNX Runtime
The most powerful local music generation model
OCR software, free and offline
AI tool that removes hardcoded subtitles and text from videos locally
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A simple, high-quality voice conversion tool focused on ease of use
Awesome multilingual OCR toolkits based on PaddlePaddle
A Lightweight Face Recognition and Facial Attribute Analysis Library
Official inference repo for FLUX.2 models
Synchronized Translation for Videos
Qwen3-TTS is an open-source series of TTS models
Agentic, Reasoning, and Coding (ARC) foundation models
Improve your Baduk skills by training with KataGo
YOLOv5 is the world's most loved vision AI
NVR with realtime local object detection for IP cameras