Effortless data labeling with AI support from Segment Anything
Robust Speech Recognition via Large-Scale Weak Supervision
Awesome multilingual OCR toolkits based on PaddlePaddle
The most powerful local music generation model
AI Fully Automated Short Video Engine
Powerful AI language model (MoE) optimized for efficiency/performance
Oobabooga - The definitive Web UI for local AI, with powerful features
OBLITERATE THE CHAINS THAT BIND YOU
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Official inference repo for FLUX.1 models
Even 1 minute of voice data can be used to train a good TTS model
Let's make video diffusion practical
NVR with real-time local object detection for IP cameras
Agentic, Reasoning, and Coding (ARC) foundation models
Hindsight: Agent Memory That Learns
Comprehensive Gradio WebUI for audio processing
Faster Whisper transcription with CTranslate2
OCR software, free and offline
Native and Compact Structured Latents for 3D Generation
FlashInfer: Kernel Library for LLM Serving
Fast Python collaborative filtering for implicit feedback datasets
A Lightweight Face Recognition and Facial Attribute Analysis Library
Generate audiobooks from e-books, voice cloning & 1107+ languages
Deepfakes Software For All
Kimi Code CLI is your next CLI agent