Personal AI, On Personal Devices
Improve your Baduk skills by training with KataGo
The most powerful local music generation model
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
The agent that grows with you
Advanced language and coding AI model
Let's make video diffusion practical
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Deep Research framework, combining language models with tools
Instant voice cloning by MIT and MyShell. Audio foundation model
Automatic Speech Recognition with Word-level Timestamps
1 min of voice data can also be used to train a good TTS model
AI Fully Automated Short Video Engine
Official inference repo for FLUX.1 models
Powerful AI language model (MoE) optimized for efficiency/performance
Public repository for Agent Skills
NVR with realtime local object detection for IP cameras
OBLITERATE THE CHAINS THAT BIND YOU
AI video generator optimized for low VRAM and older GPUs
Native and Compact Structured Latents for 3D Generation
Generate audiobooks from e-books
A lightweight audio-to-MIDI converter with pitch bend detection
Offline Text To Speech synthesis for Python