Awesome multilingual OCR toolkits based on PaddlePaddle
1 min voice data can also be used to train a good TTS model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Improve your Baduk skills by training with KataGo
Fast stable diffusion on CPU and AI PC
An AI personal assistant for your digital brain
AI tool that removes hardcoded subtitles and text from videos locally
A high-throughput and memory-efficient inference and serving engine
Python SDK for Claude Agent
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Ready-to-use OCR with 80+ supported languages
Open-source AI agent framework
Oobabooga - The definitive Web UI for local AI, with powerful features
NVR with realtime local object detection for IP cameras
A lightweight audio-to-MIDI converter with pitch bend detection
Open-source multi-speaker long-form text-to-speech model
Synchronized Translation for Videos
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Qwen3 is the large language model series developed by Qwen team
Python-based neural networks API
Generate audiobooks from e-books, voice cloning & 1107+ languages
Generate short videos with one click using AI LLM
A Family of Open Sourced Music Foundation Models
A community-supported supercharged version of paperless
Interact with your documents using the power of GPT