Robust Speech Recognition via Large-Scale Weak Supervision
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
The most powerful local music generation model
Advanced language and coding AI model
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Improve your Baduk skills by training with KataGo
Python inference and LoRA trainer package for the LTX-2 audio–video
1 min voice data can also be used to train a good TTS model
AI agent harness for AI coding agents
Image inpainting tool powered by SOTA AI Model
Agentic, Reasoning, and Coding (ARC) foundation models
A modular, primitive-first, python-first PyTorch library
Python tool for converting files and office documents to Markdown
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Open-source, high-performance AI model with advanced reasoning
A high-throughput and memory-efficient inference and serving engine
OCR software, free and offline
A Lightweight Face Recognition and Facial Attribute Analysis
A lightweight audio-to-MIDI converter with pitch bend detection
Fast stable diffusion on CPU and AI PC
World's first open-source, agentic video production system
Official inference repo for FLUX.2 models
Oobabooga - The definitive Web UI for local AI, with powerful features
Code for running inference and finetuning with SAM 3 model
Ready-to-use OCR with 80+ supported languages