Python & command-line tool to gather text on the Web
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Crowdsourcing platform for full text transcription and tagging
A TTS that fits in your CPU (and pocket)
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Qwen3-TTS is an open-source series of TTS models
Cut videos with a text editor
Python bindings for MuPDF's rendering library.
The python library for real-time communication
A Python utility / library to sort imports
Converts text to speech in realtime
Automatic Speech Recognition with Word-level Timestamps
Open source no-code system for text annotation and building of text
High-Quality Voice Cloning TTS for 600+ Languages
Robust Speech Recognition via Large-Scale Weak Supervision
An open-source toolkit for monitoring Language Learning Models (LLMs)
PDF to Markdown with vision models
Official inference repo for FLUX.2 models
A Python toolbox for gaining geometric insights
Ready-to-use OCR with 80+ supported languages
The behavior guidance framework for customer-facing LLM agents
OCR software, free and offline
A pure-python PDF library capable of splitting, merging, cropping
A simple tool for reading in poorly redacted documents
Official MiniMax Model Context Protocol (MCP) server