A generative speech model for daily dialogue
OCR software, free and offline
Offline inference engine for art, real-time voice conversations
Qwen3-TTS is an open-source series of TTS models
Extensions for Python Markdown
Open source healthcare AI
Cut videos with a text editor
Speech recognition module for Python
Video-based AI memory library. Store millions of text chunks in MP4
EPUB to audiobook converter, optimized for Audiobookshelf
A TTS that fits in your CPU (and pocket)
Generate audiobooks from EPUBs, PDFs and text with captions
A simple native web interface that uses ChatTTS to synthesize text
A simple tool for reading in poorly redacted documents
Robust Speech Recognition via Large-Scale Weak Supervision
A high-quality rapid TTS voice cloning model
The simplest, fastest repository for training/finetuning models
Edit PDF files with Nano Banana
Official MiniMax Model Context Protocol (MCP) server
The Ren'Py Visual Novel Engine
Python bindings for MuPDF's rendering library.
A Family of Open Sourced Music Foundation Models
Automatic Speech Recognition with Word-level Timestamps
Python module for parsing semi-structured text into python tables
State-of-the-art TTS model under 25MB