Provides line-oriented text file editing capabilities
Document (PDF, Word, PPTX, ...) extraction and parsing API
Python bindings for MuPDF's rendering library
Edit PDF files with Nano Banana
OCRmyPDF adds an OCR text layer to scanned PDF files
Open source plain text editor designed for writing novels
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
A remote monitoring & management tool, built with Django, Vue and Go
Comprehensive Gradio WebUI for audio processing
Open-source Python 3 tool for recognizing layouts, tables, and math
Python library and CLI tool to interface with Google Translate
Ready-to-use OCR with 80+ supported languages
TTS with Kokoro and ONNX Runtime
A text-to-speech, speech-to-text and speech-to-speech library
Python binding to the Apache Tika™ REST services
Cut videos with a text editor
EPUB to audiobook converter, optimized for Audiobookshelf
Generate audiobooks from e-books, with voice cloning and support for 1107+ languages
Speech recognition module for Python
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
A Sublime Text 2/3 plugin to see git diffs in the gutter
Compute distance between sequences
Speakr is a personal, self-hosted web application
Claude Code skill implementing Manus-style persistent planning
ComfyUI integration for Microsoft's VibeVoice text-to-speech model