Windows GUI Automation with Python (based on text properties)
Sphinx source parser for Jupyter notebooks
High-quality multi-lingual text-to-speech library by MyShell.ai
File Parser optimised for LLM Ingestion with no loss
OCRmyPDF adds an OCR text layer to scanned PDF files
CLI tool and Python library
Open source plain text editor designed for writing novels
CommonMark-compliant Markdown parser in Rust with ASTs and extensions
Code for running inference and fine-tuning with the SAM 3 model
A robust, efficient, low-latency speech-to-text library
Awesome multilingual OCR toolkits based on PaddlePaddle
Extensions for Python Markdown
Rich is a Python library for rich text and beautiful formatting
Speech-to-text, text-to-speech, and speaker recognition
Comprehensive Gradio WebUI for audio processing
Focus on prompting and generating
Python library and CLI tool to interface with Google Translate
Robust Speech Recognition via Large-Scale Weak Supervision
Offline text-to-speech synthesis for Python
Python implementation of TextRank algorithms
A text-to-speech, speech-to-text and speech-to-speech library
Use Microsoft Edge's online text-to-speech service from Python
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
Python tool for converting files and office documents to Markdown
Speech recognition module for Python