OCRmyPDF adds an OCR text layer to scanned PDF files
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Open-Source Python3 tool for recognizing layouts, tables, and math
Focus on prompting and generating
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
FastAPI framework, high performance, easy to learn, fast to code
Vim Win32 Installer
Use Microsoft Edge's online text-to-speech service from Python
Python library and CLI tool to interface with Google Translate
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
Offline Text To Speech synthesis for python
Sphinx source parser for Jupyter notebooks
ASCII art library for Python
Contexts Optical Compression
Awesome multilingual OCR toolkits based on PaddlePaddle
File Parser optimised for LLM Ingestion with no loss
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Official inference repo for FLUX.1 models
A robust, efficient, low-latency speech-to-text library
Rich is a Python library for rich text and beautiful formatting
Open source annotation tool for machine learning practitioners
Windows GUI Automation with Python (based on text properties)
Code for running inference and finetuning with SAM 3 model
CLI tool and python library
Extensions for Python Markdown