Recognition and resolution of numbers, units, date/time, etc.
Open Source OCR Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech-to-text, text-to-speech, and speaker recognition
Robust Speech Recognition via Large-Scale Weak Supervision
Handwritten Text Recognition (HTR) system implemented with TensorFlow
OCR software, free and offline
Contexts Optical Compression
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Speech recognition module for Python
A cross-platform software for text translation and recognition
A pure Javascript Multilingual OCR
An Open-Source Toolkit for General-OCR Research and Applications
Automatic Speech Recognition with Word-level Timestamps
Open-Source Python3 tool for recognizing layouts, tables, and math
Library for OCR-related tasks powered by Deep Learning
A free, open source, and extensible speech-to-text application
Faster Whisper transcription with CTranslate2
Enhances Tesseract OCR output using LLMs (local or API)
Open source semantic search and text analytics for large document sets
Toolkit for conversational AI
Open-source industrial-grade ASR models
OCRmyPDF adds an OCR text layer to scanned PDF files
Open source annotation tool for machine learning practitioners
Underthesea - Vietnamese NLP Toolkit