Recognition and resolution of numbers, units, date/time, etc.
Open Source OCR Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech-to-text, text-to-speech, and speaker recognition
Robust Speech Recognition via Large-Scale Weak Supervision
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Contexts Optical Compression
OCR software, free and offline
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A pure Javascript Multilingual OCR
Speech recognition module for Python
A cross-platform software for text translation and recognition
A free, open source, and extensible speech-to-text application
Open-Source Python3 tool for recognizing layouts, tables, and math
Open source semantic search and text analytics for large document sets
Automatic Speech Recognition with Word-level Timestamps
Cross-platform AI language practice app
Underthesea - Vietnamese NLP Toolkit
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Audio foundation model excelling in audio understanding
Library for OCR-related tasks powered by Deep Learning
Open-source industrial-grade ASR models
Open source annotation tool for machine learning practitioners
Faster Whisper transcription with CTranslate2
Voice Recognition to Text Tool