Recognition and resolution of numbers, units, date/time, etc.
Open Source OCR Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech-to-text, text-to-speech, and speaker recognition
Robust Speech Recognition via Large-Scale Weak Supervision
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Contexts Optical Compression
OCR software, free and offline
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
A pure JavaScript multilingual OCR
Speech recognition module for Python
Cross-platform software for text translation and recognition
A free, open source, and extensible speech-to-text application
Open-source Python 3 tool for recognizing layouts, tables, and math
Open source semantic search and text analytics for large document sets
Library for OCR-related tasks powered by Deep Learning
Automatic Speech Recognition with Word-level Timestamps
Audio foundation model excelling in audio understanding
Underthesea - Vietnamese NLP Toolkit
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Cross-platform AI language practice app
Voice Recognition to Text Tool
Open-source industrial-grade ASR models
Run local LLMs like Llama, DeepSeek, Kokoro, etc. inside your browser
Open source annotation tool for machine learning practitioners