Recognition and resolution of numbers, units, date/time, etc.
Open Source OCR Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech-to-text, text-to-speech, and speaker recognition
Robust Speech Recognition via Large-Scale Weak Supervision
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Contexts Optical Compression
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OCR software, free and offline
Speech recognition module for Python
A pure Javascript Multilingual OCR
A cross-platform software for text translation and recognition
A free, open source, and extensible speech-to-text application
Open source semantic search and text analytics for large document sets
Library for OCR-related tasks powered by Deep Learning
Cross-platform AI language practice app
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Automatic Speech Recognition with Word-level Timestamps
Audio foundation model excelling in audio understanding
Voice Recognition to Text Tool
Open-source industrial-grade ASR models
Open source annotation tool for machine learning practitioners
Underthesea - Vietnamese NLP Toolkit
OCRmyPDF adds an OCR text layer to scanned PDF files
Toolkit for conversational AI