Recognition and resolution of numbers, units, date/time, etc.
Open Source OCR Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech-to-text, text-to-speech, and speaker recognition
Robust Speech Recognition via Large-Scale Weak Supervision
Contexts Optical Compression
Offline speech recognition API for Android, iOS, Raspberry Pi
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Library for OCR-related tasks powered by Deep Learning
Speech recognition module for Python
A pure JavaScript multilingual OCR
Toolkit for conversational AI
Port of OpenAI's Whisper model in C/C++
A full spaCy pipeline and models for scientific/biomedical documents
Unofficial Go (Golang) bindings for the Hugging Face Inference API
Underthesea - Vietnamese NLP Toolkit
Han Language Processing
A ranked list of awesome machine learning Python libraries
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
A free, open-source, and extensible speech-to-text application
The behavior guidance framework for customer-facing LLM agents
OpenVINO™ Toolkit repository
ITTT is a free tool designed to scan and extract text from images.
Training data (data labeling, annotation, workflow) for all data types
NLP Cloud serves high-performance pre-trained or custom models for NER