Recognition and resolution of numbers, units, date/time, etc.
Open Source OCR Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech-to-text, text-to-speech, and speaker recognition
Robust Speech Recognition via Large-Scale Weak Supervision
Contexts Optical Compression
A pure Javascript Multilingual OCR
Speech recognition module for Python
Port of OpenAI's Whisper model in C/C++
Toolkit for conversational AI
A full spaCy pipeline and models for scientific/biomedical documents
Unofficial (Golang) Go bindings for the Hugging Face Inference API
The behavior guidance framework for customer-facing LLM agents
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Underthesea - Vietnamese NLP Toolkit
Han Language Processing
A ranked list of awesome machine learning Python libraries
A free, open source, and extensible speech-to-text application
OpenVINO™ Toolkit repository
Olares: An Open-Source Sovereign Cloud OS for Local AI
Persian NLP Toolkit
Training data (data labeling, annotation, workflow) for all data types
ITTT is a Free tool designed to Scan and extract Text from Images.
kaldi-asr/kaldi is the official location of the Kaldi project
A cross-platform software for text translation and recognition