A Lightweight Face Recognition and Facial Attribute Analysis
State-of-the-art 2D and 3D Face Analysis Project
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
Speech recognition module for Python
NLP Cloud serves high performance pre-trained or custom models for NER
OCR software, free and offline
Contexts Optical Compression
kaldi-asr/kaldi is the official location of the Kaldi project
A ranked list of awesome machine learning Python libraries
Image polygonal annotation with Python
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Underthesea - Vietnamese NLP Toolkit
A full spaCy pipeline and models for scientific/biomedical documents
A PyTorch-based Speech Toolkit
Toolkit for conversational AI
Open source annotation tool for machine learning practitioners
Han Language Processing
Omnilingual ASR Open-Source Multilingual SpeechRecognition
Image processing in Python
Library for OCR-related tasks powered by Deep Learning
Multilingual Automatic Speech Recognition with word-level timestamps
OCR expert VLM powered by Hunyuan's native multimodal architecture
OCRmyPDF adds an OCR text layer to scanned PDF files
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX