Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
Contexts Optical Compression
Library for OCR-related tasks powered by Deep Learning
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Speech recognition module for Python
Toolkit for conversational AI
A full spaCy pipeline and models for scientific/biomedical documents
The behavior guidance framework for customer-facing LLM agents
Underthesea - Vietnamese NLP Toolkit
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Han Language Processing
A ranked list of awesome machine learning Python libraries
Persian NLP Toolkit
Training data (data labeling, annotation, workflow) for all data types
kaldi-asr/kaldi is the official location of the Kaldi project
NLP Cloud serves high performance pre-trained or custom models for NER
OCRmyPDF adds an OCR text layer to scanned PDF files
Obsei is a low code AI powered automation tool
Ready-to-use OCR with 80+ supported languages
Chat & pretrained large vision language model
StreamSpeech is a seamless model for offline speech recognition
Conversational voice AI agents
A framework to enable multimodal models to operate a computer
OCR expert VLM powered by Hunyuan's native multimodal architecture