Offline speech recognition API for Android, iOS, Raspberry Pi
Face recognition with deep neural networks
NLP Cloud serves high-performance pre-trained or custom models for NER
A ranked list of awesome machine learning Python libraries
Robust Speech Recognition via Large-Scale Weak Supervision
State-of-the-art 2D and 3D Face Analysis Project
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
OCRmyPDF adds an OCR text layer to scanned PDF files
Image polygonal annotation with Python
Speech-to-text, text-to-speech, and speaker recognition
Awesome multilingual OCR toolkits based on PaddlePaddle
A Lightweight Face Recognition and Facial Attribute Analysis
Open Source Computer Vision Library
Speech recognition module for Python
Library for OCR-related tasks powered by Deep Learning
Ready-to-use OCR with 80+ supported languages
Automatic SQL injection and database takeover tool
Open-Source Python3 tool for recognizing layouts, tables, and math
A high-quality tool for converting PDF to Markdown and JSON
A PyTorch-based Speech Toolkit
An open and fair framework for everyone to build AI agents
Industrial-strength Natural Language Processing (NLP)
Crowdsourcing platform for full text transcription and tagging
Toolkit for conversational AI
A framework to enable multimodal models to operate a computer