Speech recognition for your site
Open Source Computer Vision Library
A cross-platform software for text translation and recognition
Open source semantic search and text analytics for large document sets
Interactive video and image annotation tool for computer vision
Build your own AI friend
A free, open source, and extensible speech-to-text application
Multilingual Automatic Speech Recognition with word-level timestamps
Cross-platform AI language practice app
Voice Recognition to Text Tool
Image polygonal annotation with Python
Enhances Tesseract OCR output using LLMs (local or API)
Underthesea - Vietnamese NLP Toolkit
Open source AI VTuber platform with voice chat and Live2D avatars
Open-Source AI Camera. Empower any camera/CCTV
Recognition and resolution of numbers, units, date/time, etc.
A full spaCy pipeline and models for scientific/biomedical documents
Toolkit for conversational AI
High-performance neural network inference framework for mobile
OCR offline image text recognition command line windows program
OCRmyPDF adds an OCR text layer to scanned PDF files
OCR expert VLM powered by Hunyuan's native multimodal architecture
Replace OpenAI GPT with another LLM in your app
Training data (data labeling, annotation, workflow) for all data types
Accurate × Fast × Comprehensive