Ready-to-use OCR with 80+ supported languages
Multilingual Document Layout Parsing in a Single Vision-Language Model
OCR offline image text recognition command line windows program
Visual Causal Flow
OCR expert VLM powered by Hunyuan's native multimodal architecture
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Accurate × Fast × Comprehensive
Implementation of Nougat Neural Optical Understanding
Uniform deep learning inference framework for mobile
Automatic license plate recognition library