Ready-to-use OCR with 80+ supported languages
Multilingual Document Layout Parsing in a Single Vision-Language Model
Visual Causal Flow
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Accurate × Fast × Comprehensive
Implementation of Nougat Neural Optical Understanding