Formula recognition based on LaTeX-OCR and ONNXRuntime
OCR software, free and offline
Math OCR model that outputs LaTeX and Markdown
Accurate × Fast × Comprehensive
Contexts Optical Compression
PDF to Markdown with vision models
JupyterLab extension for live editing of LaTeX documents
Visual Causal Flow
OCRmyPDF adds an OCR text layer to scanned PDF files
OCR expert VLM powered by Hunyuan's native multimodal architecture
Enhances Tesseract OCR output using LLMs (local or API)
Awesome multilingual OCR toolkits based on PaddlePaddle
Package for converting and rendering Markdown documents in TeX
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
A high-quality tool for converting PDF to Markdown and JSON
Multilingual Document Layout Parsing in a Single Vision-Language Model
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Re-editable LaTeX/Typst graphics for Inkscape
Pure Python library for LaTeX to MathML conversion
LaTeX CV generator from a YAML/JSON input file
PDF scientific paper translation with preserved formats
minted is a LaTeX package that provides syntax highlighting
An Inkscape extension: LaTeX/TeX editor for Inkscape