Formula recognition based on LaTeX-OCR and ONNXRuntime
OCR software, free and offline
Math OCR model that outputs LaTeX and markdown
Accurate × Fast × Comprehensive
Contexts Optical Compression
PDF to Markdown with vision models
JupyterLab extension for live editing of LaTeX documents
Visual Causal Flow
OCRmyPDF adds an OCR text layer to scanned PDF files
OCR expert VLM powered by Hunyuan's native multimodal architecture
Enhances Tesseract OCR output using LLMs (local or API)
Package for converting and rendering markdown documents in TeX
Awesome multilingual OCR toolkits based on PaddlePaddle
A high-quality tool for convert PDF to Markdown and JSON
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Re-editable LaTeX/ typst graphics for Inkscape
Multilingual Document Layout Parsing in a Single Vision-Language Model
Pure Python library for LaTeX to MathML conversion
minted is a LaTeX package that provides syntax highlighting
LaTeX CV generator from a YAML/JSON input file
Python library for converting Python calculations into rendered latex
PDF scientific paper translation with preserved formats