OCRmyPDF v17.5.0
Name Modified Size
ocrmypdf-17.5.0-py3-none-any.whl.sigstore.json 2026-05-27 11.2 kB
ocrmypdf-17.5.0.tar.gz.sigstore.json 2026-05-27 11.2 kB
ocrmypdf-17.5.0-py3-none-any.whl 2026-05-27 495.3 kB
ocrmypdf-17.5.0.tar.gz 2026-05-27 7.4 MB
README.md 2026-05-27 1.2 kB
v17.5.0 source code.tar.gz 2026-05-27 7.4 MB
v17.5.0 source code.zip 2026-05-27 7.7 MB
Totals: 7 items, 23.0 MB, 39 downloads per week
  • Added support for the end alias in --pages, denoting the last page of the document. For example, --pages 3-end OCRs from page 3 through the final page. (issue #1615)
  • Added --ghostscript-jpeg-quality and --ghostscript-jpeg-maxdpi advanced options for tuning Ghostscript's PDF/A output. The optimizer's --jpeg-quality remains the recommended file-size control.
  • Fixed pypdfium2 rasterizer clipping content when the CropBox was smaller than the MediaBox (e.g. JSTOR or cropped PDFs). (issue #1685)
  • Fixed Form XObject cycle detection in the optimizer's image xref scan. Self-referential or DAG-shaped Form graphs (notably from PowerPoint exports) previously produced floods of recursion warnings and could hang for minutes. (issue #1321)
  • Tesseract config errors are now surfaced as TesseractConfigError with actionable guidance, instead of crashing later with a confusing FileNotFoundError on the missing hOCR output. (issue #1687)
  • Refreshed the Chinese README translation. Thanks @cislunarspace.
  • Internal refactoring of the _exec and subprocess modules to separate probing from execution.
  • CI dependency updates.
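To illustrate how an end alias in a page specification such as --pages 3-end might be resolved, here is a minimal Python sketch. It is not OCRmyPDF's actual parser: the function name parse_pages, its signature, and its validation rules are assumptions made for this example.

```python
def parse_pages(spec: str, total_pages: int) -> list[int]:
    """Expand a page spec such as "1,3-end" into sorted 1-based page numbers.

    Hypothetical helper for illustration; not OCRmyPDF's implementation.
    """

    def resolve(token: str) -> int:
        token = token.strip()
        # "end" is treated as an alias for the last page of the document.
        return total_pages if token == "end" else int(token)

    pages: set[int] = set()
    for part in spec.split(","):
        if "-" in part:
            start_text, stop_text = part.split("-", 1)
            start, stop = resolve(start_text), resolve(stop_text)
        else:
            start = stop = resolve(part)
        # Reject ranges that are reversed or fall outside the document.
        if not 1 <= start <= stop <= total_pages:
            raise ValueError(f"invalid page range: {part.strip()!r}")
        pages.update(range(start, stop + 1))
    return sorted(pages)
```

Under these assumptions, parse_pages("3-end", 5) selects pages 3, 4, and 5, and a range that starts beyond the last page raises ValueError.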
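The Form XObject fix concerns traversing a graph that may contain cycles or shared nodes. The general technique can be sketched in Python with a visited set over a plain dictionary; this is an illustration only, not the optimizer's code, and the names collect_images, graph, and images are hypothetical.

```python
def collect_images(
    graph: dict[str, list[str]], images: set[str], root: str
) -> list[str]:
    """Return the image names reachable from root, scanning each node once.

    graph maps a form name to the names it references; images is the set
    of names that are images rather than forms. Both are stand-ins for
    real PDF objects in this sketch.
    """
    seen: set[str] = set()
    found: list[str] = []
    stack = [root]
    while stack:
        name = stack.pop()
        if name in seen:
            # Already scanned: this breaks cycles and avoids re-walking
            # subgraphs that are shared in a DAG.
            continue
        seen.add(name)
        if name in images:
            found.append(name)
            continue
        stack.extend(graph.get(name, []))
    return sorted(found)
```

Because every node is marked before its children are pushed, a form that references itself or an ancestor is skipped on the second encounter, and the scan terminates after visiting each node at most once.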
Source: README.md, updated 2026-05-27