OCRmyPDF - Browse /v17.5.0 at SourceForge.net

The interactive file manager requires Javascript. Please enable it or use sftp or scp.
You may still browse the files here.

Name	Modified	Size	InfoDownloads / Week
Parent folder
ocrmypdf-17.5.0-py3-none-any.whl.sigstore.json	2026-05-27	11.2 kB	5
ocrmypdf-17.5.0.tar.gz.sigstore.json	2026-05-27	11.2 kB	1
ocrmypdf-17.5.0-py3-none-any.whl	2026-05-27	495.3 kB	0
ocrmypdf-17.5.0.tar.gz	2026-05-27	7.4 MB	0
README.md	2026-05-27	1.2 kB	0
v17.5.0 source code.tar.gz	2026-05-27	7.4 MB	0
v17.5.0 source code.zip	2026-05-27	7.7 MB	33
Totals: 7 Items		23.0 MB	39

Added support for the end alias in --pages, denoting the last page of the document. For example, --pages 3-end OCRs from page 3 through the final page. {issue}1615
Added --ghostscript-jpeg-quality and --ghostscript-jpeg-maxdpi advanced options for tuning Ghostscript's PDF/A output. The optimizer's --jpeg-quality remains the recommended file-size control.
Fixed pypdfium2 rasterizer clipping content when the CropBox was smaller than the MediaBox (e.g. JSTOR or cropped PDFs). {issue}1685
Fixed Form XObject cycle detection in the optimizer's image xref scan. Self-referential or DAG-shaped Form graphs (notably from PowerPoint exports) previously produced floods of recursion warnings and could hang for minutes. {issue}1321
Tesseract config errors are now surfaced as TesseractConfigError with actionable guidance, instead of crashing later with a confusing FileNotFoundError on the missing hOCR output. {issue}1687
Refreshed the Chinese README translation. Thanks @cislunarspace.
Internal refactoring of the _exec and subprocess modules to separate probing from execution.
CI dependency updates.