| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| ocrmypdf-17.5.0-py3-none-any.whl.sigstore.json | 2026-05-27 | 11.2 kB | |
| ocrmypdf-17.5.0.tar.gz.sigstore.json | 2026-05-27 | 11.2 kB | |
| ocrmypdf-17.5.0-py3-none-any.whl | 2026-05-27 | 495.3 kB | |
| ocrmypdf-17.5.0.tar.gz | 2026-05-27 | 7.4 MB | |
| README.md | 2026-05-27 | 1.2 kB | |
| v17.5.0 source code.tar.gz | 2026-05-27 | 7.4 MB | |
| v17.5.0 source code.zip | 2026-05-27 | 7.7 MB | |
| Totals: 7 Items | 23.0 MB | 39 | |
- Added support for the
endalias in--pages, denoting the last page of the document. For example,--pages 3-endOCRs from page 3 through the final page. {issue}1615 - Added
--ghostscript-jpeg-qualityand--ghostscript-jpeg-maxdpiadvanced options for tuning Ghostscript's PDF/A output. The optimizer's--jpeg-qualityremains the recommended file-size control. - Fixed pypdfium2 rasterizer clipping content when the CropBox was smaller
than the MediaBox (e.g. JSTOR or cropped PDFs). {issue}
1685 - Fixed Form XObject cycle detection in the optimizer's image xref scan.
Self-referential or DAG-shaped Form graphs (notably from PowerPoint
exports) previously produced floods of recursion warnings and could hang
for minutes. {issue}
1321 - Tesseract config errors are now surfaced as
TesseractConfigErrorwith actionable guidance, instead of crashing later with a confusingFileNotFoundErroron the missing hOCR output. {issue}1687 - Refreshed the Chinese README translation. Thanks @cislunarspace.
- Internal refactoring of the
_execandsubprocessmodules to separate probing from execution. - CI dependency updates.