| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| opendataloader-pdf-cli-2.1.1.zip | 2026-03-26 | 22.4 MB | |
| README.md | 2026-03-26 | 4.4 kB | |
| Release v2.1.1 source code.tar.gz | 2026-03-26 | 30.6 MB | |
| Release v2.1.1 source code.zip | 2026-03-26 | 30.9 MB | |
| Totals: 4 Items | 83.9 MB | 0 | |
What's Changed
- feat: add --detect-strikethrough option for strikethrough text detection (#295) by @hnc-jglee in https://github.com/opendataloader-project/opendataloader-pdf/pull/298
- fix: filter narrow outlier elements in vertical gap detection by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/300
- Refactoring for StrikethroughProcessor and XYCutPlusPlusSorter by @MaximPlusov in https://github.com/opendataloader-project/opendataloader-pdf/pull/325
- chore: remove Claude Code GitHub workflows by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/334
- fix: use asyncio event loop on Windows to avoid uvloop error by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/328
- docs: fix hybrid_timeout type and hybrid_fallback default by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/299
- feat: detect CID font extraction failure and route to OCR fallback by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/291
- fix: run converter.convert() in thread pool to prevent event loop blocking by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/322
- Update outdated contributing instructions by @JCZhang2025 in https://github.com/opendataloader-project/opendataloader-pdf/pull/306
- docs: create whats-new-v2 article by @bdoubrov in https://github.com/opendataloader-project/opendataloader-pdf/pull/339
- test: clean up stale TextProcessor regression by @JCZhang2025 in https://github.com/opendataloader-project/opendataloader-pdf/pull/308
- fix: skip hybrid backend checks when no pages remain by @JCZhang2025 in https://github.com/opendataloader-project/opendataloader-pdf/pull/311
- chore: remove LFS, move benchmark to opendataloader-bench by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/340
- fix: handle null textColor in HeadingProcessor for hybrid mode by @justperson94 in https://github.com/opendataloader-project/opendataloader-pdf/pull/320
- fix(tables): normalize under-segmented spreadsheet tables by @sickn33 in https://github.com/opendataloader-project/opendataloader-pdf/pull/338
- fix: change hybrid timeout default to unlimited (0) by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/337
- chore: upgrade GitHub Actions to Node 24-compatible versions by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/346
- fix: handle merged cells in Markdown table generation by @hnc-jglee in https://github.com/opendataloader-project/opendataloader-pdf/pull/342
- Add double quotes to whats-new-v2.mdx by @MaximPlusov in https://github.com/opendataloader-project/opendataloader-pdf/pull/348
- chore: update dependencies to fix security vulnerabilities by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/347
- fix: replace PR [#320] defensive NPE catches with proper graceful degradation by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/350
- ci: add benchmark results to step summary by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/355
- feat: add MCP server for AI agent integration by @bejoyfuuul in https://github.com/opendataloader-project/opendataloader-pdf/pull/351
- fix: remove fallback 0 for missing thresholds in step summary by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/356
- fix: add install instructions to hybrid server error and CLI help by @bundolee in https://github.com/opendataloader-project/opendataloader-pdf/pull/357
New Contributors
- @JCZhang2025 made their first contribution in https://github.com/opendataloader-project/opendataloader-pdf/pull/306
- @bdoubrov made their first contribution in https://github.com/opendataloader-project/opendataloader-pdf/pull/339
- @justperson94 made their first contribution in https://github.com/opendataloader-project/opendataloader-pdf/pull/320
- @sickn33 made their first contribution in https://github.com/opendataloader-project/opendataloader-pdf/pull/338
- @bejoyfuuul made their first contribution in https://github.com/opendataloader-project/opendataloader-pdf/pull/351
Full Changelog: https://github.com/opendataloader-project/opendataloader-pdf/compare/v2.0.2...v2.1.1