| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| 3.12.0 source code.tar.gz | 2025-10-30 | 2.3 MB | |
| 3.12.0 source code.zip | 2025-10-30 | 3.1 MB | |
| README.md | 2025-10-30 | 1.8 kB | |
| Totals: 3 Items | | 5.3 MB | 0 |
Download distribution zip (or tar.gz)
Full Changelog | Javadoc | Maven Central
New features
- ConfigurableExtractorJS: Regex rules to skip extracting `<script>` tags when their attributes match. #672
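The idea behind the skip rules can be sketched roughly as follows. This is an illustration only, not the actual ConfigurableExtractorJS code or its configuration syntax: the class `ScriptSkipSketch`, the method `shouldSkip`, and the example JSON-LD rule are all hypothetical names chosen for this sketch. It assumes each rule is a regular expression tested against the raw attribute text of a `<script>` tag.

```java
import java.util.List;
import java.util.regex.Pattern;

/** Hypothetical illustration; not part of Heritrix and not its real API. */
public class ScriptSkipSketch {

    /** Returns true when any skip rule matches somewhere in the tag's raw attribute text. */
    public static boolean shouldSkip(String attributeText, List<Pattern> skipRules) {
        for (Pattern rule : skipRules) {
            if (rule.matcher(attributeText).find()) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Example rule (an assumption): skip JSON-LD data blocks,
        // which hold structured data rather than executable JavaScript.
        List<Pattern> rules = List.of(
                Pattern.compile("type\\s*=\\s*[\"']application/ld\\+json[\"']",
                        Pattern.CASE_INSENSITIVE));

        System.out.println(shouldSkip("type=\"application/ld+json\"", rules)); // true
        System.out.println(shouldSkip("src=\"/static/app.js\" defer", rules)); // false
    }
}
```

Matching on the attribute text rather than the script body lets a crawler decide cheaply, before any link extraction runs, whether a tag is worth processing at all.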
Bug fixes
- Docs: Switch bean docs generation to an annotation processor, fixing the bean reference broken by Java language changes. #683
- StatisticsTracker: Don’t restore `crawlEndTime` when resuming from a checkpoint. #669
- ExtractorJS: Fix overriding the `strict` setting in sheets. #670
- Berkeley DB: Handle more shutdown interrupts gracefully. #671
Dependency upgrades
- amqp-client: 5.26.0 → 5.27.0
- groovy: 4.0.28 → 5.0.2
- jaxb-runtime: 4.0.5 → 4.0.6
- jetty: 12.0.27 → 12.0.29
- jsch: 2.27.3 → 2.27.4
- junit-jupiter: 5.13.4 → 6.0.0
- kafka-clients: 3.9.1 → 4.1.0
- pdfbox: 3.0.5 → 3.0.6
- rethinkdb-driver: 2.3.3 → 2.4.4
- spring: 6.2.11 → 6.2.12
- webarchive-commons: 3.0.0 → 3.0.1
- webjars-locator-lite: 1.1.0 → 1.1.2