Download Latest Version 3.13.0 source code.tar.gz (2.3 MB)
Email in envelope

Get an email when there's a new version of Heritrix

Home / 3.13.0
Name Modified Size InfoDownloads / Week
Parent folder
3.13.0 source code.tar.gz 2025-12-11 2.3 MB
3.13.0 source code.zip 2025-12-11 3.1 MB
README.md 2025-12-11 2.5 kB
Totals: 3 Items   5.3 MB 0

Download distribution zip (or tar.gz)

Full Changelog | Javadoc | Maven Central

New features

  • Config editor: IDE-style completions for bean names and Spring XML (powered by the new bean docs generator). #684
  • Job status API: The sizeTotalsReport now includes a sizeOnDisk value totaling the size of the files in latest/warcs. #700
  • ExtractorJson: New extractor that extracts URI strings from JSON documents. #701

Bug fixes

  • AbstractCookieStore: Fixed cookies with leading dot (.example.com) being ignored #691
  • ExtractorHTML: Fixed attribute values longer than 2048 characters causing extraction of truncated strings. #697
  • ClientFTP: Fixed MalformedServerReplyException when FTP sends a response with only an error code and no message. #694
  • BdbMultipleWorkQueues: Added null checks, type validation, and warning logs in BdbMultipleWorkQueues.delete() to improve frontier stability in the case of corrupted or partially persisted CrawlURIs. #693
  • BeanDocProcessor: Fixed compiler IllegalArgumentException when IntelliJ runs the annotation processor with a ProcessingEnvironment wrapper.

Dependency upgrades

  • amqp-client: 5.27.0 → 5.27.1
  • commons-cli: 1.10.0 → 1.11.0
  • commons-codec: 1.19.0 → 1.20.0
  • commons-io: 2.20.0 → 2.21.0
  • jackson: 2.20.0 → 2.20.1
  • jetty: 12.0.29 → 12.0.30
  • jsch: 2.27.4 → 2.27.7
  • junit-jupiter: 6.0.0 → 6.0.1
  • kafka-clients: 4.1.0 → 4.1.1
  • lz4-java: 1.8.0 → 1.10.1
  • spring-framework: 6.2.12 → 7.0.1
  • webarchive-commons: 3.0.1 → 3.0.2
Source: README.md, updated 2025-12-11