|
From: Alex O. <no...@gi...> - 2025-12-09 07:58:54
|
Branch: refs/heads/master Home: https://github.com/internetarchive/heritrix3 Commit: fb791e2b0c3a15add2f8a09e15d510cc825ad4ff https://github.com/internetarchive/heritrix3/commit/fb791e2b0c3a15add2f8a09e15d510cc825ad4ff Author: Leslie Bellony <les...@bn...> Date: 2025-12-04 (Thu, 04 Dec 2025) Changed paths: M modules/src/main/java/org/archive/modules/extractor/ExtractorHTML.java M modules/src/test/java/org/archive/modules/extractor/ExtractorHTMLTest.java Log Message: ----------- Issue #688 ignore extracted strings longer than 2048 characters Commit: 305204af8b3777e93b25109fc27835249660b818 https://github.com/internetarchive/heritrix3/commit/305204af8b3777e93b25109fc27835249660b818 Author: Leslie Bellony <les...@bn...> Date: 2025-12-04 (Thu, 04 Dec 2025) Changed paths: M modules/src/test/java/org/archive/modules/extractor/ExtractorHTMLTest.java M modules/src/test/java/org/archive/modules/extractor/JerichoExtractorHTMLTest.java Log Message: ----------- Issue #689 unit tests for image attributes Commit: 771452ee83af224e6eb9f42804005c2e54482a84 https://github.com/internetarchive/heritrix3/commit/771452ee83af224e6eb9f42804005c2e54482a84 Author: Leslie Bellony <les...@bn...> Date: 2025-12-08 (Mon, 08 Dec 2025) Changed paths: M modules/src/main/java/org/archive/modules/extractor/ExtractorHTML.java Log Message: ----------- when skipping too long value, move matcher pointer to the end of value Commit: 4bcd5b3a68c98046e775826f2dacde4842441478 https://github.com/internetarchive/heritrix3/commit/4bcd5b3a68c98046e775826f2dacde4842441478 Author: Alex Osborne <aos...@nl...> Date: 2025-12-09 (Tue, 09 Dec 2025) Changed paths: M modules/src/main/java/org/archive/modules/extractor/ExtractorHTML.java M modules/src/test/java/org/archive/modules/extractor/ExtractorHTMLTest.java M modules/src/test/java/org/archive/modules/extractor/JerichoExtractorHTMLTest.java Log Message: ----------- Merge pull request #697 from bnfleb/bnf_2025 Issue #688 ignore extracted strings longer than 2048 characters Compare: https://github.com/internetarchive/heritrix3/compare/df5920656c03...4bcd5b3a68c9 To unsubscribe from these emails, change your notification settings at https://github.com/internetarchive/heritrix3/settings/notifications |