Jericho HTML Parser is a simple but powerful java HTML parser library allowing analysis and manipulation of HTML documents. Version 1.3 introduces some minor features such as the ability to ignore specified sections of the document when parsing, and to parse attribute lists in any part of the document.
It also fixes a major bug related to the presence of comments in the document.
- Deprecated Source.getSourceTextLowerCase()
- Added ignoreWhenParsing methods to Source and Segment classes
(See sample called JSPTest)
- Added parseAttributes methods to Source, Segment and StartTag classes
- Added BlankOutputSegment class
- Bug fixes