Jericho HTML Parser
Description
Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
Jericho HTML Parser Web SiteUser Ratings
User Reviews
-
Great software! Compared to the alternatives (e.g Jtidy, HTML cleaner) it is predictable and reliable. A big plus also is that it accepts any number of html elements even without a root element (html fragments). We use it in production!
-
Excellent work.
-
Good.
-
jerichohtml is fast and stable
-
Excellent Library. Nicely converts HTML to ASCII only text representation. Wish it was released in Apache License.
-
Great and well-documented library.