Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
Follow Jericho HTML Parser
Rate This ProjectLogin To Rate This Project
Best HTML parser there is
Very useful and reliable library to parse HTML!
I needed to transform many JSP pages from a real open source project downloaded form the Internet. Initially, I tried jsoup because it looks like having a easier/more intuitive/higher level API, but it become a headache with lots of errors to compile the transformed project. Then, I tried jericho and, it worked incredibly fine from the first time. Awesome library to parse and transform JSP pages!!
Impressible project - more powerful than most commercial solutions. Incredible powerful and flexible. Saved me countless hours.
We needed to validate and compare different dynamic generated web pages inside of Selenium2 application tests. Jericho HTML Parser provided all required high-level methods for HTML content analysis and evaluation (especially getDebugInfo() method for fast allocation of problem code). Very good API documentation and set of examples allows us fast finish all necessary application tests. Excellent work - thanks.