HtmlCleaner is HTML parser written in Java. It transforms dirty HTML to well-formed XML following the same rules that the most web-browsers use.
- Cleans up HTML
- Outputs XML, HTML or JDOM
- Supports foreign markup using namespaces
- Full command line and chaining support
- Optional GUI
I could not make it run on the Mac OS X 10.9.5. I do have the latest Java build installed.
Any issues get a quick response
Great library, probably the most advanced in processing broke HTML!
Realy cool API for quick and effective use to get up and running with a clean html code base.
nice project, good support - some improvements in design and performance may be possible...