HtmlCleaner is HTML parser written in Java. It transforms dirty HTML to well-formed XML following the same rules that the most web-browsers use.
Features
- Cleans up HTML
- Outputs XML, HTML or JDOM
- Supports foreign markup using namespaces
- Full command line and chaining support
- Optional GUI
Categories
HTML/XHTMLLicense
BSD LicenseFollow HtmlCleaner
Other Useful Business Software
Full-stack observability with actually useful AI | Grafana Cloud
Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
Rate This Project
Login To Rate This Project
User Reviews
-
I could not make it run on the Mac OS X 10.9.5. I do have the latest Java build installed.
-
Any issues get a quick response
-
Great library, probably the most advanced in processing broke HTML!
-
Realy cool API for quick and effective use to get up and running with a clean html code base.
-
nice project, good support - some improvements in design and performance may be possible...