Java library for working with real-world HTML
Extract data from a wide range of Internet sources
Convert Bootstrap CSS code to Tailwind CSS code
Automatically extract body content (and other cool stuff) from HTML
Picks up text from a web page using an HTML template.
Xidel is a CLI web page scraping tool supporting XPath/XQuery 3 and CSS