Lexbor is an open source HTML renderer library under development
Converts CSS selectors to XPath expressions
Thin wrapper for "pandoc" (MIT)
Dominate is a Python library for creating and manipulating HTML documents
A Python package for building the DOM of HTML documents
An HTML5 parsing library in pure C99