htmlcxx is a simple non-validating html parser library for C++. It allows to fully dump the original html document, character by character, from the parse tree. It also has an intuitive tree traversal API.
License
GNU General Public License version 2.0 (GPLv2)Follow htmlcxx
Other Useful Business Software
Gen AI apps are built with MongoDB Atlas
MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
Rate This Project
Login To Rate This Project
User Reviews
-
A fast, solid, robust and easy to use HTML parser in C++. Nice usage of the tree.hh, STL like tree class, makes this parser really easy to use. Great work!