One platform to build, fine-tune, and deploy ML models. No MLOps team required.
Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
Try Free
Earn up to 16% annual interest with Nexo.
Access competitive interest rates on your digital assets.
Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform.
Geographic restrictions, eligibility, and terms apply.
NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.
Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
jStyleParser is a CSS parser written in Java. It has its own application interface that is designed to allow an efficient CSS processing in Java and mapping the values to the Java data types. It is also able to apply the parsed style sheets to a DOM that represents an HTML or XML document and to compute the resulting style of the individual document elements.
FreePascal / Lazarus / Delphi / Kylix - compatible bindings and object-oriented wrapper for LibCURL, the client-side URL transfer library, and TidyLib, the HTML parsing and formatting library based on HTML-Tidy.
HTMLparser which can be used for screen-scraping applications
htmldom parses the HTML file and provides methods for iterating and searching the parse tree in a similar way as Jquery. To report bugs please mail me at bhimsen.pes@gmail.com
hcxselect is a small and fast CSS selector engine for C++. It parses CSS selector expressions and applies them to a set of document nodes (or a whole tree) parsed via htmlcxx, a simple non-validating HTMLparser. Thus, it allows you to use CSS selectors in your C++ program without much bloat.
HXPath is a command line tool useful to extract data from HTML documents. HXPath can select sub trees, like the standard xpath tool, but is also able to read contents and attributes and output them in a bash friendly format. HTML Tidy and HTTP/HTTPS get are built in too.
NBBC is a high-speed, extensible, sophisticated, well-documented, easy-to-use validating BBCode parser written in PHP.
________________________
NOTE: THIS COPY OF NBBC HAS NOT BEEN UPDATED IN A LONG TIME.
There are currently two active maintainers of NBBC:
- Theyak (https://github.com/theyak/nbbc) has a copy of NBBC that still works with PHP 4, and is still more-or-less the 1.x source tree, but it has had some of the outstanding bugs fixed.
- Vanilla...
WhiteCrane not only allows you to update your sites files but it also lets you create and delete pages, and thanks to our custom built syntax parser any content added is rendered in 100% valid xhtml and css.
XhtmlParser is a simple HTML/XHTML parser written in PHP5 object oriented. XHtmlParser allows you to iterate/modify XHTML nodes and get/set their attributes. It parse XHTML data to build object structure.
MozillaParser is a Java Htmlparser based on mozilla's htmlparser. it acts as a bridge from java classes to Mozilla's classes and outputs a java Document object from a raw ( and dirty) HTML input
Wow Log Parser is a combat log parser for the game World of Warcraft. The purpose of the program is to parse the files generated with the /combatlog command. The source code can be found on: http://www.gurre.eu/wowlogparser/forum
NeatCleaner is a velocity template file parser written in Java. It transforms dirty HTML/VTL/JS to well-formed Node Tree and
render the Node Tree to output pretty source.
A simple PHP parser class that scans through poorly formed html documents as if they were valid XML, and generates calls to customizable callback functions.
`parsehtml.php` is a HTMLparser written in PHP. It should be pretty fast and can even cope with invalid HTML input.
Additionally a function is provided to indent HTML code properly.
PTM is a Perl/HTML hybrid, similar to PHP in tag syntax. It allows for the ease of PHP-style inline code in your dynamic HTML documents while supplying the power of Perl that many find to be sorely lacking in PHP.
The project is a collection of object pascal libraries for parsing text strings and markup languages HTML, XHTML, XML, CSS and others. Libraries are written in minimalism concept with striving for wide unification.
HtmlClient provides an SGML/HTML/XHTML parser and connection client making web-spidering as easy for developers as actually surfing the web with a premade browser. Based on Apache's HttpClient.