NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
Spondulas is browser emulator designed to retrieve web pages for hunti
Spondulas is browser emulator and parser designed to retrieve web pages for hunting malware. It supports generation of browser user agents, GET/POST requests, and SOCKS5 proxy. It can be used to parse HTML files sent via e-mail. Monitor mode allows a website to be monitored at intervals to discover changes in DNS or content over time. Autolog mode creates an investigation file that documents redirection chains.
Serve is a platform designed for PHP4 and used by Fundi Technologies. We are releasing portions that we believe will be helpful to the community. The first two releases are an HTMLparser and a MySQL database wrapper that fixes problems with PearDB
The program tHE HTML packer is intended for compression of ready HTML-files before loading them on a WEB-site.
For operation of the packed pages it is necessary to have a browser with support of the language JavaScript. In the given moment the program wa
Arachnid is a Java-based web spider framework. It includes a simple HTMLparser object that parses an input stream containing HTML content. Simple Web spiders can be created by sub-classing Arachnid and adding a few lines of code called after each page
LogAnal is a quick hack to parse Apache Log Files and produce graphical and textual web server statistics.
Works in incremental mode only. Supports Templates for the output HTML, as well as localization (defaults to English).