The DataExtractor (HTMLtoXML) extracts data from a HTML page according to a configuration file and puts the data into an XML file according to a specified structure. It is a tool to extract data from HTML pages and to store the data in XML files.

Project Activity

See All Activity >

Categories

XML, HTML/XHTML

License

GNU General Public License version 2.0 (GPLv2)

Follow DataExtractor - HTMLtoXML

DataExtractor - HTMLtoXML Web Site

Other Useful Business Software
Auth for GenAI | Auth0 Icon
Auth for GenAI | Auth0

Enable AI agents to securely access tools, workflows, and data with fine-grained control and just a few lines of code.

Easily implement secure login experiences for AI Agents - from interactive chatbots to background workers with Auth0. Auth for GenAI is now available in Developer Preview
Try free now
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of DataExtractor - HTMLtoXML!

Additional Project Details

Operating Systems

Linux, BSD, Windows

Intended Audience

Information Technology, Developers, End Users/Desktop

User Interface

Command-line

Programming Language

Java

Database Environment

XML-based

Related Categories

Java XML Software, Java HTML XHTML

Registered

2006-09-18