A python package to find repetitive format pattern in HTML pages and extract information from them using this pattern. The idea is that in pages that have some kind of a list, there will be a repetitive pattern for the human eye (the page format).

Project Activity

See All Activity >

License

GNU General Public License version 3.0 (GPLv3)

Follow HtmlList

HtmlList Web Site

Other Useful Business Software
Secure File Transfer for Windows with Cerberus by Redwood Icon
Secure File Transfer for Windows with Cerberus by Redwood

Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
Try for Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of HtmlList!

Additional Project Details

Intended Audience

Developers

Programming Language

Python

Related Categories

Python HTML XHTML, Python Information Analysis Software, Python Libraries

Registered

2009-06-16