A python package to find repetitive format pattern in HTML pages and extract information from them using this pattern. The idea is that in pages that have some kind of a list, there will be a repetitive pattern for the human eye (the page format).

Project Activity

See All Activity >

License

GNU General Public License version 3.0 (GPLv3)

Follow HtmlList

HtmlList Web Site

You Might Also Like
Run applications fast and securely in a fully managed environment Icon
Run applications fast and securely in a fully managed environment

Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of Google's scalable infrastructure.

Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of HtmlList!

Additional Project Details

Intended Audience

Developers

Programming Language

Python

Related Categories

Python HTML XHTML, Python Information Analysis Software, Python Libraries

Registered

2009-06-16