A python package to find repetitive format pattern in HTML pages and extract information from them using this pattern. The idea is that in pages that have some kind of a list, there will be a repetitive pattern for the human eye (the page format).
Sign up for the SourceForge newsletter:
You seem to have CSS turned off.
Please don't fill out this field.