Attributes cannot contain quotes
HTML parser which can be used for screen-scraping applications
Brought to you by:
bhimsen92
The HTML spec allows attributes delimited with double quotes to contain single quotes and vice versa. However code like
page = """<a title="It's bugged!"></a>""" dom = htmldom.HtmlDom().createDom(page)
enters an infinite loop. It looks the the regular expression used for attributes does not allow for this.
Anonymous