From: Trent M. <tr...@gm...> - 2008-07-03 23:53:24
> As far as I can tell, the only way to get around that sort of problem
> would be using BeautifulSoup to parse the input... But there goes any
> hope of a significant speedup.

Or html5lib:
http://code.google.com/p/html5lib/wiki/UserDocumentation

There is an example there that builds an ElementTree. I've had good
success using html5lib to parse HTML that isn't XHTML, though I'm not
sure what the speed implications are. We could perhaps fall back to
html5lib (or BeautifulSoup) only when there is an input XML parsing
problem.

Trent

--
Trent Mick
tr...@gm...
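
A rough sketch of the fallback idea above, not tested against the project's code. It assumes the fast path is the standard library's `xml.etree.ElementTree`; the names `parse_markup` and `_html5lib_fallback` are made up for illustration. html5lib is imported lazily, so well-formed input never touches it.

```python
import xml.etree.ElementTree as ET


def _html5lib_fallback(text):
    # Hypothetical helper: imported lazily so that html5lib is only
    # required (and only paid for) when the strict XML parse fails.
    import html5lib
    # treebuilder="etree" asks html5lib for an ElementTree-style element;
    # namespaceHTMLElements=False keeps tag names unprefixed.
    return html5lib.parse(text, treebuilder="etree",
                          namespaceHTMLElements=False)


def parse_markup(text, fallback=_html5lib_fallback):
    """Return (root_element, used_fallback).

    Try the strict XML parser first; hand the input to the lenient
    fallback parser only if the XML parser rejects it.
    """
    try:
        return ET.fromstring(text), False
    except ET.ParseError:
        return fallback(text), True
```

Note that the two paths are not guaranteed to return the same shape of tree: html5lib returns a full `html` document element, so callers may need to normalise the result. Passing a BeautifulSoup-based callable as `fallback` would work the same way.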