|
From: Yuri T. <qar...@gm...> - 2008-07-04 03:21:13
|
> Btw, if anyone is interested in performance of html serializers and > parsers in python, here's a decent comparison: > http://blog.ianbicking.org/2008/03/30/python-html-parser-performance/ Interesting. It looks like lxml is way way faster than ElementTree. Also, the website for lxml seems to suggest that ElementTree has some serious problems in handling unicode (http://codespeak.net/lxml/compatibility.html, third bullet). This really worries me, more so than performance. This may not affect us, but we need to make sure that ElementTree can handle unicode properly if we would be using it. However, it looks like lxml is included with nothing at this point, and would require building stuff from C, which may raise the bar for using markdown... - yuri -- http://sputnik.freewisdom.org/ |