From: Guenter M. <mi...@us...> - 2011-04-13 09:49:50
On 2011-04-04, Stefan Merten wrote:

> The command ::
>
>   rst2xml.py --debug --traceback --input-encoding=utf-8 --output-encoding=utf-8 --error-encoding=utf-8 umlaut.rst /dev/null
>
> on input file `umlaut.rst` ::
>
>   äöüÄÖÜß
>
> crashes with a misleading error message::
>
>   UnicodeEncodeError: 'ascii' codec can't encode characters in position 51-57: ordinal not in range(128)
>   The specified output encoding (utf-8) cannot
>   handle all of the output.
>   Try setting "--output-encoding-error-handler" to
>   * "xmlcharrefreplace" (for HTML & XML output);
>     the output will contain "äöüÄÖÜß" and should be usable.
>   * "backslashreplace" (for other output formats);
>     look for "\xe4\xf6\xfc\xc4\xd6\xdc\xdf" in the output.
>   * "replace"; look for "?" in the output.
>   "--output-encoding-error-handler" is currently set to "xmlcharrefreplace".
>
> I guess it is in many places that debug output can not handle
> non-ASCII input files properly :-( .

Actually, this is a Python bug. It should be fine with Python >= 2.6 and
with the workaround I committed yesterday:

- Work around Issue2517_ to allow unicode messages in
  `Exception` instances with Python < 2.6.

.. _Issue2517: http://bugs.python.org/issue2517

Can you please test?

Günter
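
For reference, the three error handlers named in the quoted message can be tried in isolation. The following is only a rough sketch in Python 3 (not the Python 2 versions discussed here, and not Docutils code); the helper name `encode_with_handler` is made up for illustration. It shows what each handler produces when the sample text is forced into ASCII, and that the default "strict" handler raises the same kind of `UnicodeEncodeError` ::

```python
# Illustrative sketch only (Python 3). `encode_with_handler` is a
# hypothetical helper, not part of Docutils.

# The sample text from umlaut.rst, written with escapes so this file
# itself stays pure ASCII.
TEXT = "\xe4\xf6\xfc\xc4\xd6\xdc\xdf"


def encode_with_handler(text, handler):
    """Encode `text` to ASCII with the given codec error handler and
    return the result as a str for easy printing."""
    return text.encode("ascii", handler).decode("ascii")


if __name__ == "__main__":
    for handler in ("xmlcharrefreplace", "backslashreplace", "replace"):
        print(handler, "->", encode_with_handler(TEXT, handler))

    # The default handler is "strict", which raises instead of replacing.
    try:
        TEXT.encode("ascii")
    except UnicodeEncodeError as err:
        print("strict ->", err)
```

This only demonstrates the codec error handlers; it does not reproduce Issue2517 itself, which concerns unicode messages inside `Exception` instances under Python < 2.6.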