Trying to get started with jython, I tried to run one of my script
that count word frequencies in a document, but I've run in the error
UnicodeDecodeError: 'unicodeescape' codec can't decode bytes in
position 2845-2851: illegal Unicode character
I've interpreted this error message as "jython" somehow has an issue
parsing purekeyworddbtest.py (not sure this is the right
interpretation, but the lack of stack trace seems to hint in that
So I fired up python to figure out what was in the file at this position?
Python 2.6.2 (r262:71605, Apr 14 2009, 22:40:02) [MSC v.1500 32 bit
(Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> text = open('purekeyworddbtest.py','rb').read()
>>> print repr(text[2845:2851])
>>> print repr(text[2800:2900])
Turns out that it is somehow stumbling on \u1fdc which is part of a
regular expression character range. Any ideas why and how to
This script executes fine with regular python. I'm using Jython 2.5.1
(Release_2_5_1:6813, Sep 26 2009, 13:47:54).