Thread: [Jython-dev] Initial 'u' on repr(string)

Hello jython-dev,

I've just been bitten by what to me looks like a misfeature in Jython.  I
have a Python client and a Jython server on the same machine, talking over
a localhost socket.  I'm using repr() and eval() as my wire protocol
(because there's no possibility of anyone else being able to connect to
this socket; though I'm sufficiently paranoid that I'm pre-parsing the
repr()'d packets anyway to ensure that they're safe).
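For illustration only, here is a minimal sketch of what a safe decode step could look like. It assumes a Python version that provides ast.literal_eval, which is an assumption and not necessarily how the pre-parsing mentioned above is done; decode_packet is a hypothetical name.

```python
import ast


def decode_packet(wire_text):
    """Hypothetical helper: decode one repr()'d packet without calling eval()."""
    # literal_eval accepts only literals (strings, numbers, tuples, lists,
    # dicts and similar) and raises ValueError or SyntaxError for anything
    # else, such as function calls or attribute access.
    return ast.literal_eval(wire_text)


print(decode_packet("['spam', 42]"))
```

Anything that is not a plain literal is rejected before it can run, which is the property the pre-parsing is meant to guarantee.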

My problem is this: when repr()'ing a string, Jython only adds the u''
prefix if the string contains char values > 255.

So when I send a list of strings over the socket from my Jython server to
my Python client, some of them come out as plain strings and others come
out as unicode strings.  This had me swearing for several hours yesterday
(I've calmed down now 8-) because it never occurred to me that a list that
went in one end homogeneous could come out of the other end heterogeneous.
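The effect can be sketched as follows. This is an emulation of the behaviour described above, written for a current Python, and not Jython's actual code; jython_style_repr and always_unicode_repr are hypothetical names, and the second one is only one possible sender-side workaround.

```python
def jython_style_repr(s):
    """Hypothetical emulation: add the u prefix only for chars above 255."""
    body = ascii(s)  # quoted literal with non-ASCII characters escaped
    if any(ord(ch) > 255 for ch in s):
        return "u" + body
    return body


def always_unicode_repr(s):
    """Possible workaround (an assumption): prefix every string uniformly."""
    return "u" + ascii(s)


names = ["spam", "caf\xe9", "\u20ac5"]

# The same homogeneous list, serialised both ways.
mixed = "[" + ", ".join(jython_style_repr(n) for n in names) + "]"
uniform = "[" + ", ".join(always_unicode_repr(n) for n in names) + "]"

print(mixed)    # ['spam', 'caf\xe9', u'\u20ac5']
print(uniform)  # [u'spam', u'caf\xe9', u'\u20ac5']
```

A receiver that distinguishes plain and unicode string literals would eval() the first form into a list of mixed types, while the second form stays homogeneous.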

This looks to me like a terrible violation of the Principle of Least
Surprise.  Everything works perfectly until a character over 255 comes
along and suddenly your code breaks with an encoding error.  It also makes
Jython significantly different from CPython.

Any comments?  Can I consider this a bug and enter a bug report / patch?
Or am I simply abusing repr()?

-- 
Richie Hindle
ri...@en...