Docutils: Documentation Utilities / Bugs / #134 No test case for Text.shortrepr with long string.

Jeffrey C. Jacobs - 2010-03-24

Okay, that's wasn't so hard -- I forgot I'd written node test cases before; so there you have it: this patch to the test_nodes.py test cases demonstrates the flaw in the current text node __repr__ and shortrepr functions. I need help to fix the issue though as I don't understand why the code is failing.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Jeffrey C. Jacobs - 2010-03-24

Fixed a couple of typos in the test case; also deduced the issue: when calling reprunicode.__repr__ with a parameter that is not of type reprunicode you will receive the error described. I have not tried Python 3.x but I assume since you don't need to strip the u from the output string like you do in the 2.x version that the code works in 3.x and since I don't think this logic: an instance function acting like a free / static function or the unicode string being promoted to a reprunicode string -- has been implemented in any of the 2.x versions. Therefore, I don't yet see a clear solution; stay tuned.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Jeffrey C. Jacobs - 2010-03-24

A fix for the issue listed; including test case

issue2975987.patch

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Jeffrey C. Jacobs - 2010-03-24

Okay, there y'all have it! I've written a patch to solve this issue that I think should work for both Python 3.x and Python 2.x >= 2.3. Enjoy, I hope!

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Jeffrey C. Jacobs - 2010-03-25

FYI, on my installation:

2.5.1 (r251:54863, Feb 6 2009, 19:02:12)
[GCC 4.0.1 (Apple Inc. build 5465)]

(The version installed with Mac OS X 10.5 (Leopard))

I see the following exception when I try to run the "Mary had a little lamb..." test:

ERROR: test_longrepr (test_nodes.TextTests)
----------------------------------------------------------------------
Traceback (most recent call last):
File "/Users/darklord/Documents/Remote/docutils/test/test_nodes.py", line 59, in test_longrepr
self.assertEquals(repr(self.longtext), r"<#text: Mary had a "
File "/Users/darklord/Documents/Remote/docutils/docutils/nodes.py", line 341, in __repr__
data = reprunicode.__repr__(self[:64] + ' ...')
TypeError: unbound method __repr__() must be called with reprunicode instance as first argument (got unicode instance instead)

=================

As you can see from the error, I believe the problem is that in this version of python, the call of the form reprunicode.__repr__(self) passes because self is derived from reprunicode so there is no casting involved in the base class call. But when a string is added to the result of an indexing operation (which is allowed because nodes.Text is derived from unicode), this is coerced automatically to unicode type, NOT to reprunicode type. Now, in some versions of python this will be okay because the call to reprunicode.__repr__ will automatically coerce the resultant unicode string into a reprunicode type, but not, apparently in version 2.5.1 as packaged with Mac OS X Leopard. Normally, I'd say upgrade but the thing is, when you're talking bundled software that is built into the OS framework, this isn't so easy; you and I can do it, but not an average user. That, coupled with the fact that all we're doing is forcing a cast from the unicode string generated into a reprunicode object and just calling the __repr__ method directly (we could also call it via the repr keyword since the object is now of type reprunicode, not nodes.Text) and since this isn't all that much glue code -- if the object is already a unicode string, all that's happening there is a reference count increment, and if it's before 3.x, it's still likely a reference count increment since reprunicode is a subclass of and thus container for unicode object anyway. So, even if you can't reproduce this issue with your version of Python, the solution should not adversely effect any version of python to any great extent and has the added bonus of fixing a problem with the Mac OS X Leopard default install.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Günter Milde - 2010-03-26

Fixed; thanks for the bug report.

You can download a current snapshot from:
http://docutils.sf.net/docutils-snapshot.tgz

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Günter Milde - 2010-03-26

status: open --> closed-fixed
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

No test case for Text.shortrepr with long string.

Searches

Help

#134 No test case for Text.shortrepr with long string.

Discussion