Hello Laura,

Thank you very much for your interest. I subscribed to the diversity mailing list and posted an e-mail about the issue (though the post has not appeared in the list yet).

This is the e-mail I posted to diversity:


My name is Firat Ozgul. I am the one who posted the e-mail in docutils-develop mailing list relating to the Turkish characters cropped from url permalinks...

As Laura stated, the solution is just a few lines of code. In my own project, I use the patched version of nodes.py. This is the code I use, if relevant:


...and this is the patch:


Note that, besides converting Turkish-specific characters into their ASCII look-alikes, I also delete "apostrophe" from the permalink to prevent cluttered output.

Thanks a lot for your interest.

2009/8/29 Laura Creighton <lac@openend.se>
Hi Firat (and everybody else).

I forwarded your article to the diversity mailing list (diversity@python.org)
where we are taking about how to make things more accessible for speakers
of non-English, and non-European languages.  I got this back off list.
Hope it helps.


To:      Laura Creighton <lac@openend.se>
From:    James Bennett <ubernostrum@gmail.com>
Subject: Re: [Diversity] This showed up in docutils-develop


(off-list since it's more docutils-related, but I'm not currently
subscribed to that one)

There's a little utility function in Django that does the right thing
here ("Sözlük Öğelerini Alfabe Sırasına Dizmek" becomes
"sozluk-ogelerini-alfabe-srasna-dizmek"). It's literally just a couple
lines of code to do the conversion, and is just relying on the stdlib
unicodedata module:


If someone in docutils/Sphinx would like to drop that, or a functional
equivalent, in, it should solve this pretty easily.