Re: [Docutils-develop] [ docutils-Bugs-3573416 ] Change transscription of special german chars

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

On Mon, Oct 8, 2012 at 4:37 AM, Stefan Merten <sm...@oe...> wrote:
> Hi!
>
> Last week (7 days ago) SourceForge net wrote:
>> Bugs item #3573416, was opened at 2012-10-01 03:42
>> Message generated for change (Tracker Item Submitted) made by normanjaeckel
>> You can respond by visiting:
>> https://sourceforge.net/tracker/?func=detail&atid=422030&aid=3573416&group_id=38414
> [...]
>> Initial Comment:
>> The function make_id() in nodes.py converts a string into an
>> identifier. Therefor some non-ascii-chars were escaped or changed.
>> In German the normal way to change umlauts and eszett is: ä -> ae; ö
>> -> oe, ü -ue, ß -> ss. Docutils does ä -> a, ö -> o, ü -> u, ß ->
>> sz, but this is wrong.
>>
>> The dict _non_id_translate_digraphs should be changed (0x00df:
>> u'ss', # ligature sz/ss) and updated (0x00e4: u'ae', 0x00f6: u'oe',
>> 0x00fc: u'ue').
>
> Well, I didn't look into this really. I understand that this
> suggestion will change the way ids are generated for HTML.
>
> I think this is really a bad idea. I for one love to use deep links
> into documents and for reStructuredText this means that I'm using the
> ids generated for one document often in other documents. After all
> that is what hypertext is all about.
>
> If you now change the way ids are generated then external deep links
> break. This is particularly bad in scenarios where the target document
> is likely to be regenerated - for instance because it is part of a
> Plone site or a MoinMoin Wiki.
>
> Therefore I'd suggest to keep the id generation as is and refuse this
> bug.

Agreed.

Also, languages don't all transcribe accents the same way. We
shouldn't give preference to one language over others.

-- 
David Goodger <http://python.net/~goodger>