|
From: David G. <go...@py...> - 2012-10-08 19:04:32
|
On Mon, Oct 8, 2012 at 4:37 AM, Stefan Merten <sm...@oe...> wrote: > Hi! > > Last week (7 days ago) SourceForge net wrote: >> Bugs item #3573416, was opened at 2012-10-01 03:42 >> Message generated for change (Tracker Item Submitted) made by normanjaeckel >> You can respond by visiting: >> https://sourceforge.net/tracker/?func=detail&atid=422030&aid=3573416&group_id=38414 > [...] >> Initial Comment: >> The function make_id() in nodes.py converts a string into an >> identifier. Therefor some non-ascii-chars were escaped or changed. >> In German the normal way to change umlauts and eszett is: ä -> ae; ö >> -> oe, ü -ue, ß -> ss. Docutils does ä -> a, ö -> o, ü -> u, ß -> >> sz, but this is wrong. >> >> The dict _non_id_translate_digraphs should be changed (0x00df: >> u'ss', # ligature sz/ss) and updated (0x00e4: u'ae', 0x00f6: u'oe', >> 0x00fc: u'ue'). > > Well, I didn't look into this really. I understand that this > suggestion will change the way ids are generated for HTML. > > I think this is really a bad idea. I for one love to use deep links > into documents and for reStructuredText this means that I'm using the > ids generated for one document often in other documents. After all > that is what hypertext is all about. > > If you now change the way ids are generated then external deep links > break. This is particularly bad in scenarios where the target document > is likely to be regenerated - for instance because it is part of a > Plone site or a MoinMoin Wiki. > > Therefore I'd suggest to keep the id generation as is and refuse this > bug. Agreed. Also, languages don't all transcribe accents the same way. We shouldn't give preference to one language over others. -- David Goodger <http://python.net/~goodger> |