From: Stefan M. <sm...@oe...> - 2012-10-08 17:48:43
|
Hi! Last week (7 days ago) SourceForge net wrote: > Bugs item #3573416, was opened at 2012-10-01 03:42 > Message generated for change (Tracker Item Submitted) made by normanjaeckel > You can respond by visiting: > https://sourceforge.net/tracker/?func=detail&atid=422030&aid=3573416&group_id=38414 [...] > Initial Comment: > The function make_id() in nodes.py converts a string into an > identifier. Therefor some non-ascii-chars were escaped or changed. > In German the normal way to change umlauts and eszett is: ä -> ae; ö > -> oe, ü -ue, ß -> ss. Docutils does ä -> a, ö -> o, ü -> u, ß -> > sz, but this is wrong. > > The dict _non_id_translate_digraphs should be changed (0x00df: > u'ss', # ligature sz/ss) and updated (0x00e4: u'ae', 0x00f6: u'oe', > 0x00fc: u'ue'). Well, I didn't look into this really. I understand that this suggestion will change the way ids are generated for HTML. I think this is really a bad idea. I for one love to use deep links into documents and for reStructuredText this means that I'm using the ids generated for one document often in other documents. After all that is what hypertext is all about. If you now change the way ids are generated then external deep links break. This is particularly bad in scenarios where the target document is likely to be regenerated - for instance because it is part of a Plone site or a MoinMoin Wiki. Therefore I'd suggest to keep the id generation as is and refuse this bug. Grüße Stefan |
From: David G. <go...@py...> - 2012-10-08 19:04:32
|
On Mon, Oct 8, 2012 at 4:37 AM, Stefan Merten <sm...@oe...> wrote: > Hi! > > Last week (7 days ago) SourceForge net wrote: >> Bugs item #3573416, was opened at 2012-10-01 03:42 >> Message generated for change (Tracker Item Submitted) made by normanjaeckel >> You can respond by visiting: >> https://sourceforge.net/tracker/?func=detail&atid=422030&aid=3573416&group_id=38414 > [...] >> Initial Comment: >> The function make_id() in nodes.py converts a string into an >> identifier. Therefor some non-ascii-chars were escaped or changed. >> In German the normal way to change umlauts and eszett is: ä -> ae; ö >> -> oe, ü -ue, ß -> ss. Docutils does ä -> a, ö -> o, ü -> u, ß -> >> sz, but this is wrong. >> >> The dict _non_id_translate_digraphs should be changed (0x00df: >> u'ss', # ligature sz/ss) and updated (0x00e4: u'ae', 0x00f6: u'oe', >> 0x00fc: u'ue'). > > Well, I didn't look into this really. I understand that this > suggestion will change the way ids are generated for HTML. > > I think this is really a bad idea. I for one love to use deep links > into documents and for reStructuredText this means that I'm using the > ids generated for one document often in other documents. After all > that is what hypertext is all about. > > If you now change the way ids are generated then external deep links > break. This is particularly bad in scenarios where the target document > is likely to be regenerated - for instance because it is part of a > Plone site or a MoinMoin Wiki. > > Therefore I'd suggest to keep the id generation as is and refuse this > bug. Agreed. Also, languages don't all transcribe accents the same way. We shouldn't give preference to one language over others. -- David Goodger <http://python.net/~goodger> |