From: Rik F. <fa...@di...> - 2003-05-05 09:31:52
|
On Sat 3 May 2003 08:23:04 +0200, ho...@fr... <ho...@fr...> wrote: > On 29 Apr, Rik Faith wrote: > > On Tue 29 Apr 2003 09:54:16 +0200, > > Eyermann Horst ICM Bocholt <hor...@si...> wrote: > >> Step 3: Choose your Entity Sets > >> do we need any of them, or is utf-8 just fine? > > > > Is there anything that utf-8 doesn't capture? If someone uses other > > entities, is it easy to convert them to utf-8? Is this a readability > > issue? > > Well, the entities are just other ways of displaying the > characters. "& / < / >" are for "& / < / >" > there are all other kind of entities for non ASCII characters, > for example in german ö is ö (o") - but as we want to suppor > all kind of languages (and IPA phonetic characters), I think we > should either support everything, or nothing, to make it > consistent. (XML requires/recommends &, <, and > to be escaped.) In this case, I would advocate supporting all entities in the interest of human readability -- there is a lot that can be done with a 7-bit ASCII editor and entities, even though the document is strictly UTF-8. |