From: Tony S. <ton...@gm...> - 2005-06-30 11:21:18
|
Sorry isnt this just for character encoding conversion? I really want to constrain my dataset to the latin 1 codepage (or a suitable one for this windows app to render, or 7bit ASCII for freetext indexing) which requires transliteration? I'm new to this so may well be getting confused. For example if I choose ISO8859-1 (or windows-1252) output encoding for my XML, then 0x80 - 0xff characters are outputted along with the 7bit ASCII ones. However everything above this is encoded as &#ddd; which then won't be rendered. Thanks Tony On 6/29/05, Markus Scherer <mar...@gm...> wrote: > I _think_ what you are looking for is to use an ICU converter for > ISO-8859-1 (for example) and set the "escape" callback with the option > of producing XML numeric character references. >=20 > However, please consider that Windows internally works entirely in > Unicode, and modern Windows applications do so, too. You might find > that you need not convert out of Unicode, and if you needn't you > shouldn't. >=20 > markus >=20 > On 6/29/05, Tony Scerri <ton...@gm...> wrote: > > Yes I do have XML encoded as UTF-8 (or anything encoding scheme). >=20 > -- > Opinions expressed here may not reflect my company's positions unless > otherwise noted. >=20 >=20 > ------------------------------------------------------- > SF.Net email is sponsored by: Discover Easy Linux Migration Strategies > from IBM. Find simple to follow Roadmaps, straightforward articles, > informative Webcasts and more! Get everything you need to get up to > speed, fast. http://ads.osdn.com/?ad_idt77&alloc_id=16492&opclick > _______________________________________________ > icu-support mailing list - icu...@li... > To Un/Subscribe: https://lists.sourceforge.net/lists/listinfo/icu-support > |