From: Tatu S. <cow...@ya...> - 2006-11-03 05:16:57
|
--- Jimmy Zhang <cra...@co...> wrote: > Tatu, woodstox still has to recognize various > encoding delcarations right,, > the question is really what kind of encoding > declaration a parser has to > recognize.... for ascii, Well, not necessarily, since I can just try to construct JDK-based decoder (as InputStreamReader) with the given encoding. However: > one can have encoding="ascii" or > encoding="us-ascii".. does woodstox > support both? Yes. I do check for some of aliases (from the linked-to IANA page). > what about for windows encoding? cp-1250 or cp1250 Those I defer to JDK. ;-) But any encodings Woodstox natively supports, I do use loose comparison, which essentially strips out non-alphanum characters (spaces, dashes, underscores), and does case-insensitive comparison. So cp-1250 and cp1250 are considered equivalent. -+ Tatu +- ____________________________________________________________________________________ Everyone is raving about the all-new Yahoo! Mail (http://advision.webevents.yahoo.com/mailbeta/) |