#6 Allow HTML-style character representation for XHTML


Saxon version: 6.5.1

The 'saxon:character-representation' attribute currently accepts only the values 'decimal' and 'hex' for XHTML ('method="saxon:xhtml"'). It would be useful if the same values as for HTML (e.g. 'decimal;decimal') were supported.
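To make the request concrete, here is a minimal sketch of the declaration I would like to be able to write. The namespace URI is the one I understand Saxon 6.x to use for its extensions, and the two-part value is the requested behaviour, not something 6.5.1 accepts for this output method today.

```xml
<?xml version="1.0"?>
<!-- Sketch only: the two-part value below is the behaviour being requested.
     Per this report, Saxon 6.5.1 accepts only 'decimal' or 'hex' here
     when the output method is saxon:xhtml. -->
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:saxon="http://icl.com/saxon">

  <!-- Assumed namespace URI for Saxon 6.x extensions: http://icl.com/saxon -->
  <xsl:output method="saxon:xhtml"
              encoding="UTF-8"
              saxon:character-representation="decimal;decimal"/>

  <!-- Identity template, so the sketch is a complete stylesheet. -->
  <xsl:template match="@*|node()">
    <xsl:copy>
      <xsl:apply-templates select="@*|node()"/>
    </xsl:copy>
  </xsl:template>

</xsl:stylesheet>
```

With such a declaration the output would still be labelled as UTF-8, while non-ASCII characters would be written as decimal character references.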

(Reason for this RFE: I currently write all my documents in XHTML, using UTF-8 as the character encoding. But some browsers and search engines don't support this, so it's better to use numeric character references for non-ASCII characters. But I still want to use the UTF-8 encoding, since US-ASCII support is not strictly required for XML processors. Currently, one solution is to set the 'encoding' in the XSLT style sheet to 'US-ASCII', and then do a search and replace from 'US-ASCII' to 'UTF-8' in the resulting document. But this is obviously not an ideal solution.)
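For reference, the workaround described above looks roughly like this. It is a sketch of my own setup, assuming the usual Saxon 6.x extension namespace; the search-and-replace step happens outside the style sheet and is only noted in a comment.

```xml
<?xml version="1.0"?>
<!-- Workaround sketch: declare US-ASCII so that every non-ASCII character
     is written as a numeric character reference. -->
<xsl:stylesheet version="1.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:saxon="http://icl.com/saxon">

  <!-- Assumed namespace URI for Saxon 6.x extensions: http://icl.com/saxon -->
  <xsl:output method="saxon:xhtml"
              encoding="US-ASCII"
              saxon:character-representation="decimal"/>

  <!-- Identity template, so the sketch is a complete stylesheet. -->
  <xsl:template match="@*|node()">
    <xsl:copy>
      <xsl:apply-templates select="@*|node()"/>
    </xsl:copy>
  </xsl:template>

  <!-- Manual post-processing step, done outside XSLT: replace the encoding
       name 'US-ASCII' with 'UTF-8' wherever it appears in the result. -->

</xsl:stylesheet>
```

The result is a valid UTF-8 document, because pure ASCII is a subset of UTF-8, but it takes an extra pass over every generated file.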

It would also be nice if the semicolon-separated way of choosing the character representation were available for ordinary XML documents too (i.e. for all of 'html', 'xml' and 'saxon:xhtml').

