From: Jeff D. <da...@da...> - 2001-06-08 17:59:56
|
>Does & still have to be a magic character if it is only generated on output? I am not advocating writing (entering data in the form) entitized >-- this is broken and counter-productive; only displaying entitized, to >allow a wider audience with less breakage. This is _almost_ a non-issue >with Latin-1, since most browsers render that by default, but on >non-Latin-1 charsets (and thus, locales and languages), this _will_ be an >issue. Is this besides the point? Okay, I see that perhaps I misunderstood your initial suggestion. Stock PhpWiki really only supports ISO-8859-1. (Though I suppose it would be easy enough to hack it to support any other single eight-bit character set.) PhpWiki specifies the charset in a <meta http-equiv> tag in the HTML headers on each and every page: <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> As you note, the charset currently is not specified in the HTTP headers. If that is causing problems with some browsers (is it?), that's easy enough to fix (we should probably fix that anyway). I suppose entitizing upon output as you suggest doesn't hurt anything, but it still seems unnecessary to me. Is anyone using PhpWiki with any other non ISO-8859-1 eight-bit charset? (I know there have been many requests for multi-byte character support, but that's not an easy fix. I think that pretty much requires switching to using unicode/UTF-8 internally, and this won't be practical without unicode support compiled into PHP and its regexp libraries.) |