From: Gea-Suan L. <gs...@cc...> - 2004-02-24 14:34:22
|
Hello, I use UTF-8 (for Traditional Chinese) as phpwiki charset, and there are some problem: When phpwiki translates data to html, char 0xA0 will translate to space (0x20). You may see the html source code in http://wiki.abpe.org/index.php/Sand= Box : 00000f60 6e 61 62 6c 65 20 62 6f 74 74 6f 6d 22 3e 55 54 |nable bottom= ">UT| 00000f70 46 2d 38 20 e9 20 81 e9 9d a2 e6 b8 ac e8 a9 a6 |F-8 ?.?=A2=E6= =B8=AC=E8=A9=A6| 00000f80 3c 2f 70 3e 0a 3c 2f 64 69 76 3e 0a 0a 3c 64 69 |</p>.</div>.= .<di| and the original statement: 00000000 55 54 46 2d 38 20 e9 a0 81 e9 9d a2 e6 b8 ac e8 |UTF-8 ?.?=A2= =E6=B8=AC=E8| 00000010 a9 a6 |=A9=A6| --=20 * Gea-Suan Lin (public key: http://ccreader.nctu.edu.tw/~gslin/key.txt) * If you cannot convince them, confuse them. -- Harry S Truman |
From: Reini U. <ru...@x-...> - 2004-02-24 15:55:02
|
Gea-Suan Lin schrieb: > I use UTF-8 (for Traditional Chinese) as phpwiki charset, and there > are some problem: > > When phpwiki translates data to html, char 0xA0 will translate to > space (0x20). > > You may see the html source code in http://wiki.abpe.org/index.php/SandBox : > > 00000f60 6e 61 62 6c 65 20 62 6f 74 74 6f 6d 22 3e 55 54 |nable bottom">UT| > 00000f70 46 2d 38 20 e9 20 81 e9 9d a2 e6 b8 ac e8 a9 a6 |F-8 ?.?X葫閰帆 > 00000f80 3c 2f 70 3e 0a 3c 2f 64 69 76 3e 0a 0a 3c 64 69 |</p>.</div>..<di| > > and the original statement: > > 00000000 55 54 46 2d 38 20 e9 a0 81 e9 9d a2 e6 b8 ac e8 |UTF-8 ?.?X葫鋧 > 00000010 a9 a6 |岫| Hi, This is really interesting, but my possibilities to debug this problem is limited. I have no mule-enabled Xemacs (yet). And with my mule-enabled emacs I just cannot enter these characters. On my screen 0xa0 looks like 0x20. There's no dot. Can that be a PHP problem? Do you know any 0xa0 combination that actually represents another character that doesn't look like a space? Or some hints how to debug this? -- Reini Urban http://xarch.tu-graz.ac.at/home/rurban/ |
From: Gea-Suan L. <gs...@cc...> - 2004-02-24 16:07:34
|
There dots are generated by hexdump, not by php or other else. URL: http://wiki.abpe.org/index.php/SandBox Screenshot: http://netnews.nctu.edu.tw/~gslin/tmp/sandbox.png You can use wget and hexdump to see it about 00000f60 ~ 00000f80: % wget http://wiki.abpe.org/index.php/SandBox % hexdump -C Sandbox : : 00000f60 6e 61 62 6c 65 20 62 6f 74 74 6f 6d 22 3e 55 54 |nable bottom= ">UT| 00000f70 46 2d 38 20 e9 20 81 e9 9d a2 e6 b8 ac e8 a9 a6 |F-8 ?.?=A2=E6= =B8=AC=E8=A9=A6| 00000f80 3c 2f 70 3e 0a 3c 2f 64 69 76 3e 0a 0a 3c 64 69 |</p>.</div>.= .<di| : : On Tue, Feb 24, 2004 at 04:54:11PM +0100, Reini Urban wrote: > Hi, > This is really interesting, but my possibilities to debug this problem=20 > is limited. I have no mule-enabled Xemacs (yet). And with my=20 > mule-enabled emacs I just cannot enter these characters. >=20 > On my screen 0xa0 looks like 0x20. There's no dot. > Can that be a PHP problem? >=20 > Do you know any 0xa0 combination that actually represents another=20 > character that doesn't look like a space? > Or some hints how to debug this? > --=20 > Reini Urban > http://xarch.tu-graz.ac.at/home/rurban/ --=20 * Gea-Suan Lin (public key: http://ccreader.nctu.edu.tw/~gslin/key.txt) * If you cannot convince them, confuse them. -- Harry S Truman |
From: electron <ele...@mg...> - 2004-02-24 16:20:30
|
Lets start by creating a road map From User End: Browser->Post to Web Server (Is the browser converting this?) Server End: Web Server->PHP (Is it here?) PHP->PhpWiki (Here?) PhpWiki Finally, If we get inside PHPWiki: Inside Request? Inside parser? Does it convert when we throw it to the DB? Does it convert when cached? Output: Output Back through server (Here?) Browser display. (Here?) -Jtp ---- Hi, This is really interesting, but my possibilities to debug this problem is limited. I have no mule-enabled Xemacs (yet). And with my mule-enabled emacs I just cannot enter these characters. On my screen 0xa0 looks like 0x20. There's no dot. Can that be a PHP problem? Do you know any 0xa0 combination that actually represents another character that doesn't look like a space? Or some hints how to debug this? -- Reini Urban http://xarch.tu-graz.ac.at/home/rurban/ |