From: Oliver M. B. <ol...@fa...> - 2005-01-04 10:26:59
|
Still clearing my personal patch queue ;-) Slash::Utility::Data::xmlencode() converts all characters that are not ASCII-printable into entities. This is rather ugly and greatly increases data size of RSS feeds for sites that mostly have non-ASCII content. The XML declaration says UTF-8 it the encoding, thus we should be clear to output UTF-8 characters. And if the input to xmlencode() wasn't UTF-8, the entitizing wouldn't help anywy. The patch below is against R_2_5_0_41 and makes this entitizing dependent on a new constant "rss_entitize_nonascii". Please consider it for merge. # I also have a set of patches for a completely UTF-8ized Slashcode, but # it depends on Perl 5.8, a patch to DBD::MySQL and some ugly trickery # and is thus not yet ready for public consumption. If anybody is interested, # I can post some details. -- Oliver M. Bolzer ol...@go... GPG (PGP) Fingerprint = 621B 52F6 2AC1 36DB 8761 018F 8786 87AD EF50 D1FF |