From: Ionut N. <io...@ef...> - 2001-11-28 18:47:00
> The "right thing" to do would be to either not decode SGML entities
> at all, but somehow compensate for that in the word matching, or to
> decode all standard or proposed entities UNAMBIGUOUSLY so that you can
> map them back correctly in htsearch. This means not being limited to
> 256 characters in a single byte. htsearch would then have to be aware
> of the encoding used on output, and map the characters to the correct
> single character or SGML encoding as appropriate.

Thanks a lot Gilles, I'll keep monitoring the list for news on that.

Ionut Nistor
io...@ef...