From: Markus F. <m.f...@gm...> - 2002-08-13 08:37:04
Hello,

I'm using htdig version 3.1.6 on Solaris. My problem is that htdig has to index a German website that is entirely UTF-8 encoded. htdig creates db.wordlist, but since it does not recognize the German umlauts, it splits words at them, e.g. "europäisch" is split into "europ" and "isch". As a result, you cannot search for words containing umlauts.

I used htdig.conf with:

translate_latin1: false
translate_lt_gt: false
translate_quot: false
translate_amp: false
locale: de.UTF-8@euro

The locale on Solaris is also set to de.UTF-8@euro.

The htdig website shows on its TODO list that they are working on "Better Internationalization - Support for UTF-8". Switching the website to ISO-8859-1 is not possible.

Has anyone had the same problem and solved it?

Thanks,
Markus
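
For anyone trying to reproduce this: the split is consistent with a tokenizer that works byte by byte and does not classify the two UTF-8 bytes of "ä" (0xC3 0xA4) as letters. The Python sketch below only illustrates that assumption. It is not htdig code, and `split_ascii_words` is a hypothetical helper that treats every byte outside A-Z, a-z, 0-9 as a separator.

```python
import re


def split_ascii_words(raw):
    """Hypothetical stand-in for a byte-oriented word splitter.

    Assumption: every byte that is not an ASCII letter or digit
    ends the current word. This is not htdig's actual code.
    """
    return [w for w in re.split(rb"[^A-Za-z0-9]+", raw) if w]


word = "europ\u00e4isch"                   # "europäisch"
utf8_bytes = word.encode("utf-8")          # 'ä' becomes two bytes: 0xC3 0xA4
latin1_bytes = word.encode("iso-8859-1")   # 'ä' is a single byte: 0xE4

pieces = split_ascii_words(utf8_bytes)
print(utf8_bytes)   # b'europ\xc3\xa4isch'
print(pieces)       # [b'europ', b'isch']
```

In ISO-8859-1 the same umlaut is the single byte 0xE4, which a Latin-1 locale can classify as a letter. In UTF-8 it is a two-byte sequence, so a splitter that looks at one byte at a time would see two unknown bytes in the middle of the word.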