From: Bernier, M. <mel...@sd...> - 2001-11-12 13:47:17
> Hi guys!
>
> I have installed htdig and I have a little problem with German umlauts. I
> can search for words with umlauts without any problem. When I search for,
> say, 'C34644' (a file containing umlauts), the results from htdig come back
> with strange characters instead of umlauts (for example, I get a circle (Ø)
> instead of ü, or I get a big Ä instead of a small ä), and it seems to
> return that kind of result only for Word documents. What could be the
> problem?
>
> In order to use the German dictionary, in my htdig.conf I have the following:
>
> ispell_dir=/usr/lib/ispell
>
> # to use german dictionary
> locale: de_DE
> lang_dir: ${ispell_dir}
> endings_affix_file: ${lang_dir}/german.aff
> endings_dictionary: ${lang_dir}/german.0
>

In order to parse Word documents, in my htdig.conf I have the following:

# to convert different binary documents (.doc, .ps, .pdf, .xls, .ppt)
#doc2html=/usr/local/bin/doc2html.pl
external_parsers: application/pdf->text/html /usr/local/bin/doc2html.pl \
        application/postscript->text/html /usr/local/bin/doc2htm
        application/msword->text/html /usr/local/bin/doc2html.pl
        application/msexcel->text/html /usr/local/bin/doc2html.p
        application/vnd.ms-excel->text/html /usr/local/bin/doc2h
        application/vnd.ms-powerpoint->text/html /usr/local/bin/

> By the way, I am using htdig on the Suse 7.2 Linux distribution.
>
> Could you please reply by email, since I am not subscribed to the mailing
> list?
>
> Thanks in advance for any help.
>
> Melanie.