From: Bernier, M. <mel...@sd...> - 2001-11-12 13:47:17
> Hi guys!
>
> I have installed htdig and I have a little problem with German umlauts. I
> can search for words with umlauts without any problem. When I search for,
> say, 'C34644' (a file containing umlauts), the results from htdig come back
> with strange characters instead of the umlauts (for example, I get a circle (Ø)
> instead of ü, or I get a big Ä instead of a small ä), and it seems to
> return that kind of result only for Word documents. What could be the
> problem?
>
> In order to use the German dictionary, in my htdig.conf I have the following:
>
> ispell_dir=/usr/lib/ispell
>
> # to use german dictionary
> locale: de_DE
> lang_dir: ${ispell_dir}
> endings_affix_file: ${lang_dir}/german.aff
> endings_dictionary: ${lang_dir}/german.0
>
In order to parse Word documents, in my htdig.conf I have the following:
# to convert different binary documents (.doc, .ps, .pdf, .xls, .ppt)
#doc2html=/usr/local/bin/doc2html.pl
external_parsers: application/pdf->text/html
/usr/local/bin/doc2html.pl \
application/postscript->text/html
/usr/local/bin/doc2htm
application/msword->text/html
/usr/local/bin/doc2html.pl
application/msexcel->text/html
/usr/local/bin/doc2html.p
application/vnd.ms-excel->text/html
/usr/local/bin/doc2h
application/vnd.ms-powerpoint->text/html
/usr/local/bin/
> By the way, I am using htdig on the SuSE 7.2 Linux distribution.
>
> Could you please reply by email, since I am not subscribed to the mailing
> list?
>
> Thanks in advance for any help.
>
> Melanie.