From: Abbie G. <ag...@th...> - 2003-02-05 21:17:54

Hi all, I'm up and running, well sort of.

I did install HTDig and pointed it towards a folder with just .html files in it to test it... and voila, got results. I actually need this to index PDF files, though.

I have so far done the following:

Added to htdig.conf (see http://www.htdig.org/attrs.html#external_parsers):
external_parsers: application/pdf->text/html /opt/www/htdig/bin/doc2html/doc2html.pl

Installed the xpdf RPM. Installed the doc2html directory and scripts. Set the paths for pdftotext and pdfinfo, as well as setting the path in doc2html.pl for the pdf2text.pl script. I checked the largest file size of a PDF and increased the max file size in htdig.conf as well.

I run ./rundig -v and it indexes the one HTML document that I have at the top level. All permissions on the files are fine; I actually set them to 777 to make sure it could get into the folders. But it doesn't want to index the PDFs... any ideas? I don't receive any error messages either.

My file setup is /archives/folder/folder... etc. I set the htdig start_url to http://192.168.0.25/archives/. I've tried moving a .pdf to the /archives folder, but that doesn't work either.

Thanks!

Abbie
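
One way to narrow a problem like this down is to test each stage of the chain by hand before re-running the dig. The sketch below is not from the thread; the sample paths, config location, and file names are assumptions based on the setup described above.

```sh
# Sketch only -- every path and file name here is an assumption.

# 1. Can xpdf's pdftotext extract text from one of the archive PDFs?
pdftotext /archives/somefolder/sample.pdf - | head

# 2. Is the PDF smaller than max_doc_size in htdig.conf?
ls -l /archives/somefolder/sample.pdf
grep max_doc_size /opt/www/htdig/conf/htdig.conf

# 3. htdig only follows links, so some indexed page must actually link
#    to the PDFs (a plain directory listing is usually enough).
wget -q -O - http://192.168.0.25/archives/ | grep -i '\.pdf'

# 4. Re-run the dig verbosely and look for the PDF URLs in the output.
./rundig -v > rundig.log 2>&1
grep -i '\.pdf' rundig.log
```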

From: Lachlan A. <lh...@us...> - 2003-02-05 13:06:36

Greetings Andy,

Try the attached patch. It is very rough-and-ready (since I'm in the process of also trying to add other functionality) but it should get you going. Let me know how you get on.

Cheers,
Lachlan

On Friday 31 January 2003 07:49, And...@wi... wrote:
> ...it would not find on double colons...

From: Lachlan A. <lh...@us...> - 2003-02-05 11:03:06

On Wednesday 05 February 2003 10:45, Neal Richter wrote:
> What compiler is everyone using these days?

gcc version 3.2 (Mandrake Linux 9.0 3.2-1mdk)

From: Gabriele B. <bar...@in...> - 2003-02-04 23:50:30

At 14.21 04/02/2003 -0500, Abbie Greene wrote:
>System: RedHat Linux 8.0

You need these RPMs for backward compatibility (even though you'll keep getting some warnings - see FAQ http://www.htdig.org/FAQ.html#q3.8):

compat-libstdc++-7.3-2.96.110.i386.rpm
compat-libstdc++-devel-7.3-2.96.110.i386.rpm

Hope this helps. Ciao,
-Gabriele

--
Gabriele Bartolini - Web Programmer - ht://Dig & IWA Member - ht://Check maintainer
Current Location: Prato, Tuscany, Italia
bar...@in... | http://www.prato.linux.it/~gbartolini | ICQ#129221447

From: Neal R. <ne...@ri...> - 2003-02-04 23:43:49

What compiler is everyone using these days? Just curious.

Neal Richter
Knowledgebase Developer
RightNow Technologies, Inc.
Customer Service for Every Web Site
Office: 406-522-1485

From: Neal R. <ne...@ri...> - 2003-02-04 23:34:37

Abbie,

FYI: There are two ways to avoid deprecated messages.

Either use 'g++ -Wno-deprecated':
  option 1: CXXFLAGS="-Wno-deprecated" ./configure
  option 2: make clean; CXXFLAGS="-Wno-deprecated" make

or install and use the older compiler suite on Redhat 8.0. Get them on your CDs or at www.rpmfind.net:
  compat-libstdc++-7.3-2.96.110.i386.html
  compat-libstdc++-devel-7.3-2.96.110.i386.html
  compat-gcc-objc-7.3-2.96.110.i386.html
  compat-gcc-c++-7.3-2.96.110.i386.html
  compat-gcc-7.3-2.96.110.i386.html
(this provides you with the gcc296 & g++296 executables)
  option 1: CXX="g++296" ./configure
  option 2: make clean; CXX="g++296" make

On Wed, 5 Feb 2003, Lachlan Andrew wrote:
> Greetings Abbie,
>
> ht://Dig uses a style of C++ which gcc 3.2 considers "deprecated",
> which may be causing your problem.
>
> What does your config.log file say?
>
> Cheers,
> Lachlan
>
> On Wednesday 05 February 2003 06:21, Abbie Greene wrote:
> > Configure: error: To compile ht://Dig, you will need a C++ library.
> > Try installing libstdc++. Isn't gcc a c++ compiler?
> >
> > ...I checked and I already have libstdc++-3.2-7.i386.rpm installed

Neal Richter
Knowledgebase Developer
RightNow Technologies, Inc.
Customer Service for Every Web Site
Office: 406-522-1485
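
For anyone following along, the second option above boils down to a short rebuild sequence. This is a sketch, not from the thread: it assumes the compat packages have been downloaded as .rpm files into the current directory and that htdig-3.1.6 is the freshly untarred source tree.

```sh
# Sketch of the "older compiler suite" route -- package and directory
# names are assumptions, adjust to what you actually downloaded.
rpm -Uvh compat-gcc-7.3-2.96.110.i386.rpm \
         compat-gcc-c++-7.3-2.96.110.i386.rpm \
         compat-libstdc++-7.3-2.96.110.i386.rpm \
         compat-libstdc++-devel-7.3-2.96.110.i386.rpm

cd htdig-3.1.6
CXX="g++296" ./configure
make
```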

From: Lachlan A. <lac...@ip...> - 2003-02-04 21:17:24

Greetings Abbie,

ht://Dig uses a style of C++ which gcc 3.2 considers "deprecated", which may be causing your problem.

What does your config.log file say?

Cheers,
Lachlan

On Wednesday 05 February 2003 06:21, Abbie Greene wrote:
> Configure: error: To compile ht://Dig, you will need a C++ library.
> Try installing libstdc++. Isn't gcc a c++ compiler?
>
> ...I checked and I already have libstdc++-3.2-7.i386.rpm installed

From: Abbie G. <ag...@th...> - 2003-02-04 19:21:20

System: RedHat Linux 8.0
Installed gcc packages: Gcc-g77-3.2-7, Gcc-java-3.2-7, Gcc-c++-3.2-7, gcc-3.2-7, gcc-objc-3.2-7, gcc-gnat-3.2-7

I originally tried installing the htdig RPM, but I honestly couldn't find half the install files... so I decided it'd be better to install the tar.

I downloaded htdig-3.1.6 and began to follow the install instructions. Ran ./configure and when it finished I received the following error:

    configure: error: To compile ht://Dig, you will need a C++ library. Try installing libstdc++.

Isn't gcc a c++ compiler? ...I checked and I already have libstdc++-3.2-7.i386.rpm installed.

I'm really a newbie at this, so any help is truly appreciated! Oh, and I tried a make after, just for the heck of it, and it returns:

    make: *** No targets specified and no makefile found. Stop.

Thanks,
Abbie
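
When configure aborts like this, the quickest way to see what it actually tried is config.log plus a check of which compiler packages are installed. These are generic checks, not from the thread; the package names match the Red Hat 8.0 RPMs discussed elsewhere on this page.

```sh
# Generic diagnosis sketch -- not from the thread.

# Which C++ compiler and runtime packages are actually installed?
g++ --version
rpm -qa | grep -Ei 'gcc|libstdc'

# config.log records the exact test program and error message behind
# "you will need a C++ library"; the tail is usually enough to see it.
tail -n 50 config.log
```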

From: Gabriele B. <bar...@in...> - 2003-02-04 15:21:52

>At the time, it didn't correctly compile C++ code. While it sounds like
>that's fixed, I guess I'm also just trying to say that we have plenty of
>OS X testers. :-)

Got it (sorry but my english is not that fine and I need further explanations! :-P )

Ciao ciao,
-Gabriele

--
Gabriele Bartolini - Web Programmer - ht://Dig & IWA Member - ht://Check maintainer
Current Location: Prato, Tuscany, Italia
an...@ti... | http://www.prato.linux.it/~gbartolini | ICQ#129221447

From: Geoff H. <ghu...@ws...> - 2003-02-04 15:10:51

On Monday, February 3, 2003, at 12:48 PM, Gabriele Bartolini wrote:
> What kind of problem have you had specifically, Geoff?

At the time, it didn't correctly compile C++ code. While it sounds like that's fixed, I guess I'm also just trying to say that we have plenty of OS X testers.

-Geoff

From: Gabriele B. <bar...@in...> - 2003-02-04 14:41:56

Ciao Ted,

>I spent some time working on splitting the defaults.xml document into
>individual documents and wanted to share my progress with the rest of you.
>Unfortunately, it's a shorter path for me to learn XSLT than Perl so that
>is currently how these files were produced.

Just one note. I think that the list on the left is too long and rather unaccessible (if I wanna go and reach the 'w' letter it is a problem); unfortunately, sorry, I can't find any other way of doing it except keeping a frameset. Other solutions require javascript and a self-expanding tree (personal opinion? I hate it!), or navigation bars and anchors.

Also, try and use the '/' for an empty tag with a space before, like <br /> instead of <br/>, which is not always correctly recognised.

Another suggestion is not to use <b> and <i>, but replace them with 'strong' and 'em' (which give more the idea of the structure instead of the presentation).

However I am sorry I can't help you right now, but I'll try to keep myself updated with your work.

Thanks a lot and Ciao!
-Gabriele

--
Gabriele Bartolini - Web Programmer - ht://Dig & IWA Member - ht://Check maintainer
Current Location: Prato, Tuscany, Italia
an...@ti... | http://www.prato.linux.it/~gbartolini | ICQ#129221447

From: Robert K. <Rob...@so...> - 2003-02-04 14:02:21

Hi there!

I have htdig 3.1.6 on Apache. Htsearch works fine when called from the command line, but when called from a browser the result is this:

    Internal Server Error
    The server encountered an internal error or misconfiguration and was unable to complete your request.
    Please contact the server administrator, web...@sw..., and inform them of the time the error occurred, and anything you might have done that may have caused the error.
    More information about this error may be available in the server error log.
    Apache/1.3.26 Server at www.swlmx.de Port 80

I have already seen this problem in the FAQs, but none of the suggested solutions solved my problem. Can you help?

Regards,
_______________________________________________
Robert Kerschner
Softwarelandschaft Unternehmensberatung GmbH
Plenkerstraße 17
A-3340 Waidhofen/Ybbs
Tel. +43 (7442) 54 124-38
Fax +43 (7442) 54 124-10
E-Mail: rob...@so...
Internet: www.softwarelandschaft.at
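
A 500 from htsearch is usually explained by the Apache error log or by a shared-library/permissions problem on the CGI binary. The commands below are generic first checks, not from the thread; the log and cgi-bin paths are assumptions for a typical Apache 1.3 layout.

```sh
# Generic first checks for an htsearch "Internal Server Error" -- paths
# are assumptions, adjust to your server layout.

# The real cause is normally spelled out in the error log.
tail -n 20 /var/log/httpd/error_log

# htsearch must live under ScriptAlias and be executable by the Apache user.
ls -l /var/www/cgi-bin/htsearch

# Running it by hand often reveals a missing shared library (a common
# cause on Red Hat 8.0 with gcc 3.2) or an unreadable config file.
ldd /var/www/cgi-bin/htsearch
/var/www/cgi-bin/htsearch < /dev/null
```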

From: Ted Stresen-R. <ted...@ma...> - 2003-02-04 08:39:37

Hi,

I spent some time working on splitting the defaults.xml document into individual documents and wanted to share my progress with the rest of you. Unfortunately, it's a shorter path for me to learn XSLT than Perl so that is currently how these files were produced. I had to make a very minor change to the defaults.dtd document Brian sent me (one of the attribute types was "numeric" and my xslt parser was complaining that that wasn't acceptable).

Here's where you can see the results from the first night of working with it. If feedback is positive, I'll keep sharing. If not, I'll just keep it to myself as a pet project ;-)

http://www.tedmasterweb.com/htdig/

Ted

From: Brian W. <bw...@st...> - 2003-02-03 23:08:33

Sorry - I have been a bit flat out and unable to help. I will make some time later today to dig out everything I have and give a final opinion, but I will have to back out after this. Sorry.

Regs,
Brian

At 03:47 PM 03/02/2003, Ted Stresen-Reuter wrote:
>Brian,
>
>I'm still trying to get something productive done on splitting the
>attributes into separate html files. Although the files you sent were
>helpful, I think the one I'm really missing is the manage_attributes.pl
>(or whichever one actually does the splitting using regular expressions).
>If you have a moment, could you please send that one? It doesn't appear to
>be a part of the most recent snapshots and would be very helpful.
>
>Also, I'm still new to XML so pardon what may be a stupid question, but,
>rather than writing your own DTD, would it be possible for us to borrow
>the DTD for XHTML and then modify that to meet our needs (adding our own
>custom elements)? That way we could include all types of html in the
>documentation (the examples, specifically).
>
>If the problem with including the HTML in the examples has to do with the
>regular expressions that parse the defaults.xml file, I could take a look
>at that... I'm pretty good with regex and might know a trick or two that
>could fix that problem.
>
>Thanks and I look forward to hearing from you.
>
>Ted Stresen-Reuter

-------------------------
Brian White
Step Two Designs Pty Ltd
Knowledge Management Consultancy, SGML & XML
Phone: +612-93197901
Web:   http://www.steptwo.com.au/
Email: bw...@st...

Content Management Requirements Toolkit
112 CMS requirements, ready to cut-and-paste

From: Neal R. <ne...@ri...> - 2003-02-03 22:38:57

On Tue, 4 Feb 2003, Lachlan Andrew wrote:
> On Saturday 01 February 2003 11:25, Neal Richter wrote:
> > If your error is repeatable, can you test it with
> > wordlist_compress_zlib & wordlist_compress disabled and re-run
> > htpurge? I'd like to see if the error still appears.
>
> The diagnostics only appear if compression is enabled.
>
> It takes over 24 hours (which becomes two days...) for the whole dig,
> and I'll try to find a smaller data set that causes the problem. In
> the mean time, are there any other tests I can do to try to track it
> down? For example, is it possible to decompress the file offline to
> compare it against the uncompressed one?

Other than using htdump on both the zlib compressed WordDB and the uncompressed WordDB I can't think of one. If there are differences in the dumps we could then see what effect it has on searching. If there are no differences I would lean towards a bug in htpurge. You may be able to hack htdump later to only uncompress the BDB pages in question.

Have you stepped through the htpurge code to see when/how this happens?

I am skeptical that it's caused by the zlib compression code because:

1) The changes to mp_cmpr are very minor to enable zlib compression and involve no changes to the input data that would otherwise go to the mifluz page compressor.

2) If the compressed data was getting corrupted, zlib should report an error on the uncompress of that data page.

http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/htdig/htdig/db/mp_cmpr.c.diff?r1=1.2&r2=1.3

Thanks.. I appreciate your effort on this. Once you get the htdumps tested for differences, please report back!

Neal Richter
Knowledgebase Developer
RightNow Technologies, Inc.
Customer Service for Every Web Site
Office: 406-522-1485
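
A possible shape for that comparison, as a sketch only: it assumes htdump takes the same -c configfile option as the other ht://Dig tools, that two otherwise-identical configs exist (one with the compression attributes enabled, one without), and that the database directories and dump file names below are placeholders for whatever htdump actually writes.

```sh
# Sketch only -- config names, database directories, and dump file names
# are placeholders; the -c option is assumed to work as in htdig/htpurge.

htdump -c zlib.conf        # dig done with wordlist_compress_zlib enabled
htdump -c nozlib.conf      # same dig with both compression attributes off

# Compare the ASCII word dumps: a difference points at where data is
# lost or corrupted, no difference points back at htpurge itself.
sort /var/htdig-zlib/word.dump   > zlib.sorted
sort /var/htdig-nozlib/word.dump > nozlib.sorted
diff zlib.sorted nozlib.sorted | head
```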

From: Lachlan A. <lh...@us...> - 2003-02-03 22:15:37

On Saturday 01 February 2003 11:25, Neal Richter wrote:
> If your error is repeatable, can you test it with
> wordlist_compress_zlib & wordlist_compress disabled and re-run
> htpurge? I'd like to see if the error still appears.

The diagnostics only appear if compression is enabled.

It takes over 24 hours (which becomes two days...) for the whole dig, and I'll try to find a smaller data set that causes the problem. In the mean time, are there any other tests I can do to try to track it down? For example, is it possible to decompress the file offline to compare it against the uncompressed one?

Cheers,
Lachlan

From: Gabriele B. <bar...@in...> - 2003-02-03 18:49:20

At 08.37 03/02/2003 -0600, Geoff Hutchison wrote:
>Just a note that I've had strange problems with the MacOS X 10.1 node on
>the compile farm. Since there seem to be several of us testing on various
>Macs, I think there would be plenty of noise if something broke on OS X.

What kind of problem have you had specifically, Geoff?

Ciao,
-Gabriele

--
Gabriele Bartolini - Web Programmer - ht://Dig & IWA Member - ht://Check maintainer
Current Location: Prato, Tuscany, Italia
an...@ti... | http://www.prato.linux.it/~gbartolini | ICQ#129221447

From: Ted Stresen-R. <ted...@ma...> - 2003-02-03 15:25:39

> defaults.xml either needs to shape up soon

Could you provide some details on what it needs, on what's missing?

Ted Stresen-Reuter

From: Geoff H. <ghu...@ws...> - 2003-02-03 14:55:43

> Is there a list of tasks which *must* be completed before the release
> of 3.2.0b4/5? If the "STATUS" file is that list, can I suggest that
> some things be classed as "not essential" (at least defaults.xml, and
> preferably most of it)?

The STATUS file is the list, though it's intended to be updated by other developers too! I've put everything in the STATUS list, though I agree that defaults.xml either needs to shape up soon or be left out of 3.2.0b5. I would definitely say that this zlib compression issue is a "showstopper" at the moment.

> Sorry for sounding impatient, and I know that everyone is busy, but it

No, good for you to bring this up and make some noise. :-)

Why don't you propose a list of what you think is essential for 3.2.0b5.

-Geoff

From: Geoff H. <ghu...@ws...> - 2003-02-03 14:51:17

> I'm wondering if we couldn't add a String to the Dictionary class and
> use that instead of doing a malloc/strcpy everytime.. this function is
> called jillions of times.

That's probably a good idea.

> I'm also curious as to why not use Knuth's golden ratio hash function,
> it's a well studied and known-good hash.
>
> I can make the change and test it.. I'm just curious about the rationale
> for both the original hash function and the change.

No one really cares *what* this hash function is. I certainly don't care if it's "well studied" if it works. The "original" hash you point to was one I derived in the 3.1 betas that seemed to show the fewest bad hashes on as many test sets as I could find. One catch to this hash function is that it sees a lot of URLs, which means that "classic" hash functions don't always work well.

The change did seem to show performance improvements for me, though I'll admit I didn't count the number of bad hashes. I've seen lots of "well it works" hash functions used in other situations, which is what I'd say about any replacements, Knuth or not.

-Geoff

From: Geoff H. <ghu...@ws...> - 2003-02-03 14:37:56

> Everything is ok on all Linux on all platforms (i686, Alpha, Sparc);
> MacOS X 10.1 still has that problem with shared libraries (as it was before)
> whereas Solaris on a Sparc R220 doesn't go.

Just a note that I've had strange problems with the MacOS X 10.1 node on the compile farm. Since there seem to be several of us testing on various Macs, I think there would be plenty of noise if something broke on OS X.

-Geoff

From: Geoff H. <ghu...@ws...> - 2003-02-03 14:17:46

> Could someone who knows what exact: and hidden: mean please
> explain what they are for (and/or document them officially)? I don't
> want to break anything while trying to fix the bug.

These are fuzzy algorithms essentially. You could have endings:blah. You're right that it's undocumented, and it should probably be taken out of the parser. (Nice idea to have per-word fuzzy possibilities, but maybe not the right way to do it.)

> On a related note, does anyone have any ideas for the syntax of "field
> restricted" searches? I was thinking of something like "title:word"
> to search for "word" in the title field, or "heading:word" etc. Was
> the plan to allow user-defined fields in meta-data to be searched?

Well, this is the "normal" syntax used by other sites. As for user-defined fields, it's certainly an ultimate goal but I think it's more important to:
a) get the new parser running.
b) get typical field-restricted searches like "title:word" going.

-Geoff

From: Geoff H. <ghu...@ws...> - 2003-02-03 14:10:42

> Also, I'm still new to XML so pardon what may be a stupid question,
> but, rather than writing your own DTD, would it be possible for us to
> borrow the DTD for XHTML and then modify that to meet our needs
> (adding our own custom elements)? That way we could include all types
> of html in the documentation (the examples, specifically).

But this is exactly "writing your own DTD." The point of an XML type is that it can be validated, so if you're using XHTML, it doesn't have "custom elements." What you're arguing about is the syntax of our custom DTD.

I'm not sure we really want all of XHTML in our DTD. It seems more like we want something pretty flexible, but maybe more like DocBook than XHTML.

-Geoff

From: Ted Stresen-R. <ted...@ma...> - 2003-02-03 04:47:40

Brian,

I'm still trying to get something productive done on splitting the attributes into separate html files. Although the files you sent were helpful, I think the one I'm really missing is the manage_attributes.pl (or whichever one actually does the splitting using regular expressions). If you have a moment, could you please send that one? It doesn't appear to be a part of the most recent snapshots and would be very helpful.

Also, I'm still new to XML so pardon what may be a stupid question, but, rather than writing your own DTD, would it be possible for us to borrow the DTD for XHTML and then modify that to meet our needs (adding our own custom elements)? That way we could include all types of html in the documentation (the examples, specifically).

If the problem with including the HTML in the examples has to do with the regular expressions that parse the defaults.xml file, I could take a look at that... I'm pretty good with regex and might know a trick or two that could fix that problem.

Thanks and I look forward to hearing from you.

Ted Stresen-Reuter

From: Neal R. <ne...@ri...> - 2003-02-02 18:12:22

Hmm. I've been using a nice stable 2.95.3 gcc (it's our production compiler at the moment). Interesting. I'll have to test it on 3.1.. I'd like to see what it does with the wordkey WordDB inline functions.

FYI:
objdump --disassemble -S ./exe > exe.S
will give you assembly with C source code lines interleaved (make sure to use -g on compile).

> format will give more bang for your development buck. It has the
> added advantages that it uses less disk space, and that it speeds up
> searching (assuming that is also disk bound). However, I am really
> keen to get 3.2.0b5 out and I personally won't be working on
> optimisations until then. Thoughts?

Yep.. it's about time to start getting the WordDB more efficient. I'll post a few more patches (of small isolated changes) this week that are well tested and we can decide whether to include them in 3.2.0b5. I've also got changes available to make it compile native win32..

Thanks.

Neal Richter
Knowledgebase Developer
RightNow Technologies, Inc.
Customer Service for Every Web Site
Office: 406-522-1485