From: Anthony J P. <ant...@su...> - 2003-05-22 15:12:22
|
On Thu, 2003-05-22 at 10:48, Patrick Marquetecken wrote: > Hi, > > I'm migration a Windows2000 IIS with Indexserver to Linux. > > IIS to Apache is no problem all went "well". > > But i have a search program on IIS that uses Indexserver and a Adobe -pluging to do a text search in 60,000 pdf files. > > Witch programs must i have on Linux to do the same? > > Any advice would help me > > Patrick I have learned to love phpdig http://phpdig.toiletoine.net/ indexes pdfs docs rtf & text. relevant keywords & link generation are stored in a MySQL database. From their FAQ "PhpDig is a http spider/search engine written in Php with a MySql database in backend. Indexing after indexing, PhpDig builds a glossary with the key words found in indexed pages. On a search query, it displays a result page with documents which contains the search keys, ranked by occurrence." Take a bit of time to build the initial index (overnight on 13 gigs of PDFs for me) but once it's done it's lightning quick for returns -- Tony Placilla ant...@su... perl -e 'print $i=pack(c5, (41*2), sqrt(7056), (unpack(c,H)-2), oct(115), 10);' |