From: Ulf H. <U1...@ul...> - 2004-10-14 09:13:54
>> say one would want to build a full-text index into document manager
>> leveraging a file system search functionality.
>
>> How would you do it?
>>
>> Of course, parsing all docs into a blob upon load would work - but I
>> don't think that would be sexy enough. ;)
>
> The docs would have to be text files unless you've some way of parsing
> Word or PDF or whatever. There are various tools for that already on
> Linux at least that I know of. You'd not want to search all file types.
>
> Maybe do it via an existing tool like htdig?
>

Shawn,

thanks a lot for replying. I guess that one of the various Linux tools
out there is what I am looking for. htdig would not do the job, as I
would like to include pdf, xls, and doc files in the search. In the
meantime, I found http://www.site-search-pro.com/order/ - but I am not
sure if this is really what I am looking for.

What I am looking for is a simple way to include attached files, such as
pdf, xls, and doc, in the search function of document manager, which
resides on a SUSE Linux server.

Any suggestions are greatly appreciated. Of course, I would be more than
happy to feed any results back into the document manager code, in case
you guys are interested.

Greetings from Munich
Ulf