From: Jim C. <gre...@yg...> - 2002-07-17 22:46:54
|
J.J...@dn...'s bits of Wed, 17 Jul 2002 translated to: >Running htdig just fails. >- I use the same config file as on HP-UX 10.20. >- I removed all pdf-files... >- All db-files but db.wordlist are larger than the filesystem. >- Rundig -vvv ends with error message: DB2 problem...: unrecognized file >type. >- rundig.log looks fine What do you mean by "looks fine"? Can you tell for certain that it is only retrieving the small number of documents that you expect to be indexed? One of the things that it important to eliminate is the possibility that it indexing outside of the intended limits or getting stuck in some sort of loop involving symbolic links, bad HTML, etc. Perhaps you have checked this and this is what you mean by "looks fine". Just checking :) >Any suggestions? >- Compile with debugging (how do you do that) >- HP Ansi-C compiler? >- Give it up? You might want to take a look at the way rundig works and try running the components manually, one at a time. In particular, run htdig and check to see if the generated files appear to be a reasonable size. Then move on to htmerge, and do the same. If things still look reasonable, move on to htnotify, and if appropriate, htfuzzy. This is of course not likely to fix anything, but it might further narrow down where the problem is occurring. Jim |