|
From: Geoff H. <ghu...@ws...> - 2002-05-09 15:05:40
|
On Wed, 8 May 2002, Neal Richter wrote: > value of setmode() turns up at one point as 0x4000 (_O_TEXT). So > somewhere the file is opened, probably for reading as _O_TEXT. This > causes NL to be translated to NL-CR during a subsequent write. Hmm. > probably in the HtDig code somewhere.. we noticed that the db.words.db > file is written, closed, opened for read, and rewritten during the > course of htdig.exe spidering a single web-page. Basically, the BDB files > should never be opened in TEXT mode. Wait. What version of ht://Dig is this? I'm guessing 3.2... Otherwise, htdig should never hit the db.words.db file, only htmerge should hit that. And we get the same bug reports from 3.1.x... As far as 3.2 writing, closing, opening, etc., I'd take a look in the htword/ (mifluz) code. The htdig/htcommon/HtWordList class is our interface when indexing and it should always have the database open for writing. Thanks for the work, this sounds like you're getting to the bottom of some long-standing (and hard to pin down) bugs! -Geoff |