|
From: Lachlan A. <lh...@us...> - 2003-10-03 13:10:03
|
Greetings Neal, I'm not sure that I understand this. If a page 'X' is linked only by=20 a page 'Y' which isn't changed since the previous dig, do we parse=20 the unchanged page 'Y'? If so, why not run htdig -i? If not, how=20 do we know that page 'X' should still be in the database? I'd be inclined not to fix this until after we've released the next=20 "archive point", whether that be 3.2.0b5 or 3.2.0rc1... Cheers, Lachlan On Fri, 3 Oct 2003 08:56, Neal Richter wrote: > The workaround is to use 'htdig -i'. This is a disadvantage as we > will revisit and index pages even if they haven't changes since the > last run of htdig. > > Here's the Fix: > > 1) At the start of Htdig, after we've opened the DBs we 'walk' the > docDB and mark EVERY document as Reference_obsolete. I wrote code > to do this.. very short. --=20 lh...@us... ht://Dig developer DownUnder (http://www.htdig.org) |