From: Jim C. <gre...@yg...> - 2002-07-08 02:42:02
|
Christopher Murtagh's bits of Thu, 20 Jun 2002 translated to: > Currently, I have htDig configured to ignore any URLs with '?' in them >because it indexes thousands of pages within our University that shouldn't >be and has potential infinite loop problems. However, there is one URL >that I would like htDig that has a list of these URLs that I would also >like it to include. So, my question is: > > Is it possible to tell htDig to exclude pattern '?', but index URLs that >match 'www.foobar.com/?foo=' ? You could dig the two cases separately and then merge the resulting databases (see the -m option of htmerge). You might also take a look at htdig's -m option. If you only have one (or a few) exceptions, you might be able to get away with just running htdig again with -m before running htmerge. Jim |