From: Gilles D. <gr...@sc...> - 2003-01-24 21:10:43
|
According to Conrad Schilbe: > It isn't running in update mode. I even added `remove_bad_urls: false' > to the configuration file. OK, then it must be that the server is never returning any 404 status codes. Are you sure this site has links to non-existant URLs? > > If that doesn't help, have a look at how 404 errors are dealt with on > > that site. It may be that htdig is never seeing that status code > > there, > > but is instead getting some other document (e.g. an error page), with > > a normal status code, for any unresolvable URL on that site. > > Even if it is not seeing any bad URLs possibly caused by the way 404s > are handled, it should still output `Errors to take note of:' in the > report. That text should be there even when there are no errors... I > have seen it in my tests. Which makes me believe that something is > failing. No, the logic in the code is as follows... if (notFound.length() > 0) { cout << "\n" << name << ": Errors to take note of:\n"; cout << notFound; } so if "notFound" is never set to anything, it won't put out the "Errors to take note of" message either. notFound is only set (i.e. appended to) when there is one of the following errors occurs for a given URL: "Not found", "Unknown host", "Unable to contact server". The latter two are detected internally by htdig, if the name lookup fails or the attempt to open the connection fails. The first one, "Not found", only occurs if the HTTP server returns a status code other than 200, 30*, or 401. If none of these errors occurs, htdig shouldn't tell you to take note of them. -- Gilles R. Detillieux E-mail: <gr...@sc...> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/ Dept. Physiology, U. of Manitoba Winnipeg, MB R3E 3J7 (Canada) |