|
From: Chris V. <cv...@gm...> - 2008-02-04 20:19:10
|
Hi all, I am having a problem retrieving harvested resources whose urls include port numbers using Wayback 1.0.1. We have a seed that includes a port number that was harvested using heritrix. The resulting arc files were indexed using wayback, and the urls stored in the index include the port number. Using the wayback web address search interface, I am able to find the urls by including the port number in the search string (if the port number is not included, no results are found - which is expected). The link for the search result does not include the port number, however, and clicking it does not retrieve the harvested resource. If the port number is inserted into the search result link, retrieval works fine. Even so, rewritten links on the retrieved page do not include a port number where applicable. So my question is, how do I ensure that port numbers are preserved in wayback search results and in rewritten links? Thanks, Chris |