|
From: Dang N. H. <dan...@ya...> - 2006-12-06 07:18:36
|
Hi everyone,=0AMy project used Wayback to render the webpages crawled by He= ritrix. However, we encounted some problem related to server-side redirecte= d link. =0AThe website that we want to crawl is JSP site running on Tomcat.= There are many link which are redirected by the server (not using js, but = by the jsp script itself). I can check that Heritrix actually follows these= redirected link and crawls these webpage into ARC file. However, when we t= ry to render using Wayback, we can not follow these redirected link (and th= e Wayback display error of "No resource available"). So I wonder whether an= y of you have a plan to fix it and what is your approach?=0AThanks=0ANam Ha= i=0A=0A=0A =0A_____________________________________________________________= _______________________=0AAny questions? Get answers on any topic at www.An= swers.yahoo.com. Try it now. |