From: Ko, L. <Lau...@un...> - 2011-04-28 18:45:39
|
Hello Pramila, I think there is still a problem in wayback-1.6.1 when deploying in a non-ROOT context. I noticed that, when deploying under name "wb" with access point called "coll1," http://localhost:8080/wb/coll1/ works correctly as expected, but if you attempt to access http://localhost:8080/wb/, you no longer get the "You seem to be accessing this Wayback via an incorrect URL." Rather you get the default search page with the image and css broken, and all of the links (Help, Home, Take Me Back) all dropping back to using the prefix of http://localhost:8080/ without the "wb/." I think this is the same thing you are describing below. Here is my example configuration that does work at http://localhost:8080/wb/coll1/ when deploying under name "wb", setting an access point at "coll1", and using a CDX index for my WARCs: >From wayback.xml: <bean class="org.springframework.beans.factory.config.PropertyPlaceholderConfigurer"> <property name="properties"> <value> wayback.basedir=/home/me/Desktop/wayback wayback.urlprefix=http://localhost:8080/wb/ </value> </property> </bean> <bean id="waybackCanonicalizer" class="org.archive.wayback.util.url.AggressiveUrlCanonicalizer" /> <import resource="CDXCollection.xml"/> <import resource="ArchivalUrlReplay.xml"/> <bean name="+" class="org.archive.wayback.webapp.ServerRelativeArchivalRedirect"> <property name="matchPort" value="8080" /> <property name="useCollection" value="true" /> </bean> <bean name="8080:coll1" class="org.archive.wayback.webapp.AccessPoint"> <property name="serveStatic" value="true" /> <property name="bounceToReplayPrefix" value="false" /> <property name="bounceToQueryPrefix" value="false" /> <property name="replayPrefix" value="${wayback.urlprefix}coll1/" /> <property name="queryPrefix" value="${wayback.urlprefix}coll1/" /> <property name="staticPrefix" value="${wayback.urlprefix}coll1/" /> <property name="collection" ref="localcdxcollection" /> <property name="replay" ref="archivalurlreplay" /> <property name="query"> <bean class="org.archive.wayback.query.Renderer"> <property name="captureJsp" value="/WEB-INF/query/CalendarResults.jsp" /> </bean> </property> <property name="uriConverter"> <bean class="org.archive.wayback.archivalurl.ArchivalUrlResultURIConverter"> <property name="replayURIPrefix" value="${wayback.urlprefix}coll1/"/> </bean> </property> <property name="parser"> <bean class="org.archive.wayback.archivalurl.ArchivalUrlRequestParser"> <property name="maxRecords" value="10000" /> <property name="earliestTimestamp" value="2007" /> </bean> </property> </bean> And from CDXCollection.xml: <bean id="localcdxcollection" class="org.archive.wayback.webapp.WaybackCollection"> <property name="resourceStore"> <bean class="org.archive.wayback.resourcestore.LocationDBResourceStore"> <property name="db"> <bean class="org.archive.wayback.resourcestore.locationdb.FlatFileResourceFileLocationDB"> <property name="path" value="/home/me/Desktop/path-index.txt" /> </bean> </property> </bean> </property> <property name="resourceIndex"> <bean class="org.archive.wayback.resourceindex.LocalResourceIndex"> <property name="canonicalizer" ref="waybackCanonicalizer" /> <property name="source"> <bean class="org.archive.wayback.resourceindex.cdx.CDXIndex"> <property name="path" value="/home/me/Desktop/index.cdx" /> </bean> </property> <property name="maxRecords" value="10000" /> </bean> </property> </bean> Hope this helps, Lauren Ko Web Archiving Programmer UNT Libraries ________________________________________ From: Pra...@on... [Pra...@on...] Sent: Wednesday, April 27, 2011 9:51 AM To: arc...@li... Subject: [Archive-access-discuss] Replay help Hi Everyone, I used the latest wayback-1.6.1.tar.gz The Indexing seem to work OK. But now I have some configuration issues. Can someone please help me? In wayback.xml I have the following property set <bean class="org.springframework.beans.factory.config.PropertyPlaceholderConfigurer"> <property name="properties"> <value> wayback.basedir=/tmp/wayback wayback.urlprefix=http://localhost:8080/wayback-1.6.1/politicalCollection/ </value> </property> </bean> But when I go to http://localhost:8080/wayback-1.6.1 , I see the page but no collection list. When I type in the url of the archived site, the url changes to http://localhost:8080/query?type=urlquery&url=http%3A%2F%2Fwww.gpo.ca&date=&Submit=Take+Me+Back which is an error. How do I replay the site. I think I am missing something in the config. My arcs are present at ${wayback.basedir}/politicalCollection/ <bean id="datadirs" class="org.springframework.beans.factory.config.ListFactoryBean"> <property name="sourceList"> <list> <bean class="org.archive.wayback.resourcestore.resourcefile.DirectoryResourceFileSource"> <property name="name" value="politicalCollection" /> <property name="prefix" value="${wayback.basedir}/politicalCollection/" /> <property name="recurse" value="false" /> </bean> </list> </property> </bean> Am I missing something? Please help. Thanks, --Pramila Thakur ________________________________ From: Bradley Tofel [mailto:br...@ar...] Sent: Tuesday, April 19, 2011 7:12 PM To: arc...@li... Subject: Re: [Archive-access-discuss] Auto Indexr NOT working Eric, thanks again for the accurate and timely list response! Pramila, Eric is correct - wayback-1.6.0 uses a core Java GZIP library which was changed recently by a Java update. This change broke Wayback indexing code. Heritrix SVN now includes a work-around for the issue, and I've just built a 1.6.1 release candidate for Wayback with the latest Heritrix snapshot. The release address the GZIP issue, which fixes the W/ARC indexing code, and also corrects an issue which prevents Wayback from being deployed in a non-ROOT context. You can test out the release candidate at: http://home.us.archive.org/~brad/wayback-1.6.1.tar.gz which should address the indexing problem you reported. Thanks! Brad On 4/18/11 10:36 AM, Erik Hetzner wrote: At Mon, 18 Apr 2011 17:03:57 +0000, <Pra...@on...><mailto:Pra...@on...> wrote: Hi Everyone, I am getting started with wayback machine. But I am having problem indexing the arc files. I get an error on Tomcat as this java.io.IOException: Resetting to invalid mark […] Hi Pramila, Please see: https://webarchive.jira.com/browse/HER-1865 In summary, you will probably need to use a version of Java prior to 6u23. best, Erik Sent from my free software system <http://fsf.org/><http://fsf.org/>. ------------------------------------------------------------------------------ Benefiting from Server Virtualization: Beyond Initial Workload Consolidation -- Increasing the use of server virtualization is a top priority.Virtualization can reduce costs, simplify management, and improve application availability and disaster protection. Learn more about boosting the value of server virtualization. http://p.sf.net/sfu/vmware-sfdev2dev _______________________________________________ Archive-access-discuss mailing list Arc...@li...<mailto:Arc...@li...> https://lists.sourceforge.net/lists/listinfo/archive-access-discuss |