|
From: Brad T. <br...@ar...> - 2008-02-01 20:13:43
|
Hey Miguel, I think I just found the problem: I hadn't checked in a small but crucial change to the wayback-code pom.xml which increases the dependency on archive-commons from 2.0.0 to 2.0.1.. I'm betting this makes all the difference. Please try updating to the latest HEAD and let me know if that works for you. Brad Miguel Costa wrote: > Hello, > > I installed wayback 1.1.0-SNAPSHOT from svn. When I query the wayback with > an URL I get a: > > org.archive.io.NoGzipMagicException > org.archive.io.GzipHeader.readHeader(GzipHeader.java:122) > org.archive.io.GzipHeader.<init>(GzipHeader.java:107) > org.archive.io.GzippedInputStream.readHeader(GzippedInputStream.java:335) > > org.archive.io.GzippedInputStream.gzipMemberSeek(GzippedInputStream.java:370 > ) > > org.archive.io.arc.ARCReaderFactory$CompressedARCReader.get(ARCReaderFactory > .java:383) > > org.archive.io.arc.ARCReaderFactory$CompressedARCReader.get(ARCReaderFactory > .java:326) > > org.archive.wayback.resourcestore.LocalARCResourceStore.retrieveResource(Loc > alARCResourceStore.java:108) > org.archive.wayback.webapp.AccessPoint.handleReplay(AccessPoint.java:312) > org.archive.wayback.webapp.AccessPoint.handleRequest(AccessPoint.java:280) > org.archive.wayback.webapp.RequestFilter.handle(RequestFilter.java:106) > org.archive.wayback.webapp.RequestFilter.doFilter(RequestFilter.java:90) > > The wayback find de file and then check if it is OK. This check thows a > NoGzipMagicException because it doesn't find a "magic" number. > The code used is in commons-2.0.0-SNAPSHOT-sources.jar (from Heritrix) for > both projects - nutchwax and wayback. > > I also installed nutchax 0.11.0-SNAPSHOT from svn (both projects from trunk) > and indexed the same ARC files. The query's results are presented ok. > Other files present the same symptoms. > Does anyone have a clue of this problem? Does anyone use this version of > wayback without problems? > > > Thanks > -- > > Miguel Costa > > > > > ------------------------------------------------------------------------ > > ------------------------------------------------------------------------- > This SF.net email is sponsored by: Microsoft > Defy all challenges. Microsoft(R) Visual Studio 2008. > http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ > ------------------------------------------------------------------------ > > _______________________________________________ > Archive-access-discuss mailing list > Arc...@li... > https://lists.sourceforge.net/lists/listinfo/archive-access-discuss > |