From: Mat K. <mk...@cs...> - 2011-10-25 16:45:28
|
Lauren, I added a Content-Length field representative of the content plus the header still to no avail. I am extremely interested in getting this working and hope to pursue Erik's suggestion of validating the WARC with the "warc-indexer binary distributed with wayback" but have a subtle problem in that I do not know where to find this binary to invoke. I have a working Wayback installation on Ubuntu Linux. Where would I find this warc-indexer binary? Thank you, Mat ---------- Forwarded message ---------- From: Erik Hetzner <eri...@uc...> Date: Thu, Oct 20, 2011 at 1:06 PM Subject: Re: [Archive-access-discuss] WARC Manipulation and manually creating WARCs: Need guidance To: "me...@ma..." <me...@ma...> At Wed, 19 Oct 2011 22:46:37 +0000, Ko, Lauren wrote: > > Hi Mat, > > I don't think the warcvalid.py actually does a very thorough check > of what is in your WARC at validation, so you shouldn't rely on > this. > > […] You can use the warc-indexer binary, distributed with wayback, to check to see if wayback can read your warc file, which would be a pretty good indication that you have a valid WARC. best, Erik |