From: raffaele m. <raf...@at...> - 2011-10-27 14:08:01
|
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On Oct 26, 2011, at 7:06 PM, Erik Hetzner wrote: > Thanks for the tip! I was confused, as the source for cdx-indexer > says: > > ## This script creates a CDX file for all ARC files in a directory > ## PUTs those CDX files into a remote pipeline, and informs a remote > ## LocationDB of the locations of all the ARC files. > > which is very different behavior. I have filed a ticket on the wayback > jira. hi Herik, check also this message that Bradley posted here some months ago http://sourceforge.net/mailarchive/message.php?msg_id=27948009 the cdx output of cdx-indexer has an extra field than warc-indexer ciao - -- raf...@at... -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (Darwin) iEYEARECAAYFAk6pZZoACgkQNEBieznDNrxmpwCfZhyYdKnBWDqYXdA0Y8RLKQcj 7o4AoNU9f5j0xnvkR8ldtAbYslBGS97Y =J5TZ -----END PGP SIGNATURE----- |