From: Armin S. <Arm...@ui...> - 2011-12-06 08:57:24
|
Hello List, i am wondering if it is possible to access the Wayback database. I am currently designing an archive website and at the moment we are harvesting using archive-it!. I have a cronjob running that fetches the new arc files and imports them into my local Wayback install. What i would like to do is to check what URL's are accessible via Wayback to put them into my database. My second option, since i create a Lucene index using NutchWAX woud be to get all the URL fields from there, but it seems like a workaround to me... Thanks for your help, Bests Armin |