|
From: Brad T. <br...@ar...> - 2010-08-12 21:35:25
|
Hi Allen, Glad you've got Wayback up and running to your satisfaction. The simplest way to get the navigation/categorization functionality you're talking about would be to create some web pages, either with HTML, or a wiki, that will allow you to manually create lists with links to either the Wayback calendar listing for the sites of interest, or even directly to specific capture dates of the sites, and see how far that gets you. Wayback does not support full-text search (show me pages containing the word "governor") but there is another package, NutchWAX, which is also discussed on this list, which provides full-text search of content stored in (W)ARC files. You can get started with NutchWAX at: //http://archive-access.sourceforge.net/projects/nutchwax/ Organizations doing lots of crawling have streamlined their crawl, ingestion, and access processes to allow associating metadata, which could include categorization, before crawls are launched, and automatically generate navigation systems which link into Wayback for seed URLs, etc, but we've definitely strayed into advanced topics here. Some existing software tools and services which help with parts of this process can be found at: NetArchive: http://netarchive.dk/ Web Curator Tool: http://webcurator.sourceforge.net/ Archive-It: http://www.archive-it.org/ Brad On 08/11/2010 09:54 AM, Allen Sim wrote: > Hi, > I am very happy with Wayback search function. It's really powerful! > Now that I am using heritrix to archive the websites and I use wayback for > seraching purpose. > I noticed that everytime when I need to search for an archived website,I > need to type the full URL address. Can wayback serach by keyword or name > instead of full URL? > Can I categorized my archived websites and make a link, for example : > > 1. Goverment websites > 2. Entertainment websites > 3. Food& Beverage websites > 4. Health websites > > and so on... insted of type in full URL address in the serch text box, is > there any possible wayback can provide alternative way that user is allow to > click on the categorized link as above and by clicking this link... a list > of url under that category will be show.. and by clicking the interested url > link it will link them to the timeline page then finally the websites. > > Hope to hear from your guidiance and reply. > Thanks in advance, > Allen Wilson > > > > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by > > Make an app they can't live without > Enter the BlackBerry Developer Challenge > http://p.sf.net/sfu/RIM-dev2dev > > > _______________________________________________ > Archive-access-discuss mailing list > Arc...@li... > https://lists.sourceforge.net/lists/listinfo/archive-access-discuss > |