From: Brian G. <bdg...@pi...> - 2006-12-08 14:39:08
|
Juan, http://digital.library.pitt.edu/cgi-bin/b/broker20/broker20 is the URL. If you 'verb=ListSets' you will see a long list of sets (46). When I harvested the site I ended up with I believe only 7 I ran the harvest.sh and then the parser.sh scripts. I then accessed the arc site and used the advanced search. Initially I didn't see any "Archive Set" list. But today I see the list of 7 that it did harvest. So, initially it doesn't seem to load the sets into the arc interface. Is this due to the setting you mentioned in your first email? Thanks, Brian Gregg. con...@ur... wrote: > I have not found this problem. > > Now, the advanced search page only shows the sets obtained with the verb > ListSets (http://your_repository?verb=ListSets) > > Can you send me the address of the repositories where the arc fail? > > I am sorry for the problems, > > -- > Juan > > > Quoting Brian Gregg <bdg...@pi...>: > >> Juan, >> >> There currently seems to be a problem with the sets harvesting part. >> It gets some of the sets but not all of them for some reason. Out of >> about 25 sets in our archive the code only picked up 8 of them. It >> seems random. Have you noticed this? >> >> Thanks, >> Brian Gregg. >> >> con...@ur... wrote: >>> Hi Brian, >>> >>> We had think show the set Names and not the set ids in the next arc >>> release. >>> We have change the code to patch this feature. You can obtain the >>> code from cvs >>> and install it: >>> >>> cvs -d:pserver:ano...@oa...:/cvsroot/oaiarc >>> login >>> (click enter when password is asked) >>> cvs -z3 >>> -d:pserver:ano...@oa...:/cvsroot/oaiarc co -r >>> oaiarc_1_0_sandbox -P oaiarc >>> >>> There are a new table called "sets" with the set information. >>> >>> There are a timer in the search.AdvanceForm class. It refresh the >>> parsed data >>> all the 12 hours. Until this timer is executed, the new set >>> information is not >>> updated. You can relaunch the arc to be effective the changes in the >>> sets after >>> the parse is done. You can also change the refresh time in the >>> src/java/search.AdvanceForm before build the arc. >>> >>> Please, write us if you want more information, >>> >>> Best Regards, >>> >>> Juan >>> >>> >>> Quoting Brian Gregg <bdg...@pi...>: >>> >>>> When harvesting sites I notice that you seem to not store the Full Set >>>> information, particularly that they are not stored in a separate >>>> database table. >>>> >>>> Since I've been requested to list the setName instead of just setSpec >>>> can you suggest where in your harvesting code I could put code so >>>> that I >>>> could populate a table that contained simply the dataprovider id, >>>> setSpec, setName so that we could easily identify the sets >>>> available per >>>> archive? >>>> >>>> I've seen in the code where you harvest the sets (Partitions) but >>>> for a >>>> different reason. I'd like to take that a step further and store the >>>> set information if at all possible. >>>> Since I'm not very familiar with your harvesting code instead of >>>> writing >>>> my own routines separate to your code do you have any code in the >>>> project that would allow easy access to the setName and setSpec >>>> already >>>> so that I don't have to reinvent the wheel? I would of course >>>> contribute any changes I make back to you so that others can use >>>> this if >>>> wanted. >>>> >>>> Any hints would be appreciated. >>>> >>>> Thanks, >>>> Brian Gregg. >>>> >>>> -- >>>> >>>> +--------------------------------+------------------------------+ >>>> | Brian D. Gregg | | >>>> | Systems Analyst | | >>>> | University Library System | | >>>> | University of Pittsburgh | e-mail: bd...@pi... | >>>> | 7500 Thomas Blvd. | voice: 412-244-7507 | >>>> | Pittsburgh, PA 15208 | fax: 412-244-7515 | >>>> +--------------------------------+------------------------------+ >>>> | Member: | >>>> | ASNP - Association of Storage Networking Professionals | >>>> +---------------------------------------------------------------+ >>>> >>>> >>>> >>>> >>>> ------------------------------------------------------------------------- >>>> Take Surveys. Earn Cash. Influence the Future of IT >>>> Join SourceForge.net's Techsay panel and you'll get the chance to >>>> share your >>>> opinions on IT & business topics through brief surveys - and earn cash >>>> http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV >>>> _______________________________________________ >>>> Oaiarc-user mailing list >>>> Oai...@li... >>>> https://lists.sourceforge.net/lists/listinfo/oaiarc-user >>>> >>> >>> >>> >> >> -- >> >> +--------------------------------+------------------------------+ >> | Brian D. Gregg | | >> | Systems Analyst | | >> | University Library System | | >> | University of Pittsburgh | e-mail: bd...@pi... | >> | 7500 Thomas Blvd. | voice: 412-244-7507 | >> | Pittsburgh, PA 15208 | fax: 412-244-7515 | >> +--------------------------------+------------------------------+ >> | Member: | >> | ASNP - Association of Storage Networking Professionals | >> +---------------------------------------------------------------+ >> >> >> >> > > > -- +--------------------------------+------------------------------+ | Brian D. Gregg | | | Systems Analyst | | | University Library System | | | University of Pittsburgh | e-mail: bd...@pi... | | 7500 Thomas Blvd. | voice: 412-244-7507 | | Pittsburgh, PA 15208 | fax: 412-244-7515 | +--------------------------------+------------------------------+ | Member: | | ASNP - Association of Storage Networking Professionals | +---------------------------------------------------------------+ |