From: Ross S. <ros...@gm...> - 2010-03-26 14:20:25
|
This is not so much a vufind question as it is a SolrMarc one (since that's where imports happen). For Jangle, (similar principle to OAI-PMH -- the transport is just Atom, rather than OAI XML), I wrote a script to act as the harvesting client, which parsed the records out of the XML and fed them page by page (in OAI-PMH's case, resumption token by resumption token) into SolrMarc via STDIN (rather than than by filename). It works great. The only issues with this approach is that you need to deal with deleted entries yourself (which, if you're using OAI-PMH 2.0 shouldn't be too much of a problem) - but if you don't have access to the MARCXML at that point (since, it's deleted and, generally, OAI-PMH would just send the OAI ID of the record deleted), you'd need a method to delete from the Solr index yourself (although, again, this should be pretty trivial from your harvesting app). -Ross. On Fri, Mar 26, 2010 at 9:30 AM, Thomas Schwaerzler <tho...@ui...> wrote: > hi > > i am currently testing vufind. my plan is to get marc21 and marcxml from > our partner libraries and feed them into vufind. > until now i could import some sample data files and it works really > pretty good. > some of these files contain of raw marc21 others contain marcxml. > i also successfully tested a marcxml sample that was packed into a > OAI-PMH response. > > sample(incomplete): > > <?xml version="1.0" encoding="UTF-8" ?> > <OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" > xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" > xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ > http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"> > <responseDate>2010-03-05T14:28:17Z</responseDate> > <request verb="ListRecords" metadataPrefix="marc21" set="EODDONEOAI" > >http://aleph.vkol.cz/OAI</request> > <ListRecords> > <record> > <header> > > <identifier>oai:aleph.vkol.cz:SVK01-000794240</identifier> > <datestamp>2009-08-21T01:32:45Z</datestamp> > <setSpec>EODDONEOAI</setSpec> > </header> > <metadata> > <marc:record xmlns:marc="http://www.loc.gov/MARC21/slim" > > xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" > xsi:schemaLocation="http://www.loc.gov/MARC21/slim > http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd"> > <marc:leader> nam a22 7a 4500</marc:leader> > ... > > > > my question is now: can i connect vufind directly to a oai-pmh interface > and let it get the data via http or do i have to harvest my desired data > with a separate software and feed it to vufind as xml files? > > > thanks a lot > > > Thomas Schwaerzler > DEA Innsbruck > Department for Digitisation and Digital Preservation > University Innsbruck Library > 6020 Innsbruck - Innrain 52 - Austria > Phone: ++43-(0)512-507-2489 > Fax: ++43-(0)512-507-9842 > sip: sip:tho...@ek... > Email: <Tho...@ui...> > URL: http://www2.uibk.ac.at/ub/dea/ > > > > > ------------------------------------------------------------------------------ > Download Intel® Parallel Studio Eval > Try the new software tools for yourself. Speed compiling, find bugs > proactively, and fine-tune applications for parallel performance. > See why Intel Parallel Studio got high marks during beta. > http://p.sf.net/sfu/intel-sw-dev > _______________________________________________ > VuFind-General mailing list > VuF...@li... > https://lists.sourceforge.net/lists/listinfo/vufind-general > |