From: Xiaoming L. <xia...@gm...> - 2006-11-29 14:12:21
|
OAI-PMH supports different metadata formats, with oai_dc as the mandatory one. In OAI-PMH request, the metadata format can be specified, such as: http://www.citidel.org/oai/provider-2.0.pl?verb=GetRecord&identifier=oai:CSHistory:stepanek_metcalfe-gif&metadataPrefix=oai_dc request oai_dc; and http://www.citidel.org/oai/provider-2.0.pl?verb=GetRecord&identifier=oai:CSHistory:stepanek_metcalfe-gif&metadataPrefix=citidel request citidel native format. so in a correct implementation, the response format is decided by your request. If you are requesting oai_dc but get citidel native format, it's certainly a problem of citidel, so you may want to check exactly which request you send. The arc harvester supports harvesting any metadata formats, but currently there is only oai_dc parser. cheers, xiaoming On 11/28/06, GuoFang Teng <guo...@vi...> wrote: > Hi Xiaoming, > > I was able to harvest things from a repository, or from a set of a > repository. I harvested those metadata from the set "CSTC" of the > repository "CITIDEL". I used "history_mass" as my harvest method in > dps.xml. In my response.xml, I can see those metadata tagged in records. > But they are tagged in the following format: > > <header> > <identifier>oai:CSTC:264</identifier> > <datestamp>2002-12-01T23:19:16Z</datestamp> > <setSpec>CSTC</setSpec> > </header> > > <metadata> > <record> > <basic> > <title>Building a Virtual Topology Atop Wireless Devices</title> > <date>2002-05-13</date> > <url>http://www.cstc.org/cgi- > bin/show_abstract.pl?type=official&number=264</url> > <abstract> This poster summarizes my contributions to a project in > which ...</abstract> > <language>English</language> > <keyword> > network > </keyword> > <author>Nathaniel Waisbrot</author> > </basic> > <lom> > <role>1</role> > <role>2</role> > <edlevel>3</edlevel><edlevel>4</edlevel> > <edlevel>2</edlevel> > <language>1</language> > <classification> > <category namespace="CCS1998" priority="1">K</category> > <category namespace="CCS1998" priority="1">I</category> > <category namespace="CCS1998" priority="1">C</category> > <category namespace="CCS1998" priority="1">A.m</category> > <category namespace="CCS1998" priority="1">C</category> > <category namespace="CCS1998" priority="1">C.2</category> > </classification> > </lom> > </record> > </metadata> > > I am not sure if it is correct since according to the document: > http://www.openarchives.org/OAI/openarchivesprotocol.htm#Record > A record with metadata is supposed to be expressed in the Dublin Core > format, which is like: > > <metadata> > <oai_dc:dc > xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" > xmlns:dc="http://purl.org/dc/elements/1.1/" > xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" > xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ > http://www.openarchives.org/OAI/2.0/oai_dc.xsd"> > <dc:title>Using Structural Metadata to Localize Experience of Digital > Content</dc:title> > <dc:creator>Dushay, Naomi</dc:creator> > <dc:subject>Digital Libraries</dc:subject> > <dc:description>With the increasing technical sophistication of both > information consumers and providers, there is increasing demand for > more meaningful experiences of digital information. We present a > framework that separates digital object experience, or rendering, > from digital object storage and manipulation, so the > rendering can be tailored to particular communities of users. > </dc:description> > <dc:description>Comment: 23 pages including 2 appendices, > 8 figures</dc:description> > <dc:date>2001-12-14</dc:date> > <dc:type>e-print</dc:type> > <dc:identifier>http://arXiv.org/abs/cs/0112017</dc:identifier> > </oai_dc:dc> > </metadata> > > I don't know if this is related to my harvest method, or some other > reasons. > > Thank you for your help and looking forward to your reply. > > Bryan > > ----- Original Message ----- > From: Xiaoming Liu <xia...@gm...> > Date: Wednesday, November 15, 2006 9:06 am > Subject: Re: [Oaiarc-user] Fwd: Fw: Questions about OAI > To: "con...@ur..." > <con...@ur...> > Cc: GuoFang Teng <guo...@vi...>, > oai...@li... > > > Yes, step (p) and (q) in Readme have additional information: > > > > p) If compiled from "ant" only > > If compiled from ant > > copy plugin/conf/*.xml build/conf/*.xml > > make directory tmp in build > > skip next step > > > > q) Create a directory called build and create the following > > directory structure > > build/ > > dspace/*.class ( All class files from plugin/dspace/*.java) > > conf/*.xml > > ns/*.class ( All class files from plugin/ns/*.java) > > oairequester/*.class ( All class files from > > plugin/oairequester/*.java) scheduler/*.class ( All class > > files from plugin/scheduler/*.java) > > tmp/ ( make directory called tmp) > > > > Basically the conf directory must be also in classpath. > > > > xiaoming > > > > > > On 11/15/06, con...@ur... > > <con...@ur...> wrote: > > > Hi Brian, > > > > > > I am sorry for my bad English. I have not installed the dspace and > > > the plugin, > > > but I think that you should copy the conf directory with the dps.xml > > > and map.xml > > > files customized (it should be in > > /home/gteng/citiDel/plugin/plugin or in > > > /home/gteng/citiDel/plugin) to the > > /home/gteng/citiDel/plugin/plugin/build> directory. Perhaps you > > should create the "temp" directory in > > > /home/gteng/citiDel/plugin/plugin/build too. > > > > > > Good luck, > > > > > > -- > > > Juan > > > > > > > > > Quoting GuoFang Teng <guo...@vi...>: > > > > > > > Hi Xiaoming, > > > > > > > > Thank you for the reply. I made sure the file was under conf > > and I did > > > > start the application from "build" directory. And it looks I have > > > > everything under my CALSSPATH. But it still complained the file > > cannot> > be found. I pasted my related operations from unix > > environment. Thank > > > > your again for your help. > > > > > > > > bash-2.03$ pwd > > > > /home/gteng/citiDel/plugin/plugin/build > > > > bash-2.03$ ls > > > > dspace ns oairequester scheduler > > > > bash-2.03$ env > > > > PWD=/home/gteng/citiDel/plugin/plugin/build > > > > CATALINA_BASE=/home/gteng/citiDel/tomcat > > > > TZ=US/Eastern > > > > CATALINA_HOME=/home/gteng/citiDel/tomcat > > > > CATALINA_TMPDIR=/home/gteng/citiDel/tomcat/temp > > > > HOSTNAME=what > > > > LD_LIBRARY_PATH=/usr/local/lib:/home/gteng/citiDel/pgsql/lib > > > > > > > CLASSPATH=.:/home/gteng/citiDel/xml/sax.jar:/home/gteng/citiDel/xml/xp.jar> > > MANPATH=/usr/man:/usr/local/man > > > > USER=gteng > > > > MACHTYPE=sparc-sun-solaris > > > > MAIL=/var/mail/gteng > > > > SSH2_CLIENT=68.81.210.32 12294 153.104.201.200 22 > > > > OLDPWD=/home/gteng/citiDel/plugin/plugin > > > > JAVA_HOME=/home/k12port/jdk1.5.0_04 > > > > LOGNAME=gteng > > > > SHLVL=1 > > > > SHELL=/bin/sh > > > > HOSTTYPE=sparc > > > > JRE_HOME=/home/k12port/jdk1.5.0_04/jre > > > > OSTYPE=solaris > > > > HOME=/home/gteng > > > > TERM=vt100 > > > > > > > PATH=/home/k12port/jdk1.5.0_04/bin:/home/gteng/citiDel/dspace/bin:/usr/bin:/usr/ucb:/etc:._=usr/bin/env/> > > _=/usr/bin/env > > > > bash-2.03$ java scheduler/Harvest > > > > Beginning to Harvest > > > > data source=xml > > > > url in XMLDPReader: ./conf/dps.xml > > > > Reach init() cl!=null: ./conf/dps.xml > > > > in: null > > > > An error occurred: java.lang.Exception: Data Provider > > configuration file > > > > './conf/dps.xml' not found > > > > doc is null > > > > Finish Harvesting task:2006-11-14 > > > > bash-2.03$ > > > > > > > > Bryan > > > > > > > > ----- Original Message ----- > > > > From: Xiaoming Liu <xia...@gm...> > > > > Date: Tuesday, November 14, 2006 3:40 pm > > > > Subject: Re: Fwd: Fw: Questions about OAI > > > > To: GuoFang Teng <guo...@vi...> > > > > Cc: oai...@li... > > > > > > > >> hi, > > > >> > > > >> The Dspace plug-in is based on an older version of ARC code, > > > >> however I > > > >> didn't develop the package by myself, would you please check if > > > >> dps.xml is located in conf directory? > > > >> > > > >> This is defined in ns/XMLDPReader.java > > > >> > > > >> public XMLDPReader() > > > >> { > > > >> this("./conf/dps.xml"); > > > >> } > > > >> > > > >> secondly, you will need start the application from "build" > > directory,> >> and put "current" directory in your classpath, such as: > > > >> > > > >> CLASSPATH=.:/home/gmanepal/xml/sax.jar:/home/gmanepal/xml/xp.jar; > > > >> export CLASSPATH > > > >> > > > >> let me know if you still have problems. > > > >> > > > >> xiaoming > > > >> > > > >> > > > >> >cd build > > > >> > > > >> > > > >> > > > >> On 11/14/06, GuoFang Teng <guo...@vi...> wrote: > > > >> > Hi Xiaoming, > > > >> > > > > >> > Thank you for your reply. I appreciate it. I am trying to > > harvest> >> > things from a given data provider into my dspace. I > > downloaded> >> > plugin.jar and xml.jar from > > > >> > http://dspace.cs.odu.edu/downloads.html > > > >> > > > > >> > And I followed the instructions in the Readme file. I went > > > >> through the > > > >> > steps until s, which is to Run Command: java scheduler/Harvest. > > > >> But the > > > >> > system told me: > > > >> > > > > >> > An error occurred: java.lang.Exception: Data Provider > > configuration> >> > file '/dps.xml' not found. > > > >> > > > > >> > I have copied dps.xml into the same directory XMLDPReader.class > > > >> is in. > > > >> > But the system still can not find it. I looked into your > > > >> > XMLDPReader.java file and found the object of ClassLoader cl > > was not > > > >> > null, which I think was OK. But after we call: > > > >> > > > > >> > in = cl.getResourceAsStream(xmlurl). The object of > > > >> java.io.InputStream> in was always null. So, the system threw an > > > >> exception. I know we need > > > >> > to set Classpath to point to sax.jar and xp.jar, which I had > > > >> done. But > > > >> > it looked still unable to getResourceAsStream. > > > >> > > > > >> > I am not sure if I did it correctly. And I attached dps.xml. > > > >> Thank you > > > >> > very much for your help on it. I really appreciate it. > > > >> > > > > >> > > > > >> > Bryan Teng > > > >> > > > > >> > ----- Original Message ----- > > > >> > From: Xiaoming Liu <xia...@gm...> > > > >> > Date: Tuesday, November 14, 2006 2:25 pm > > > >> > Subject: Fwd: Fw: Questions about OAI > > > >> > To: guo...@vi... > > > >> > Cc: oai...@li... > > > >> > > > > >> > > hi Bryan, > > > >> > > > > > >> > > Could you tell me where you download the software and what > > you try > > > >> > > to do? We > > > >> > > are still maintaining the arc software through > > sourceforge, and > > > >> > > the latest > > > >> > > version is available from sourceforge. > > > >> > > > > > >> > > thanks, > > > >> > > xiaoming > > > >> > > > > > >> > > > > > >> > > > > > >> > > ----- Forwarded by Kurt Maly/LOTUS on 11/14/2006 01:15 PM - > > ---- > > > >> > > *GuoFang Teng <guo...@vi...>* > > > >> > > Sent by: guo...@vi... > > > >> > > > > > >> > > 11/14/2006 11:13 AM > > > >> > > To > > > >> > > ma...@cs... cc > > > >> > > > > > >> > > Subject > > > >> > > Questions about OAI > > > >> > > > > > >> > > > > > >> > > > > > >> > > > > > >> > > > > > >> > > > > > >> > > Hi Dr.Maly, > > > >> > > > > > >> > > My name is Bryan and I am a graduate student working on OAI > > > >> > > harvester at > > > >> > > Villanova University. I downloaded plugin.jar and xml.jar > > files> >> from> > your website and followed your instructions until > > step s). > > > >> I tried to > > > >> > > run java scheduler/Harvest to start harvesting files based > > on data > > > >> > > provider xml file, but I was always told the configuration > > file> >> could> > not be found. I tried different approaches to > > debug. And > > > >> it looked the > > > >> > > file was found but cannot be read as a stream, although I > > do have > > > >> > > sax.jar as one of my classpath. > > > >> > > > > > >> > > I am wondering if you know about this problem or if you can > > > >> > > provide me > > > >> > > some information of your students who worked on it. Thank > > you very > > > >> > > muchfor your help. I appreciate it. > > > >> > > > > > >> > > Bryan Teng > > > >> > > > > > >> > > > > >> > > > > >> > > > > >> > > > > > > > > ---------------------------------------------------------------- > > --------- > > > > Take Surveys. Earn Cash. Influence the Future of IT > > > > Join SourceForge.net's Techsay panel and you'll get the chance > > to share your > > > > opinions on IT & business topics through brief surveys - and > > earn cash > > > > > > > http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV> > > _______________________________________________ > > > > Oaiarc-user mailing list > > > > Oai...@li... > > > > https://lists.sourceforge.net/lists/listinfo/oaiarc-user > > > > > > > > > > > > > > > > > > > |