You can subscribe to this list here.
| 2007 |
Jan
|
Feb
|
Mar
|
Apr
|
May
(2) |
Jun
(9) |
Jul
(6) |
Aug
|
Sep
(2) |
Oct
|
Nov
|
Dec
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2008 |
Jan
|
Feb
|
Mar
|
Apr
|
May
(1) |
Jun
|
Jul
|
Aug
|
Sep
|
Oct
|
Nov
|
Dec
|
|
From: David R. <dw...@co...> - 2008-05-13 20:00:27
|
Dear Kaushal,
For a demo of DPubS, the best thing to look at right now are the
various publications using DPubS. Nine current projects are listed here:
http://dpubs.org/projects.html
We're talking about setting up a DPubS sandbox, where users could
play with the backend more, but we've not gotten to this.
For installation instructions, see the DPubS wiki:
http://dpubs.org/wiki
and look under "DPubS Installation Documentation". A good starting
place for all DPubS info is http://dpubs.org.
Let us know if you don't find what you need.
Best,
--David Ruddy
At 12:30 PM 5/13/2008, Dr. Kaushal Srivastava wrote:
>1. Is there any DEMO for DPubs ?
>2. Is there any Manual for "How to install DPubs" or Step-by-step
>instructions for installations of DPubs ?
>
>-Kaushal
>
>-------------------------------------------------------------------------
>This SF.net email is sponsored by: Microsoft
>Defy all challenges. Microsoft(R) Visual Studio 2008.
>http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
>_______________________________________________
>Dpubs-tech mailing list
>Dpu...@li...
>https://lists.sourceforge.net/lists/listinfo/dpubs-tech
|
|
From: David R. <dw...@co...> - 2007-09-24 22:06:45
|
Henry, I'm sending this on to the tech list. Interesting problem. I think I understand what you're trying to do. I suspect the Importer will need to be tweaked a bit to put these images (sort of supplemental images) into the repository in a way that makes them accessible to your html file. We do this now with in-line images encoded in the abstract element (in the metadata). But we've not thought about the similar situation within the content files (in this case an HTML file). We'll talk about it here in the next couple of days and devise a solution. You're on the cutting/bleeding edge, as far as delivering HTML ;-) The record_filename_set was designed for a sequence of files that together make up the entire object. For example, a list of TIFF page image files that together represent the object described in that metadata <record>. Or a set of PDF files that together represent the entire object. In this case, the images are just components and don't themselves, even taken together, comprise the entire object. We've tended to call stuff like this "supplemental" or "auxiliary" datastreams. --David At 03:50 PM 9/24/2007, H Griffy wrote: >Hi All, > >I've been exploring how to display images in html files, and have >not been able to figure out an elegant way to do it.\ > >The situation: in the new issue of the journal Disability Studies >Quarterly, one article contains a few images of charts and >graphs. We are wanting to publish the journal as html (rather than >creating pdfs) so need to create a link between the main content >file and the images. > >I have tried uploading the articles alongside the html file as part >of a record_filename_set group -- see excerpt from the journal-issue >xml below -- but DPubS doesn't ingest the image files into the >repository or otherwise recognize them as far as I can tell. > >I have come up with a functional workaround that seems like it would >suffice for the current issue but would become unwieldy over the >long term. Namely, I created a sub-folder within the htdocs folder >and uploaded the images there. I am then able to hard-code links to >the images, as in > ><img >src="/publication/dsq.v3/images/in_articles/dsq_v27_04_2007_fall_st_07_figure01.jpg" >alt="..."/> > >This workaround gets the images to display in the article, but >doesn't seem like it's the way images are supposed to be included. > >Any guidance would be much appreciated. > >Thanks, >Henry > > >PS. I've tried coding the set of files a few different ways. Below >is my most recent attempt: > ><record_filename >filetype="html">dsq_v27_04_2007_fall_st_07_barnartt.htm</record_filename> ><record_filename_set setID="1"> ><set_item seq="1" filename="dsq_v27_04_2007_fall_st_07_figure01.jpg" >filetype='jpg' /> ><set_item seq="2" filename="dsq_v27_04_2007_fall_st_07_figure02.jpg" >filetype='jpg' /> ><set_item seq="3" filename="dsq_v27_04_2007_fall_st_07_figure03.jpg" >filetype='jpg' /> ><set_item seq="4" filename="dsq_v27_04_2007_fall_st_07_figure04.jpg" >filetype='jpg' /> ></record_filename_set> > >-- >Henry Griffy >Program Coordinator, Documentation & Training >Scholarly Resources Integration Dept >The Ohio State University Libraries >600 Ackerman Rd. >Columbus, OH 43202 >Ph: 614-247-4663 >Fx: 614-292-2015 >------------------------------------------------------------------------- >This SF.net email is sponsored by: Microsoft >Defy all challenges. Microsoft(R) Visual Studio 2005. >http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/ >_______________________________________________ >Dpubs-content mailing list >Dpu...@li... >https://lists.sourceforge.net/lists/listinfo/dpubs-content |
|
From: H G. <gri...@gm...> - 2007-09-24 19:51:04
|
Hi All, I've been exploring how to display images in html files, and have not been able to figure out an elegant way to do it.\ The situation: in the new issue of the journal Disability Studies Quarterly, one article contains a few images of charts and graphs. We are wanting to publish the journal as html (rather than creating pdfs) so need to create a link between the main content file and the images. I have tried uploading the articles alongside the html file as part of a record_filename_set group -- see excerpt from the journal-issue xml below -- but DPubS doesn't ingest the image files into the repository or otherwise recognize them as far as I can tell. I have come up with a functional workaround that seems like it would suffice for the current issue but would become unwieldy over the long term. Namely, I created a sub-folder within the htdocs folder and uploaded the images there. I am then able to hard-code links to the images, as in <img src="/publication/dsq.v3/images/in_articles/dsq_v27_04_2007_fall_st_07_figure01.jpg" alt="..."/> This workaround gets the images to display in the article, but doesn't seem like it's the way images are supposed to be included. Any guidance would be much appreciated. Thanks, Henry PS. I've tried coding the set of files a few different ways. Below is my most recent attempt: <record_filename filetype="html">dsq_v27_04_2007_fall_st_07_barnartt.htm</record_filename> <record_filename_set setID="1"> <set_item seq="1" filename="dsq_v27_04_2007_fall_st_07_figure01.jpg" filetype='jpg' /> <set_item seq="2" filename="dsq_v27_04_2007_fall_st_07_figure02.jpg" filetype='jpg' /> <set_item seq="3" filename="dsq_v27_04_2007_fall_st_07_figure03.jpg" filetype='jpg' /> <set_item seq="4" filename="dsq_v27_04_2007_fall_st_07_figure04.jpg" filetype='jpg' /> </record_filename_set> -- Henry Griffy Program Coordinator, Documentation & Training Scholarly Resources Integration Dept The Ohio State University Libraries 600 Ackerman Rd. Columbus, OH 43202 Ph: 614-247-4663 Fx: 614-292-2015 |
|
From: David R. <dw...@co...> - 2007-07-05 23:18:47
|
Hi Henry, I'm forwarding this email to the tech list, where I think it belongs. Someone here should be able to give you some advice. We're within a day or two of a new release of DPubS--2.1. It has much improved the Lucene indexing and searching component, and may take care of all these problems. Though one of the experts on searching will have to say for sure. --David >Date: Thu, 05 Jul 2007 11:59:08 -0400 >To: Larry Wentzel <lr...@ps...> >From: Henry Griffy <gri...@os...> >Cc: dpu...@li... >Subject: Re: [Dpubs-content] Not Indexing > > >Hi Larry, > >Thanks for the reply and advice. > >Meanwhile, I've tried deleting and rebuilding the index from >scratch, but the content is still not showing up. If anyone could >help me figure out why, it'd be much appreciated. > >Specifically, I ran ../Services/Index/cleanup_indexes.sh > and then ran ../Services/Index/update_index.pl > >It seemed to run successfully with two qualifications: >1. While building the WAIS index, it reported the following error >three times: >Wide character in print at >/usr/local/DPubS/Services/Index/build-indexes.pl line 1157. > >but these don't seem to have been fatal errors, as it also reported: > >Completed Database Build at Thu, 5 Jul 2007 08:42:23 (Total Build >Time: 0:0:1) >2. The Lucene index produced a lot of errors (pasted as a p.s. >below, in case the details might be helpful.) The upshot is that >the Lucene service seems not to have been enabled. > > >Meanwhile, the bottom line is still that the content seems to have >been ingested, but nothing displays on the Current or Past Issues >pages. The name of the journal does display in the title bar, >suggesting that there is _some_ access to the publication metadata, >but there is no sign of the issue-level metadata. > >Hopefully this is enough information for a diagnosis? If not, >please let me know what else would need to be known > >Thanks, >Henry > >PS. This was the output following the Completed Database Build statement: >Starting Lucene >Exception in thread "main" java.lang.NoClassDefFoundError: while >resolving class >: org.apache.lucene.server.IndexServer > at java.lang.VMClassLoader.resolveClass(java.lang.Class) > (/usr/lib/libgcj.so. >5.0.0) > at java.lang.Class.initializeClass() (/usr/lib/libgcj.so.5.0.0) > at java.lang.Class.forName(java.lang.String, boolean, > java.lang.ClassLoader) >(/usr/lib/libgcj.so.5.0.0) > at java.lang.Class.forName(java.lang.String) (/usr/lib/libgcj.so.5.0.0) > at gnu.gcj.runtime.FirstThread.run() (/usr/lib/libgcj.so.5.0.0) > at _Jv_ThreadRun(java.lang.Thread) (/usr/lib/libgcj.so.5.0.0) > at _Jv_RunMain(java.lang.Class, byte const, int, byte const, > boolean) (/usr/l >ib/libgcj.so.5.0.0) > at __gcj_personality_v0 > (/usr/local/DPubS/Services/Index/java.version=1.4.2) > at __libc_start_main (/lib/tls/libc-2.3.4.so) > at _Jv_RegisterClasses > (/usr/local/DPubS/Services/Index/java.version=1.4.2) >Caused by: java.lang.ClassNotFoundException: >javax.xml.parsers.DocumentBuilderFa >ctory not found in >[file:/usr/local/DPubS/Services/Index/Lucene/lucene-core-1.9. >1.jar, file:/usr/local/DPubS/Services/Index/Lucene/jdom.jar, >file:/usr/local/DPu >bS/Services/Index/Lucene/luceneServer.jar, >file:/usr/local/DPubS/Services/Index/ >Lucene/jaxen-core.jar, >file:/usr/local/DPubS/Services/Index/Lucene/jaxen-jdom.ja >r, file:/usr/local/DPubS/Services/Index/Lucene/saxpath.jar, >file:/usr/share/java >/libgcj-3.4.6.jar, file:./, core:/] > at java.net.URLClassLoader.findClass(java.lang.String) > (/usr/lib/libgcj.so.5. >0.0) > at gnu.gcj.runtime.VMClassLoader.findClass(java.lang.String) > (/usr/lib/libgcj >.so.5.0.0) > at java.lang.ClassLoader.loadClass(java.lang.String, boolean) > (/usr/lib/libgc >j.so.5.0.0) > at _Jv_FindClass(_Jv_Utf8Const, java.lang.ClassLoader) > (/usr/lib/libgcj.so.5. >0.0) > at java.lang.Class.forName(java.lang.String, boolean, > java.lang.ClassLoader) >(/usr/lib/libgcj.so.5.0.0) > at > _Jv_BytecodeVerifier.type.compatible(_Jv_BytecodeVerifier.type&, _Jv_Bytec >odeVerifier) (/usr/lib/libgcj.so.5.0.0) > at _Jv_BytecodeVerifier.verify_instructions_0() > (/usr/lib/libgcj.so.5.0.0) > at _Jv_VerifyMethod(_Jv_InterpMethod) (/usr/lib/libgcj.so.5.0.0) > at _Jv_PrepareClass(java.lang.Class) (/usr/lib/libgcj.so.5.0.0) > at _Jv_WaitForState(java.lang.Class, int) (/usr/lib/libgcj.so.5.0.0) > at java.lang.VMClassLoader.linkClass0(java.lang.Class) > (/usr/lib/libgcj.so.5. >0.0) > at java.lang.VMClassLoader.resolveClass(java.lang.Class) > (/usr/lib/libgcj.so. >5.0.0) > ...9 more >IndexServer error : Operation not allowed >Exception in thread "main" java.lang.ExceptionInInitializerError >*** Got java.lang.NoClassDefFoundError: gnu.gcj.runtime.NameFinder >while trying >to print stack trace. > > > > >At 11:36 AM 7/3/2007, Larry Wentzel wrote: >>Henry, >> >>I believe we ran into a similar problem here at Penn State. Issues >>that loaded successfully and went into the repository didn't show >>up anywhere. There's a (recent? had to be, never had this problem >>before) bug in the indexing service which fails to update the index >>after a certain point. I don't know the actual details, not being >>a developer or programmer myself. The only way we got around it >>was to delete the index and rebuild it from scratch, which takes a >>bit of time (ymmv). >> >>Cornell fixed this problem in an nigh-ready-for-primetime release >>which should come out this week or next. Watch the SourceForge >>site for updates. >> >>--Larry Wentzel >>--Digital Preservation Coordinator >>--Penn State Libraries >> >>Henry Griffy wrote: >>>Hi All, >>> >>>I've reached another impasse and hope that y'all might once again help. >>> >>>The situation is this: >>> * I've prepared the xml for the latest issue of our journal, >>> FTP'd it to the server, and run the DPubs_Importer.pl script with no errors. >>> * I've run the Index/update_index.pl script with no errors >>> (neither reported nor in the <2007-07-04-15-00-update.log-errors> file) >>> * I've restarted the Apache server, again with no reported errors. >>>However, when I go to look at the site via the UI service, no >>>contents appear for either the current or past pages. (And I'm >>>using the demo.ph User Interface files.) Further, when I look >>>through the folders in the Index directory, I see no trace of our >>>journal. The zindex-data/bibs folder doesn't contain a sub-folder >>>for our authority, and the log in the zupdate-staging/update-logs/ >>>folder reports that it "Indexed 0 new bibliographic records," as >>>do all the previous logs for times I've run the script. >>> >>>I thought the problem might be that I'm trying to upload the >>>content in html format, so I converted one of the files to pdf, >>>added a reference to that version of the file in the issue's xml >>>file, and went back through the process. Still nothing. >>> >>>Any ideas about where I might be going wrong / what I might be leaving out? >>> >>>Thanks, >>>Henry >>> >>>Henry Griffy, Program Coordinator >>>Documentation and Training Unit >>>Scholarly Resources Integration Department >>>Technical Services Division >>>The Ohio State University Libraries - Ackerman Road >>>600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 >>>E-mail: <mailto:gri...@os...>gri...@os... >>>Ph: 614-247-4663; Fax: 614-292-2015 >>> >>> >>> >>> >>> >>>------------------------------------------------------------------------- >>>This SF.net email is sponsored by DB2 Express >>>Download DB2 Express C - the FREE version of DB2 express and take >>>control of your XML. No limits. Just data. Click to get it now. >>> >>>http://sourceforge.net/powerbar/db2/ >>> >>> >>> >>> >>>_______________________________________________ >>>Dpubs-content mailing list >>><mailto:Dpu...@li...> >>>Dpu...@li... >>> >>>https://lists.sourceforge.net/lists/listinfo/dpubs-content >>> >> >> >>---------- >> >><https://antispam.osu.edu/b.php?c=s&i=354901194&m=69e86cf44d88>Spam >><https://antispam.osu.edu/b.php?c=n&i=354901194&m=69e86cf44d88>Not spam >><https://antispam.osu.edu/b.php?c=f&i=354901194&m=69e86cf44d88>Forget >>previous vote > >Henry Griffy, Program Coordinator >Documentation and Training Unit >Scholarly Resources Integration Department >Technical Services Division >The Ohio State University Libraries - Ackerman Road >600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 >E-mail: <mailto:gri...@os...>gri...@os... >Ph: 614-247-4663; Fax: 614-292-2015 >------------------------------------------------------------------------- >This SF.net email is sponsored by DB2 Express >Download DB2 Express C - the FREE version of DB2 express and take >control of your XML. No limits. Just data. Click to get it now. >http://sourceforge.net/powerbar/db2/ >_______________________________________________ >Dpubs-content mailing list >Dpu...@li... >https://lists.sourceforge.net/lists/listinfo/dpubs-content |
|
From: Henry G. <gri...@os...> - 2007-07-05 18:31:58
|
I am not. The result of any search -- including one author search for a lastname that should be there; and one full-text search for a word that should appear in nearly every article -- was: Search Results Your query returned no documents Your search criteria did not match any documents. --H At 01:20 PM 7/5/2007, Larry Wentzel wrote: >Henry, > >One thing: are you able to Search the Archive and find the material >through title/author/full text searches? > >--Larry > >Henry Griffy wrote: >>Hi Larry, >> >>Thanks for the reply and advice. >> >>Meanwhile, I've tried deleting and rebuilding the index from >>scratch, but the content is still not showing up. If anyone could >>help me figure out why, it'd be much appreciated. >> >>Specifically, I ran ../Services/Index/cleanup_indexes.sh >> and then ran ../Services/Index/update_index.pl >> >>It seemed to run successfully with two qualifications: >>1. While building the WAIS index, it reported the following error >>three times: >>Wide character in print at >>/usr/local/DPubS/Services/Index/build-indexes.pl line 1157. >> >>but these don't seem to have been fatal errors, as it also reported: >> >>Completed Database Build at Thu, 5 Jul 2007 08:42:23 (Total Build >>Time: 0:0:1) >>2. The Lucene index produced a lot of errors (pasted as a p.s. >>below, in case the details might be helpful.) The upshot is that >>the Lucene service seems not to have been enabled. >> >> >>Meanwhile, the bottom line is still that the content seems to have >>been ingested, but nothing displays on the Current or Past Issues >>pages. The name of the journal does display in the title bar, >>suggesting that there is _some_ access to the publication metadata, >>but there is no sign of the issue-level metadata. >> >>Hopefully this is enough information for a diagnosis? If not, >>please let me know what else would need to be known >> >>Thanks, >>Henry >> >>PS. This was the output following the Completed Database Build statement: >>Starting Lucene >>Exception in thread "main" java.lang.NoClassDefFoundError: while >>resolving class >>: org.apache.lucene.server.IndexServer >> at java.lang.VMClassLoader.resolveClass(java.lang.Class) >> (/usr/lib/libgcj.so. >>5.0.0) >> at java.lang.Class.initializeClass() (/usr/lib/libgcj.so.5.0.0) >> at java.lang.Class.forName(java.lang.String, boolean, >> java.lang.ClassLoader) >>(/usr/lib/libgcj.so.5.0.0) >> at java.lang.Class.forName(java.lang.String) (/usr/lib/libgcj.so.5.0.0) >> at gnu.gcj.runtime.FirstThread.run() (/usr/lib/libgcj.so.5.0.0) >> at _Jv_ThreadRun(java.lang.Thread) (/usr/lib/libgcj.so.5.0.0) >> at _Jv_RunMain(java.lang.Class, byte const, int, byte const, >> boolean) (/usr/l >>ib/libgcj.so.5.0.0) >> at __gcj_personality_v0 >> (/usr/local/DPubS/Services/Index/java.version=1.4.2) >> at __libc_start_main (/lib/tls/libc-2.3.4.so) >> at _Jv_RegisterClasses >> (/usr/local/DPubS/Services/Index/java.version=1.4.2) >>Caused by: java.lang.ClassNotFoundException: >>javax.xml.parsers.DocumentBuilderFa >>ctory not found in >>[<file:/usr/local/DPubS/Services/Index/Lucene/lucene-core-1.9>file:/usr/local/DPubS/Services/Index/Lucene/lucene-core-1.9. >> >>1.jar, file:/usr/local/DPubS/Services/Index/Lucene/jdom.jar, >>file:/usr/local/DPu >>bS/Services/Index/Lucene/luceneServer.jar, >>file:/usr/local/DPubS/Services/Index/ >>Lucene/jaxen-core.jar, >>file:/usr/local/DPubS/Services/Index/Lucene/jaxen-jdom.ja >>r, file:/usr/local/DPubS/Services/Index/Lucene/saxpath.jar, >>file:/usr/share/java >>/libgcj-3.4.6.jar, file:./, core:/] >> at java.net.URLClassLoader.findClass(java.lang.String) >> (/usr/lib/libgcj.so.5. >>0.0) >> at gnu.gcj.runtime.VMClassLoader.findClass(java.lang.String) >> (/usr/lib/libgcj >>.so.5.0.0) >> at java.lang.ClassLoader.loadClass(java.lang.String, boolean) >> (/usr/lib/libgc >>j.so.5.0.0) >> at _Jv_FindClass(_Jv_Utf8Const, java.lang.ClassLoader) >> (/usr/lib/libgcj.so.5. >>0.0) >> at java.lang.Class.forName(java.lang.String, boolean, >> java.lang.ClassLoader) >>(/usr/lib/libgcj.so.5.0.0) >> at >> _Jv_BytecodeVerifier.type.compatible(_Jv_BytecodeVerifier.type&, _Jv_Bytec >>odeVerifier) (/usr/lib/libgcj.so.5.0.0) >> at _Jv_BytecodeVerifier.verify_instructions_0() >> (/usr/lib/libgcj.so.5.0.0) >> at _Jv_VerifyMethod(_Jv_InterpMethod) (/usr/lib/libgcj.so.5.0.0) >> at _Jv_PrepareClass(java.lang.Class) (/usr/lib/libgcj.so.5.0.0) >> at _Jv_WaitForState(java.lang.Class, int) (/usr/lib/libgcj.so.5.0.0) >> at java.lang.VMClassLoader.linkClass0(java.lang.Class) >> (/usr/lib/libgcj.so.5. >>0.0) >> at java.lang.VMClassLoader.resolveClass(java.lang.Class) >> (/usr/lib/libgcj.so. >>5.0.0) >> ...9 more >>IndexServer error : Operation not allowed >>Exception in thread "main" java.lang.ExceptionInInitializerError >>*** Got java.lang.NoClassDefFoundError: gnu.gcj.runtime.NameFinder >>while trying >>to print stack trace. >> >> >> >>At 11:36 AM 7/3/2007, Larry Wentzel wrote: >>>Henry, >>> >>>I believe we ran into a similar problem here at Penn >>>State. Issues that loaded successfully and went into the >>>repository didn't show up anywhere. There's a (recent? had to be, >>>never had this problem before) bug in the indexing service which >>>fails to update the index after a certain point. I don't know the >>>actual details, not being a developer or programmer myself. The >>>only way we got around it was to delete the index and rebuild it >>>from scratch, which takes a bit of time (ymmv). >>> >>>Cornell fixed this problem in an nigh-ready-for-primetime release >>>which should come out this week or next. Watch the SourceForge >>>site for updates. >>> >>>--Larry Wentzel >>>--Digital Preservation Coordinator >>>--Penn State Libraries >>> >>>Henry Griffy wrote: >>>>Hi All, >>>> >>>>I've reached another impasse and hope that y'all might once again help. >>>> >>>>The situation is this: >>>> * I've prepared the xml for the latest issue of our journal, >>>> FTP'd it to the server, and run the DPubs_Importer.pl script with no errors. >>>> * I've run the Index/update_index.pl script with no errors >>>> (neither reported nor in the <2007-07-04-15-00-update.log-errors> file) >>>> * I've restarted the Apache server, again with no reported errors. >>>>However, when I go to look at the site via the UI service, no >>>>contents appear for either the current or past pages. (And I'm >>>>using the demo.ph User Interface files.) Further, when I look >>>>through the folders in the Index directory, I see no trace of our >>>>journal. The zindex-data/bibs folder doesn't contain a >>>>sub-folder for our authority, and the log in the >>>>zupdate-staging/update-logs/ folder reports that it "Indexed 0 >>>>new bibliographic records," as do all the previous logs for times >>>>I've run the script. >>>> >>>>I thought the problem might be that I'm trying to upload the >>>>content in html format, so I converted one of the files to pdf, >>>>added a reference to that version of the file in the issue's xml >>>>file, and went back through the process. Still nothing. >>>> >>>>Any ideas about where I might be going wrong / what I might be leaving out? >>>> >>>>Thanks, >>>>Henry >>>> >>>>Henry Griffy, Program Coordinator >>>>Documentation and Training Unit >>>>Scholarly Resources Integration Department >>>>Technical Services Division >>>>The Ohio State University Libraries - Ackerman Road >>>>600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 >>>>E-mail: <mailto:gri...@os...>gri...@os... >>>>Ph: 614-247-4663; Fax: 614-292-2015 >>>> >>>> >>>> >>>> >>>>------------------------------------------------------------------------- >>>>This SF.net email is sponsored by DB2 Express >>>>Download DB2 Express C - the FREE version of DB2 express and take >>>>control of your XML. No limits. Just data. Click to get it now. >>>> >>>>http://sourceforge.net/powerbar/db2/ >>>> >>>> >>>> >>>>_______________________________________________ >>>>Dpubs-content mailing list >>>><mailto:Dpu...@li...> >>>>Dpu...@li... >>>> >>>>https://lists.sourceforge.net/lists/listinfo/dpubs-content >>>> >>>> >>> >>> >>>---------- >>> >>><https://antispam.osu.edu/b.php?c=s&i=354901194&m=69e86cf44d88>Spam >>><https://antispam.osu.edu/b.php?c=n&i=354901194&m=69e86cf44d88>Not spam >>><https://antispam.osu.edu/b.php?c=f&i=354901194&m=69e86cf44d88>Forget >>>previous vote >> >>Henry Griffy, Program Coordinator >>Documentation and Training Unit >>Scholarly Resources Integration Department >>Technical Services Division >>The Ohio State University Libraries - Ackerman Road >>600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 >>E-mail: <mailto:gri...@os...>gri...@os... >>Ph: 614-247-4663; Fax: 614-292-2015 > > >---------- > ><https://antispam.osu.edu/b.php?c=s&i=356838446&m=208c53fea5b0>Spam ><https://antispam.osu.edu/b.php?c=n&i=356838446&m=208c53fea5b0>Not spam ><https://antispam.osu.edu/b.php?c=f&i=356838446&m=208c53fea5b0>Forget >previous vote Henry Griffy, Program Coordinator Documentation and Training Unit Scholarly Resources Integration Department Technical Services Division The Ohio State University Libraries - Ackerman Road 600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 E-mail: <mailto:gri...@os...>gri...@os... Ph: 614-247-4663; Fax: 614-292-2015 |
|
From: Larry W. <lr...@ps...> - 2007-07-05 17:20:51
|
Henry, One thing: are you able to Search the Archive and find the material through title/author/full text searches? --Larry Henry Griffy wrote: > Hi Larry, > > Thanks for the reply and advice. > > Meanwhile, I've tried deleting and rebuilding the index from scratch, > but the content is still not showing up. If anyone could help me > figure out why, it'd be much appreciated. > > Specifically, I ran ../Services/Index/cleanup_indexes.sh > and then ran ../Services/Index/update_index.pl > > It seemed to run successfully with two qualifications: > 1. While building the WAIS index, it reported the following error > three times: > > Wide character in print at > /usr/local/DPubS/Services/Index/build-indexes.pl line 1157. > > but these don't seem to have been fatal errors, as it also reported: > > Completed Database Build at Thu, 5 Jul 2007 08:42:23 (Total Build > Time: 0:0:1) > > 2. The Lucene index produced a lot of errors (pasted as a p.s. below, > in case the details might be helpful.) The upshot is that the Lucene > service seems not to have been enabled. > > > Meanwhile, the bottom line is still that the content seems to have > been ingested, but nothing displays on the Current or Past Issues > pages. The name of the journal does display in the title bar, > suggesting that there is _some_ access to the publication metadata, > but there is no sign of the issue-level metadata. > > Hopefully this is enough information for a diagnosis? If not, please > let me know what else would need to be known > > Thanks, > Henry > > PS. This was the output following the Completed Database Build > statement: > > Starting Lucene > Exception in thread "main" java.lang.NoClassDefFoundError: while > resolving class > : org.apache.lucene.server.IndexServer > at java.lang.VMClassLoader.resolveClass(java.lang.Class) > (/usr/lib/libgcj.so. > 5.0.0) > at java.lang.Class.initializeClass() (/usr/lib/libgcj.so.5.0.0) > at java.lang.Class.forName(java.lang.String, boolean, > java.lang.ClassLoader) > (/usr/lib/libgcj.so.5.0.0) > at java.lang.Class.forName(java.lang.String) > (/usr/lib/libgcj.so.5.0.0) > at gnu.gcj.runtime.FirstThread.run() (/usr/lib/libgcj.so.5.0.0) > at _Jv_ThreadRun(java.lang.Thread) (/usr/lib/libgcj.so.5.0.0) > at _Jv_RunMain(java.lang.Class, byte const, int, byte const, > boolean) (/usr/l > ib/libgcj.so.5.0.0) > at __gcj_personality_v0 > (/usr/local/DPubS/Services/Index/java.version=1.4.2) > at __libc_start_main (/lib/tls/libc-2.3.4.so) > at _Jv_RegisterClasses > (/usr/local/DPubS/Services/Index/java.version=1.4.2) > Caused by: java.lang.ClassNotFoundException: > javax.xml.parsers.DocumentBuilderFa > ctory not found in > [file:/usr/local/DPubS/Services/Index/Lucene/lucene-core-1.9. > 1.jar, file:/usr/local/DPubS/Services/Index/Lucene/jdom.jar > <file://%5Cusr%5Clocal%5CDPubS%5CServices%5CIndex%5CLucene%5Cjdom.jar>, > file:/usr/local/DPu <file://%5Cusr%5Clocal%5CDPu> > bS/Services/Index/Lucene/luceneServer.jar, > file:/usr/local/DPubS/Services/Index/ > <file://%5Cusr%5Clocal%5CDPubS%5CServices%5CIndex%5C> > Lucene/jaxen-core.jar, > file:/usr/local/DPubS/Services/Index/Lucene/jaxen-jdom.ja > <file://%5Cusr%5Clocal%5CDPubS%5CServices%5CIndex%5CLucene%5Cjaxen-jdom.ja> > > r, file:/usr/local/DPubS/Services/Index/Lucene/saxpath.jar > <file://%5Cusr%5Clocal%5CDPubS%5CServices%5CIndex%5CLucene%5Csaxpath.jar>, > file:/usr/share/java <file://%5Cusr%5Cshare%5Cjava> > /libgcj-3.4.6.jar, file:./ <file:/>, core:/] > at java.net.URLClassLoader.findClass(java.lang.String) > (/usr/lib/libgcj.so.5. > 0.0) > at gnu.gcj.runtime.VMClassLoader.findClass(java.lang.String) > (/usr/lib/libgcj > .so.5.0.0) > at java.lang.ClassLoader.loadClass(java.lang.String, boolean) > (/usr/lib/libgc > j.so.5.0.0) > at _Jv_FindClass(_Jv_Utf8Const, java.lang.ClassLoader) > (/usr/lib/libgcj.so.5. > 0.0) > at java.lang.Class.forName(java.lang.String, boolean, > java.lang.ClassLoader) > (/usr/lib/libgcj.so.5.0.0) > at > _Jv_BytecodeVerifier.type.compatible(_Jv_BytecodeVerifier.type&, > _Jv_Bytec > odeVerifier) (/usr/lib/libgcj.so.5.0.0) > at _Jv_BytecodeVerifier.verify_instructions_0() > (/usr/lib/libgcj.so.5.0.0) > at _Jv_VerifyMethod(_Jv_InterpMethod) (/usr/lib/libgcj.so.5.0.0) > at _Jv_PrepareClass(java.lang.Class) (/usr/lib/libgcj.so.5.0.0) > at _Jv_WaitForState(java.lang.Class, int) > (/usr/lib/libgcj.so.5.0.0) > at java.lang.VMClassLoader.linkClass0(java.lang.Class) > (/usr/lib/libgcj.so.5. > 0.0) > at java.lang.VMClassLoader.resolveClass(java.lang.Class) > (/usr/lib/libgcj.so. > 5.0.0) > ...9 more > IndexServer error : Operation not allowed > Exception in thread "main" java.lang.ExceptionInInitializerError > *** Got java.lang.NoClassDefFoundError: gnu.gcj.runtime.NameFinder > while trying > to print stack trace. > > > > > At 11:36 AM 7/3/2007, Larry Wentzel wrote: >> Henry, >> >> I believe we ran into a similar problem here at Penn State. Issues >> that loaded successfully and went into the repository didn't show up >> anywhere. There's a (recent? had to be, never had this problem >> before) bug in the indexing service which fails to update the index >> after a certain point. I don't know the actual details, not being a >> developer or programmer myself. The only way we got around it was to >> delete the index and rebuild it from scratch, which takes a bit of >> time (ymmv). >> >> Cornell fixed this problem in an nigh-ready-for-primetime release >> which should come out this week or next. Watch the SourceForge site >> for updates. >> >> --Larry Wentzel >> --Digital Preservation Coordinator >> --Penn State Libraries >> >> Henry Griffy wrote: >>> Hi All, >>> >>> I've reached another impasse and hope that y'all might once again help. >>> >>> The situation is this: >>> >>> * I've prepared the xml for the latest issue of our journal, >>> FTP'd it to the server, and run the DPubs_Importer.pl script >>> with no errors. >>> * I've run the Index/update_index.pl script with no errors >>> (neither reported nor in the >>> <2007-07-04-15-00-update.log-errors> file) >>> * I've restarted the Apache server, again with no reported errors. >>> >>> However, when I go to look at the site via the UI service, no >>> contents appear for either the current or past pages. (And I'm >>> using the demo.ph User Interface files.) Further, when I look >>> through the folders in the Index directory, I see no trace of our >>> journal. The zindex-data/bibs folder doesn't contain a sub-folder >>> for our authority, and the log in the zupdate-staging/update-logs/ >>> folder reports that it "Indexed 0 new bibliographic records," as do >>> all the previous logs for times I've run the script. >>> >>> I thought the problem might be that I'm trying to upload the content >>> in html format, so I converted one of the files to pdf, added a >>> reference to that version of the file in the issue's xml file, and >>> went back through the process. Still nothing. >>> >>> Any ideas about where I might be going wrong / what I might be >>> leaving out? >>> >>> Thanks, >>> Henry >>> >>> Henry Griffy, Program Coordinator >>> Documentation and Training Unit >>> Scholarly Resources Integration Department >>> Technical Services Division >>> The Ohio State University Libraries - Ackerman Road >>> 600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 >>> E-mail: gri...@os... <mailto:gri...@os...> >>> Ph: 614-247-4663; Fax: 614-292-2015 >>> >>> >>> ------------------------------------------------------------------------- >>> This SF.net email is sponsored by DB2 Express >>> Download DB2 Express C - the FREE version of DB2 express and take >>> control of your XML. No limits. Just data. Click to get it now. >>> >>> http://sourceforge.net/powerbar/db2/ >>> >>> >>> _______________________________________________ >>> Dpubs-content mailing list >>> >>> Dpu...@li... <mailto:Dpu...@li...> >>> >>> https://lists.sourceforge.net/lists/listinfo/dpubs-content >>> >>> >> >> ------------------------------------------------------------------------ >> >> Spam <https://antispam.osu.edu/b.php?c=s&i=354901194&m=69e86cf44d88> >> Not spam <https://antispam.osu.edu/b.php?c=n&i=354901194&m=69e86cf44d88> >> Forget previous vote >> <https://antispam.osu.edu/b.php?c=f&i=354901194&m=69e86cf44d88> > > Henry Griffy, Program Coordinator > Documentation and Training Unit > Scholarly Resources Integration Department > Technical Services Division > The Ohio State University Libraries - Ackerman Road > 600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 > E-mail: gri...@os... <mailto:gri...@os...> > Ph: 614-247-4663; Fax: 614-292-2015 > |
|
From: Henry G. <gri...@os...> - 2007-07-05 15:59:25
|
Hi Larry,
Thanks for the reply and advice.
Meanwhile, I've tried deleting and rebuilding the index from scratch,
but the content is still not showing up. If anyone could help me
figure out why, it'd be much appreciated.
Specifically, I ran ../Services/Index/cleanup_indexes.sh
and then ran ../Services/Index/update_index.pl
It seemed to run successfully with two qualifications:
1. While building the WAIS index, it reported the following error
three times:
Wide character in print at
/usr/local/DPubS/Services/Index/build-indexes.pl line 1157.
but these don't seem to have been fatal errors, as it also reported:
Completed Database Build at Thu, 5 Jul 2007 08:42:23 (Total Build Time: 0:0:1)
2. The Lucene index produced a lot of errors (pasted as a p.s. below,
in case the details might be helpful.) The upshot is that the Lucene
service seems not to have been enabled.
Meanwhile, the bottom line is still that the content seems to have
been ingested, but nothing displays on the Current or Past Issues
pages. The name of the journal does display in the title bar,
suggesting that there is _some_ access to the publication metadata,
but there is no sign of the issue-level metadata.
Hopefully this is enough information for a diagnosis? If not, please
let me know what else would need to be known
Thanks,
Henry
PS. This was the output following the Completed Database Build statement:
Starting Lucene
Exception in thread "main" java.lang.NoClassDefFoundError: while
resolving class
: org.apache.lucene.server.IndexServer
at java.lang.VMClassLoader.resolveClass(java.lang.Class)
(/usr/lib/libgcj.so.
5.0.0)
at java.lang.Class.initializeClass() (/usr/lib/libgcj.so.5.0.0)
at java.lang.Class.forName(java.lang.String, boolean,
java.lang.ClassLoader)
(/usr/lib/libgcj.so.5.0.0)
at java.lang.Class.forName(java.lang.String) (/usr/lib/libgcj.so.5.0.0)
at gnu.gcj.runtime.FirstThread.run() (/usr/lib/libgcj.so.5.0.0)
at _Jv_ThreadRun(java.lang.Thread) (/usr/lib/libgcj.so.5.0.0)
at _Jv_RunMain(java.lang.Class, byte const, int, byte const,
boolean) (/usr/l
ib/libgcj.so.5.0.0)
at __gcj_personality_v0
(/usr/local/DPubS/Services/Index/java.version=1.4.2)
at __libc_start_main (/lib/tls/libc-2.3.4.so)
at _Jv_RegisterClasses
(/usr/local/DPubS/Services/Index/java.version=1.4.2)
Caused by: java.lang.ClassNotFoundException:
javax.xml.parsers.DocumentBuilderFa
ctory not found in
[file:/usr/local/DPubS/Services/Index/Lucene/lucene-core-1.9.
1.jar, file:/usr/local/DPubS/Services/Index/Lucene/jdom.jar,
file:/usr/local/DPu
bS/Services/Index/Lucene/luceneServer.jar,
file:/usr/local/DPubS/Services/Index/
Lucene/jaxen-core.jar,
file:/usr/local/DPubS/Services/Index/Lucene/jaxen-jdom.ja
r, file:/usr/local/DPubS/Services/Index/Lucene/saxpath.jar,
file:/usr/share/java
/libgcj-3.4.6.jar, file:./, core:/]
at java.net.URLClassLoader.findClass(java.lang.String)
(/usr/lib/libgcj.so.5.
0.0)
at gnu.gcj.runtime.VMClassLoader.findClass(java.lang.String)
(/usr/lib/libgcj
.so.5.0.0)
at java.lang.ClassLoader.loadClass(java.lang.String, boolean)
(/usr/lib/libgc
j.so.5.0.0)
at _Jv_FindClass(_Jv_Utf8Const, java.lang.ClassLoader)
(/usr/lib/libgcj.so.5.
0.0)
at java.lang.Class.forName(java.lang.String, boolean,
java.lang.ClassLoader)
(/usr/lib/libgcj.so.5.0.0)
at
_Jv_BytecodeVerifier.type.compatible(_Jv_BytecodeVerifier.type&, _Jv_Bytec
odeVerifier) (/usr/lib/libgcj.so.5.0.0)
at _Jv_BytecodeVerifier.verify_instructions_0() (/usr/lib/libgcj.so.5.0.0)
at _Jv_VerifyMethod(_Jv_InterpMethod) (/usr/lib/libgcj.so.5.0.0)
at _Jv_PrepareClass(java.lang.Class) (/usr/lib/libgcj.so.5.0.0)
at _Jv_WaitForState(java.lang.Class, int) (/usr/lib/libgcj.so.5.0.0)
at java.lang.VMClassLoader.linkClass0(java.lang.Class)
(/usr/lib/libgcj.so.5.
0.0)
at java.lang.VMClassLoader.resolveClass(java.lang.Class)
(/usr/lib/libgcj.so.
5.0.0)
...9 more
IndexServer error : Operation not allowed
Exception in thread "main" java.lang.ExceptionInInitializerError
*** Got java.lang.NoClassDefFoundError: gnu.gcj.runtime.NameFinder
while trying
to print stack trace.
At 11:36 AM 7/3/2007, Larry Wentzel wrote:
>Henry,
>
>I believe we ran into a similar problem here at Penn State. Issues
>that loaded successfully and went into the repository didn't show up
>anywhere. There's a (recent? had to be, never had this problem
>before) bug in the indexing service which fails to update the index
>after a certain point. I don't know the actual details, not being a
>developer or programmer myself. The only way we got around it was
>to delete the index and rebuild it from scratch, which takes a bit
>of time (ymmv).
>
>Cornell fixed this problem in an nigh-ready-for-primetime release
>which should come out this week or next. Watch the SourceForge site
>for updates.
>
>--Larry Wentzel
>--Digital Preservation Coordinator
>--Penn State Libraries
>
>Henry Griffy wrote:
>>Hi All,
>>
>>I've reached another impasse and hope that y'all might once again help.
>>
>>The situation is this:
>> * I've prepared the xml for the latest issue of our journal,
>> FTP'd it to the server, and run the DPubs_Importer.pl script with no errors.
>> * I've run the Index/update_index.pl script with no errors
>> (neither reported nor in the <2007-07-04-15-00-update.log-errors> file)
>> * I've restarted the Apache server, again with no reported errors.
>>However, when I go to look at the site via the UI service, no
>>contents appear for either the current or past pages. (And I'm
>>using the demo.ph User Interface files.) Further, when I look
>>through the folders in the Index directory, I see no trace of our
>>journal. The zindex-data/bibs folder doesn't contain a sub-folder
>>for our authority, and the log in the zupdate-staging/update-logs/
>>folder reports that it "Indexed 0 new bibliographic records," as do
>>all the previous logs for times I've run the script.
>>
>>I thought the problem might be that I'm trying to upload the
>>content in html format, so I converted one of the files to pdf,
>>added a reference to that version of the file in the issue's xml
>>file, and went back through the process. Still nothing.
>>
>>Any ideas about where I might be going wrong / what I might be leaving out?
>>
>>Thanks,
>>Henry
>>
>>Henry Griffy, Program Coordinator
>>Documentation and Training Unit
>>Scholarly Resources Integration Department
>>Technical Services Division
>>The Ohio State University Libraries - Ackerman Road
>>600 Ackerman Rd., Rm. 5839, Columbus, OH 43202
>>E-mail: <mailto:gri...@os...>gri...@os...
>>Ph: 614-247-4663; Fax: 614-292-2015
>>
>>
>>
>>
>>-------------------------------------------------------------------------
>>This SF.net email is sponsored by DB2 Express
>>Download DB2 Express C - the FREE version of DB2 express and take
>>control of your XML. No limits. Just data. Click to get it now.
>><http://sourceforge.net/powerbar/db2/>http://sourceforge.net/powerbar/db2/
>>
>>
>>
>>_______________________________________________
>>Dpubs-content mailing list
>><mailto:Dpu...@li...>Dpu...@li...
>>https://lists.sourceforge.net/lists/listinfo/dpubs-content
>>
>
>
>----------
>
><https://antispam.osu.edu/b.php?c=s&i=354901194&m=69e86cf44d88>Spam
><https://antispam.osu.edu/b.php?c=n&i=354901194&m=69e86cf44d88>Not spam
><https://antispam.osu.edu/b.php?c=f&i=354901194&m=69e86cf44d88>Forget
>previous vote
Henry Griffy, Program Coordinator
Documentation and Training Unit
Scholarly Resources Integration Department
Technical Services Division
The Ohio State University Libraries - Ackerman Road
600 Ackerman Rd., Rm. 5839, Columbus, OH 43202
E-mail: <mailto:gri...@os...>gri...@os...
Ph: 614-247-4663; Fax: 614-292-2015 |
|
From: Larry W. <lr...@ps...> - 2007-07-03 15:36:47
|
Henry, I believe we ran into a similar problem here at Penn State. Issues that loaded successfully and went into the repository didn't show up anywhere. There's a (recent? had to be, never had this problem before) bug in the indexing service which fails to update the index after a certain point. I don't know the actual details, not being a developer or programmer myself. The only way we got around it was to delete the index and rebuild it from scratch, which takes a bit of time (ymmv). Cornell fixed this problem in an nigh-ready-for-primetime release which should come out this week or next. Watch the SourceForge site for updates. --Larry Wentzel --Digital Preservation Coordinator --Penn State Libraries Henry Griffy wrote: > Hi All, > > I've reached another impasse and hope that y'all might once again help. > > The situation is this: > > * I've prepared the xml for the latest issue of our journal, FTP'd > it to the server, and run the DPubs_Importer.pl script with no > errors. > * I've run the Index/update_index.pl script with no errors > (neither reported nor in the > <2007-07-04-15-00-update.log-errors> file) > * I've restarted the Apache server, again with no reported errors. > > However, when I go to look at the site via the UI service, no contents > appear for either the current or past pages. (And I'm using the > demo.ph User Interface files.) Further, when I look through the > folders in the Index directory, I see no trace of our journal. The > zindex-data/bibs folder doesn't contain a sub-folder for our > authority, and the log in the zupdate-staging/update-logs/ folder > reports that it "Indexed 0 new bibliographic records," as do all the > previous logs for times I've run the script. > > I thought the problem might be that I'm trying to upload the content > in html format, so I converted one of the files to pdf, added a > reference to that version of the file in the issue's xml file, and > went back through the process. Still nothing. > > Any ideas about where I might be going wrong / what I might be leaving > out? > > Thanks, > Henry > > Henry Griffy, Program Coordinator > Documentation and Training Unit > Scholarly Resources Integration Department > Technical Services Division > The Ohio State University Libraries - Ackerman Road > 600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 > E-mail: gri...@os... <mailto:gri...@os...> > Ph: 614-247-4663; Fax: 614-292-2015 > > ------------------------------------------------------------------------ > > ------------------------------------------------------------------------- > This SF.net email is sponsored by DB2 Express > Download DB2 Express C - the FREE version of DB2 express and take > control of your XML. No limits. Just data. Click to get it now. > http://sourceforge.net/powerbar/db2/ > ------------------------------------------------------------------------ > > _______________________________________________ > Dpubs-content mailing list > Dpu...@li... > https://lists.sourceforge.net/lists/listinfo/dpubs-content > |
|
From: Henry G. <gri...@os...> - 2007-07-03 15:17:25
|
Hi All,
I've reached another impasse and hope that y'all might once again help.
The situation is this:
* I've prepared the xml for the latest issue of our journal,
FTP'd it to the server, and run the DPubs_Importer.pl script with no errors.
* I've run the Index/update_index.pl script with no errors
(neither reported nor in the <2007-07-04-15-00-update.log-errors> file)
* I've restarted the Apache server, again with no reported errors.
However, when I go to look at the site via the UI service, no
contents appear for either the current or past pages. (And I'm using
the demo.ph User Interface files.) Further, when I look through the
folders in the Index directory, I see no trace of our journal. The
zindex-data/bibs folder doesn't contain a sub-folder for our
authority, and the log in the zupdate-staging/update-logs/ folder
reports that it "Indexed 0 new bibliographic records," as do all the
previous logs for times I've run the script.
I thought the problem might be that I'm trying to upload the content
in html format, so I converted one of the files to pdf, added a
reference to that version of the file in the issue's xml file, and
went back through the process. Still nothing.
Any ideas about where I might be going wrong / what I might be leaving out?
Thanks,
Henry
Henry Griffy, Program Coordinator
Documentation and Training Unit
Scholarly Resources Integration Department
Technical Services Division
The Ohio State University Libraries - Ackerman Road
600 Ackerman Rd., Rm. 5839, Columbus, OH 43202
E-mail: <mailto:gri...@os...>gri...@os...
Ph: 614-247-4663; Fax: 614-292-2015 |
|
From: David R. <dw...@co...> - 2007-06-28 08:38:53
|
<?xml version="1.0" encoding="UTF-8"?> <xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema" elementFormDefault="qualified"> <xs:annotation> <xs:documentation>Name: dpubs_journal_issue.xsd</xs:documentation> <xs:documentation>Version: <xs:appinfo source="xsdVersion">1.5</xs:appinfo></xs:documentation> <xs:documentation>Date: 2007-03-28</xs:documentation> <xs:documentation>Purpose: XML Schema for encoding journal issues (issue and article level metadata) for submission to DPubS.</xs:documentation> <xs:documentation>See dpubs.org for more information</xs:documentation> </xs:annotation> <xs:element name="journal_issue"> <xs:complexType> <xs:sequence> <xs:element ref="header"/> <xs:element ref="issue"/> </xs:sequence> <xs:attribute name="version"/> </xs:complexType> </xs:element> <xs:element name="header"> <xs:complexType> <xs:sequence> <xs:element name="issue_identifier" type="xs:string"/> <xs:element name="timestamp" type="xs:string"/> <xs:element name="journal_id" type="xs:string"/> <xs:element ref="contact"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="contact"> <xs:complexType> <xs:sequence> <xs:element name="contact_name" type="xs:string"/> <xs:element ref="email" /> <xs:element maxOccurs="unbounded" ref="phone" /> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="email" type="xs:string"/> <xs:element name="phone" type="xs:string"/> <xs:element name="issue"> <xs:complexType> <xs:sequence> <xs:element ref="issue_data"/> <xs:choice maxOccurs="unbounded"> <xs:element ref="div"/> <xs:element ref="record"/> </xs:choice> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="issue_data"> <xs:complexType> <xs:sequence> <xs:element ref="identifiers"/> <xs:element minOccurs="0" ref="series"/> <xs:element ref="journal_vol_number"/> <xs:element minOccurs="0" ref="issue_title"/> <xs:element minOccurs="0" ref="issue_sortList_title"/> <xs:element ref="issue_number"/> <xs:element maxOccurs="unbounded" ref="issue_publ_date"/> <xs:choice minOccurs="0"> <xs:sequence> <xs:element ref="start_page"/> <xs:element ref="end_page"/> </xs:sequence> <xs:element ref="extent"/> </xs:choice> <xs:element minOccurs="0" ref="editorial_board"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="series"> <xs:complexType mixed="true"> <xs:attribute ref="order" use="required"/> </xs:complexType> </xs:element> <xs:attribute name="order"> <xs:simpleType> <xs:restriction base="xs:positiveInteger"> <xs:minInclusive value="1"/> <xs:maxInclusive value="10"/> </xs:restriction> </xs:simpleType> </xs:attribute> <xs:element name="journal_vol_number" type="xs:string"/> <xs:element name="issue_title" type="xs:string"/> <xs:element name="issue_sortList_title"> <xs:complexType mixed="true"> <xs:attributeGroup ref="attlist.issue_sortList_title"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.issue_sortList_title"> <xs:attribute name="sortKey"/> </xs:attributeGroup> <xs:element name="issue_number"> <xs:complexType mixed="true"> <xs:attributeGroup ref="attlist.issue_number"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.issue_number"> <xs:attribute name="label"/> </xs:attributeGroup> <xs:element name="issue_publ_date"> <xs:complexType mixed="true"> <xs:attributeGroup ref="attlist.issue_publ_date"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.issue_publ_date"> <xs:attribute name="iso8601" use="required"> <xs:simpleType> <xs:union memberTypes="xs:gYear xs:gYearMonth xs:date" /> </xs:simpleType> </xs:attribute> <xs:attribute name="type" default="print"> <xs:simpleType> <xs:restriction base="xs:token"> <xs:enumeration value="print"/> <xs:enumeration value="online"/> <xs:enumeration value="other"/> </xs:restriction> </xs:simpleType> </xs:attribute> <xs:attribute name="other"/> </xs:attributeGroup> <xs:element name="start_page" type="xs:string"/> <xs:element name="end_page" type="xs:string"/> <xs:element name="extent" type="xs:string" /> <xs:element name="editorial_board"> <xs:complexType> <xs:sequence> <xs:element maxOccurs="unbounded" ref="editor"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="editor"> <xs:complexType> <xs:sequence> <xs:element minOccurs="0" ref="editor_specialty"/> <xs:element minOccurs="0" ref="role"/> <xs:element ref="name"/> <xs:element minOccurs="0" ref="email"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="web_page"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="affiliation"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="editor_specialty" type="xs:string"/> <xs:element name="role" type="xs:string"/> <xs:element name="name"> <xs:complexType> <xs:sequence> <xs:element minOccurs="0" ref="name_prefix"/> <xs:element ref="given_name"/> <xs:element ref="surname"/> <xs:element minOccurs="0" ref="name_suffix"/> <xs:element minOccurs="0" ref="display_name"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="name_prefix" type="xs:string"/> <xs:element name="given_name" type="xs:string"/> <xs:element name="surname" type="xs:string"/> <xs:element name="name_suffix" type="xs:string"/> <xs:element name="display_name" type="xs:string"/> <xs:element name="web_page" type="xs:string"/> <xs:element name="affiliation"> <xs:complexType> <xs:sequence> <xs:element minOccurs="0" ref="label"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="prof_title"/> <xs:element minOccurs="0" ref="department"/> <xs:element minOccurs="0" ref="organization"/> <xs:element minOccurs="0" ref="address"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="phone"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="fax"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="label" type="xs:string"/> <xs:element name="prof_title" type="xs:string"/> <xs:element name="department" type="xs:string"/> <xs:element name="organization" type="xs:string"/> <xs:element name="address"> <xs:complexType> <xs:sequence> <xs:element maxOccurs="unbounded" ref="addressline"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="addressline" type="xs:string"/> <xs:element name="fax" type="xs:string"/> <xs:element name="div"> <xs:complexType> <xs:sequence> <xs:element maxOccurs="unbounded" ref="record"/> </xs:sequence> <xs:attributeGroup ref="attlist.div"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.div"> <xs:attribute name="label"/> </xs:attributeGroup> <xs:element name="record"> <xs:complexType> <xs:sequence> <xs:element ref="identifiers"/> <xs:element maxOccurs="unbounded" ref="title"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="author"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="contributor"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="abstract"/> <xs:element minOccurs="0" ref="reviewed_items"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="related_item"/> <xs:element minOccurs="0" ref="keywords"/> <xs:element minOccurs="0" ref="subjects"/> <xs:choice minOccurs="0"> <xs:sequence> <xs:element ref="start_page"/> <xs:element ref="end_page"/> </xs:sequence> <xs:element ref="extent"/> </xs:choice> <xs:choice maxOccurs="unbounded"> <xs:element ref="record_filename"/> <xs:element ref="record_filename_set"/> </xs:choice> </xs:sequence> <xs:attributeGroup ref="attlist.record"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.record"> <xs:attribute name="lang" default="EN"/> <xs:attribute name="type" default="article"> <xs:simpleType> <xs:restriction base="xs:token"> <xs:enumeration value="frontmatter"/> <xs:enumeration value="backmatter"/> <xs:enumeration value="article"/> <xs:enumeration value="review"/> <xs:enumeration value="index"/> <xs:enumeration value="other"/> </xs:restriction> </xs:simpleType> </xs:attribute> <xs:attribute name="other"/> </xs:attributeGroup> <xs:element name="identifiers"> <xs:complexType> <xs:sequence> <xs:element maxOccurs="unbounded" ref="identifier"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="identifier"> <xs:complexType mixed="true"> <xs:attributeGroup ref="attlist.identifier"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.identifier"> <xs:attribute name="type" use="required"> <xs:simpleType> <xs:restriction base="xs:token"> <xs:enumeration value="doi"/> <xs:enumeration value="dpubs"/> <xs:enumeration value="jstorURL"/> <xs:enumeration value="pii"/> <xs:enumeration value="sici"/> <xs:enumeration value="url"/> <xs:enumeration value="other"/> </xs:restriction> </xs:simpleType> </xs:attribute> <xs:attribute name="other"/> </xs:attributeGroup> <xs:element name="title"> <xs:complexType mixed="true"> <xs:choice minOccurs="0" maxOccurs="unbounded"> <xs:element ref="b"/> <xs:element ref="i"/> <xs:element ref="u"/> <xs:element ref="sub"/> <xs:element ref="sup"/> </xs:choice> <xs:attributeGroup ref="attlist.title"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.title"> <xs:attribute name="lang" default="EN"/> <xs:attribute name="type" default="main"> <xs:simpleType> <xs:restriction base="xs:token"> <xs:enumeration value="main"/> <xs:enumeration value="alternative"/> </xs:restriction> </xs:simpleType> </xs:attribute> </xs:attributeGroup> <xs:element name="b"> <xs:complexType mixed="true"> <xs:choice minOccurs="0" maxOccurs="unbounded"> <xs:element ref="i"/> <xs:element ref="u"/> </xs:choice> </xs:complexType> </xs:element> <xs:element name="i"> <xs:complexType mixed="true"> <xs:choice minOccurs="0" maxOccurs="unbounded"> <xs:element ref="b"/> <xs:element ref="u"/> </xs:choice> </xs:complexType> </xs:element> <xs:element name="u"> <xs:complexType mixed="true"> <xs:choice minOccurs="0" maxOccurs="unbounded"> <xs:element ref="b"/> <xs:element ref="i"/> </xs:choice> </xs:complexType> </xs:element> <xs:element name="sub"> <xs:complexType mixed="true"> <xs:choice minOccurs="0" maxOccurs="unbounded"> <xs:element ref="b"/> <xs:element ref="i"/> <xs:element ref="u"/> </xs:choice> </xs:complexType> </xs:element> <xs:element name="sup"> <xs:complexType mixed="true"> <xs:choice minOccurs="0" maxOccurs="unbounded"> <xs:element ref="b"/> <xs:element ref="i"/> <xs:element ref="u"/> </xs:choice> </xs:complexType> </xs:element> <xs:element name="ul"> <xs:complexType> <xs:sequence> <xs:element maxOccurs="unbounded" ref="li"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="li"> <xs:complexType mixed="true"> <xs:choice minOccurs="0" maxOccurs="unbounded"> <xs:element ref="b"/> <xs:element ref="i"/> <xs:element ref="u"/> <xs:element ref="sub"/> <xs:element ref="sup"/> <xs:element ref="ul"/> </xs:choice> </xs:complexType> </xs:element> <xs:element name="author"> <xs:complexType> <xs:sequence> <xs:element ref="name"/> <xs:element minOccurs="0" ref="email"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="web_page"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="affiliation"/> </xs:sequence> <xs:attribute ref="order"/> </xs:complexType> </xs:element> <xs:element name="contributor"> <xs:complexType> <xs:sequence> <xs:element ref="role"/> <xs:element ref="name"/> <xs:element minOccurs="0" ref="email"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="web_page"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="affiliation"/> </xs:sequence> <xs:attributeGroup ref="attlist.contributor"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.contributor"> <xs:attribute name="order"/> </xs:attributeGroup> <xs:element name="abstract"> <xs:complexType> <xs:sequence> <xs:element maxOccurs="unbounded" ref="p"/> </xs:sequence> <xs:attributeGroup ref="attlist.abstract"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.abstract"> <xs:attribute name="lang" default="EN"/> </xs:attributeGroup> <xs:element name="p"> <xs:complexType mixed="true"> <xs:choice minOccurs="0" maxOccurs="unbounded"> <xs:element ref="b"/> <xs:element ref="i"/> <xs:element ref="u"/> <xs:element ref="sub"/> <xs:element ref="sup"/> <xs:element ref="ul"/> </xs:choice> <xs:attributeGroup ref="attlist.p"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.p"> <xs:attribute name="align"> <xs:simpleType> <xs:restriction base="xs:token"> <xs:enumeration value="left"/> <xs:enumeration value="center"/> <xs:enumeration value="right"/> </xs:restriction> </xs:simpleType> </xs:attribute> </xs:attributeGroup> <xs:element name="reviewed_items"> <xs:complexType> <xs:sequence> <xs:element maxOccurs="unbounded" ref="citation"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="citation"> <xs:complexType> <xs:sequence> <xs:element minOccurs="0" maxOccurs="unbounded" ref="author"/> <xs:element minOccurs="0" maxOccurs="unbounded" ref="contributor"/> <xs:element ref="title"/> <xs:element minOccurs="0" ref="add_desc_data"/> <xs:element minOccurs="0" ref="identifiers"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="add_desc_data" type="xs:string"/> <xs:element name="related_item"> <xs:complexType> <xs:sequence> <xs:element ref="label"/> <xs:element ref="citation"/> <xs:element minOccurs="0" ref="record_filename"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="keywords"> <xs:complexType> <xs:sequence> <xs:element maxOccurs="unbounded" ref="keyword"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="keyword"> <xs:complexType mixed="true"> <xs:attributeGroup ref="attlist.keyword"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.keyword"> <xs:attribute name="lang" default="EN"/> <xs:attribute name="rank"> <xs:simpleType> <xs:restriction base="xs:token"> <xs:enumeration value="primary"/> <xs:enumeration value="secondary"/> </xs:restriction> </xs:simpleType> </xs:attribute> </xs:attributeGroup> <xs:element name="subjects"> <xs:complexType> <xs:sequence> <xs:element maxOccurs="unbounded" ref="subject"/> </xs:sequence> </xs:complexType> </xs:element> <xs:element name="subject"> <xs:complexType mixed="true"> <xs:attributeGroup ref="attlist.subject"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.subject"> <xs:attribute name="lang" default="EN"/> <xs:attribute name="rank" use="required"> <xs:simpleType> <xs:restriction base="xs:token"> <xs:enumeration value="primary"/> <xs:enumeration value="secondary"/> </xs:restriction> </xs:simpleType> </xs:attribute> <xs:attribute name="scheme" use="required"/> </xs:attributeGroup> <xs:element name="record_filename"> <xs:complexType mixed="true"> <xs:attributeGroup ref="attlist.record_filename"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.record_filename"> <xs:attribute name="filetype" use="required"> <xs:simpleType> <xs:restriction base="xs:token"> <xs:enumeration value="pdf"/> <xs:enumeration value="pdfhighres"/> <xs:enumeration value="djvu"/> <xs:enumeration value="ps"/> <xs:enumeration value="doc"/> <xs:enumeration value="html"/> <xs:enumeration value="xhtml"/> <xs:enumeration value="refxml"/> <xs:enumeration value="txt-ocr"/> <xs:enumeration value="mtif"/> <xs:enumeration value="other"/> </xs:restriction> </xs:simpleType> </xs:attribute> <xs:attribute name="other"/> <xs:attribute name="anchor"/> </xs:attributeGroup> <xs:element name="record_filename_set"> <xs:complexType> <xs:sequence> <xs:element maxOccurs="unbounded" ref="set_item"/> </xs:sequence> <xs:attributeGroup ref="attlist.record_filename_set"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.record_filename_set"> <xs:attribute name="setID" use="required"/> </xs:attributeGroup> <xs:element name="set_item"> <xs:complexType> <xs:attributeGroup ref="attlist.set_item"/> </xs:complexType> </xs:element> <xs:attributeGroup name="attlist.set_item"> <xs:attribute name="seq" use="required"/> <xs:attribute name="filename" use="required"/> <xs:attribute name="filetype" use="required"> <xs:simpleType> <xs:restriction base="xs:token"> <xs:enumeration value="tif"/> <xs:enumeration value="mtif"/> <xs:enumeration value="gif"/> <xs:enumeration value="jpg"/> <xs:enumeration value="pdf"/> <xs:enumeration value="ps"/> <xs:enumeration value="txt"/> </xs:restriction> </xs:simpleType> </xs:attribute> <xs:attribute name="n"/> <xs:attribute name="ftr"/> </xs:attributeGroup> </xs:schema> |
|
From: Larry W. <lr...@ps...> - 2007-06-27 20:56:59
|
Henry, The <end_page/> element is used for journals that are still in print, or have been converted from print. I believe the reason for requiring it was to give users an idea of how many pages an article would be, so they knew how much to expect when opening/downloading it. It also corresponds to the printed page numbers of the print journal, so if I skip an article in encoding and loading (it could happen), I can tell from the start and end pages. It's also an easy reference for scholars making citations to the printed work. I'll defer to Cornell for a better answer, though. --Larry Wentzel --Digital Preservation Coordinator --Penn State Libraries Henry Griffy wrote: > Hi All, > > In the process of coding up the XML for our next issue, I noticed that > the <end_page/> element is required for <record/>s. Is this intentional? > > I ask because this journal I'm working on is web-only. The concept of > end_page doesn't really apply. For now, I've gone through and added > the element with dummy content, but I wanted to see if there is a > reason for this element to be required -- if it plays a particular > role in how content is organized or displayed. > > Thanks, > Henry > > PS. Dave, thanks for the quick reply about the (non-)nesting <div>s, > and the thoughts about how DPubS might be re-configured to allow > them. Maybe in the future, but for now I'm planning to stick to the > default configuration as closely as possible. > > Henry Griffy, Program Coordinator > Documentation and Training Unit > Scholarly Resources Integration Department > Technical Services Division > The Ohio State University Libraries - Ackerman Road > 600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 > E-mail: gri...@os... <mailto:gri...@os...> > Ph: 614-247-4663; Fax: 614-292-2015 > > ------------------------------------------------------------------------ > > ------------------------------------------------------------------------- > This SF.net email is sponsored by DB2 Express > Download DB2 Express C - the FREE version of DB2 express and take > control of your XML. No limits. Just data. Click to get it now. > http://sourceforge.net/powerbar/db2/ > ------------------------------------------------------------------------ > > _______________________________________________ > Dpubs-content mailing list > Dpu...@li... > https://lists.sourceforge.net/lists/listinfo/dpubs-content > |
|
From: Henry G. <gri...@os...> - 2007-06-27 20:37:48
|
Hi All, In the process of coding up the XML for our next issue, I noticed that the <end_page/> element is required for <record/>s. Is this intentional? I ask because this journal I'm working on is web-only. The concept of end_page doesn't really apply. For now, I've gone through and added the element with dummy content, but I wanted to see if there is a reason for this element to be required -- if it plays a particular role in how content is organized or displayed. Thanks, Henry PS. Dave, thanks for the quick reply about the (non-)nesting <div>s, and the thoughts about how DPubS might be re-configured to allow them. Maybe in the future, but for now I'm planning to stick to the default configuration as closely as possible. Henry Griffy, Program Coordinator Documentation and Training Unit Scholarly Resources Integration Department Technical Services Division The Ohio State University Libraries - Ackerman Road 600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 E-mail: <mailto:gri...@os...>gri...@os... Ph: 614-247-4663; Fax: 614-292-2015 |
|
From: David R. <dw...@co...> - 2007-06-10 15:55:31
|
Larry,
We relaxed this restriction, I think because you pointed out that it
was unnecessarily restrictive ;)
So it now depends on which version of the schema you're using.
Version 1.3 is as you describe--you need to use records without divs,
or divs that include records, but records and divs can't be siblings.
With dpubs_journal_issue.xsd 1.4, you can mix it up more. So this is
now allowed:
<record/>
<div>
<record/>
<record/>
</div>
<record/>
<record/>
The purpose of the div is to create sections within a TOC, like
"Research Articles" and "Reviews". If you have an undifferentiated
TOC, then there's not much reason to use <div>.
Best,
--David
At 01:42 PM 6/8/2007, Larry Wentzel wrote:
>Henry,
>
>Only one thing catches my eye, which is something I had to have
>clarified to me. The <div> field is optional, but once you start using
>it in an issue, you have to use it for all of your records in that
>issue. For example, you cannot use <div> for the first three records,
>and then have no <div> </div> for the remainder of the records.
>
>Also, and this is an even smaller point to make, you can leave the
>label="" off if you have no title for a <div>. All that happens is you
>get a horizontal rule across the issue with no title. We've used that
>here at Penn State when we have journal issues with sections that have
>titles and other sections that do not.
>
>--Larry Wentzel
>--Digital Preservation Coordinator
>--Penn State Libraries
>
>
>-------------------------------------------------------------------------
>This SF.net email is sponsored by DB2 Express
>Download DB2 Express C - the FREE version of DB2 express and take
>control of your XML. No limits. Just data. Click to get it now.
>http://sourceforge.net/powerbar/db2/
>_______________________________________________
>Dpubs-content mailing list
>Dpu...@li...
>https://lists.sourceforge.net/lists/listinfo/dpubs-content
|
|
From: David R. <dw...@co...> - 2007-06-10 15:53:40
|
Hi Henry, Thanks for doing this. I think it will be useful. I just have a few general comments... You've noticed that a lot here is optional. The required elements allow the system to have some minimum expectations. The optional elements have been added over the years as publishers have asked for additional features and/or display capabilities. I'd suggest beginning with a basic set that makes sense to you and the "publisher", rather than trying to gather every possible data element. You can always add more data as you go forward. The reason the identifiers are required within the issue_data and record elements is to capture a pii (publisher item id) identifier. You can add other identifiers as needed, but you must have a single, unique (within the publication/journal), pii value for each of these elements. This is a system requirement, to support data reloading. When a journal issue is initially loaded into DPubS, internal system identifiers are assigned. To allow reloading the same issue (to correct a error, for example), the system needs to know what the assigned identifiers are, so that the same ones can be used. Otherwise, we couldn't maintain persistent identifiers over time. If you're reloading an issue, you can always add additional records, with new pii values. But don't change an existing record's pii value, once the data has been loaded into DPubS. Within issue_data, the series element and issue_sortList_title element were added to accommodate fairly special circumstances. They will be useful if a publication has changed its name over time, or if the publication has odd issues that don't follow the vol/issue numbering, such as a special index vol. If any of this sounds like it may apply, let us know and we can explain more. If the data model, the Schema, doesn't make sense to you in particular places, depending on what you are publishing, then it may be that you'll want to modify it. This is relatively easy to accommodate in DPubS. We can help guide you in the consequences of this, which will vary. Thanks again, and feel free to ask if you have any questions about what I've written, or other questions. Best, --David At 11:01 AM 6/8/2007, Henry Griffy wrote: >Hi All, > >As part of getting ready to upload our first issue of new content, >I've been studying the requirements of the xml metadata file. To >help me understand these requirements, I went through the schema >definition (DPubS/htdocs/dtds/dpubs_journal_issue.xsd) and used it >to produce an empty xml file containing all the possible containers >the schema describes. I've also gone through and added comments >about which fields and attributes are required, repeatable, etc. > >I'm sending it to the list in hopes that folks who know the schema >better than I might tell me if I've made any glaring errors. I also >thought others might find it helpful for documenting the process of >preparing content. I'll both attach it and copy-paste it below. > >TGIF, >Henry Griffy >Ohio State University Libraries > ><?xml version="1.0" encoding="iso-8859-1"?> ><dpubs_access_data version="1.0" >xmlns:xsi=http://www.w3.org/2001/XMLSchema-instance >xsi:noNamespaceSchemaLocation="dpubs_journal_issue.xsd"> ><journal_issue> <!-- main container: contains <header> and <issue> --> > <header> <!-- contains four elements, each required and > non-repeatable --> > <issue_identifier>title, volume, number, > date</issue_identifier> <!-- required --> > <timestamp>yyyy-mm-dd</timestamp> <!-- required --> > <journal_id>aka, the authority</journal_id> <!-- required --> > <contact> <!-- --> > <contact_name>string</contact_name> <!-- --> > <email>string@string</email> <!-- --> > <phone>string</phone> <!-- --> > </contact> > </header> > <issue> <!-- contains the bulk of the metadata: > <issue_data> and _either_ <div> or <record> --> > <issue_data> <!-- information that applies to the > whole issue --> > <identifiers> <!-- required --> > <identifier type="doi | dpubs | > jstorURL | pii | sici | url | other" > other="">string</identifier> <!-- required, repeatable --> > </identifiers> > <series order="">integer btwn 1 and > 10</series> <!-- optional --> > ><journal_vol_number>string</journal_vol_number> <!-- required --> > <issue_title>string</issue_title> <!-- optional --> > <issue_sortList_title > sortKey="">string</issue_sortList_title> <!-- optional --> > <issue_number > label="">required</issue_number> <!-- required --> > <issue_publ_date iso8601="yyyy-mm-dd" > type="PRINT | online | other" > other="">string</issue_publ_date> <!-- optional, repeatable --> > <start_page>string</start_page> <!-- optional --> > <end_page>string</end_page> <!-- optional --> > <extent>string</extent> <!-- optional --> > <editorial_board> <!-- optional --> > <editor> <!-- required, if > <editorial_board> is used; repeatable --> > ><editor_specialty>string</editor_specialty> <!-- optional --> > <role>string</role> > <name> <!-- required if > <editor> is used --> > ><name_prefix>string</name_prefix> <!-- optional --> > ><given_name>string</given_name> <!-- required --> > ><surname>string</surname> <!-- required --> > ><name_suffix>string</name_suffix> <!-- optional --> > ><display_name>string</display_name> <!-- optional --> > </name> > ><email>string@string</email> <!-- optional --> > <web_page>string</web_page> > <!-- optional --> > <affiliation> <!-- > optional, repeatable --> > ><label>string</label> <!-- optional --> > ><prof_title>string</prof_title> <!-- optional --> > ><department>string</department> <!-- optional --> > ><organization>string</organization> <!-- optional --> > <address> > ><address_line>string</address_line> <!-- optional, repeatable --> > </address> > ><phone>string</phone> <!-- optional --> > <fax>string</fax> > <!-- optional --> > </affiliation> > </editorial_board> > </issue_data> > > <div label=""> <!-- optional, repeatable container > for a group of records --> > > <record lang="" type="frontmatter | backmatter | > ARTICLE | review | index | other"> <!-- required, repeatable --> > <identifiers> <!-- required --> > <identifier type="(required) doi | > dpubs | jstorURL | pii | sici | url | other" > other="">string</identifier> <!-- required --> > </identifiers> > <title lang="" type="MAIN | > alternative"> <!-- required, repeatable > can use either of the > following sets of tags: basically formatting +/- list --> > <b></b> or <i></i> or <u></u> or > <sub></sub> or <sup></sup> <!-- optional, repeatable --> > ><ul><li><b></b><i></i><u></u><sub></sub><sup></sup></li></ul> <!-- >optional, repeatable --> > </title> > <author> <!-- optional, repeatable --> > <name> <!-- required if > <author> is used --> > see above (editor) > for detailed list of sub-fields > </name> > <email>string</email> <!-- optional --> > <web_page>string</web_page> <!-- > optional --> > <affiliation> <!-- > optional, repeatable --> > see above (editor) > for detailed list of sub-fields > </affiliation> > </author> > <contributor order=""> <!-- optional, repeatable --> > <role>string</role> <!-- required, > if <contributor> used --> > <name> <!-- required if > <contributor> is used --> > see above (editor) > for detailed list of sub-fields > </name> > <email>string</email> <!-- optional --> > <web_page>string</web_page> <!-- > optional --> > <affiliation> <!-- > optional, repeatable --> > see above (editor) > for detailed list of sub-fields > </affiliation> > </contributor> > <abstract lang="default EN"> <!-- optional --> > <p align="left | center | right"> > ><ul><li><b></b><i></i><u></u><sub></sub><sup></sup></li></ul> <!-- optional --> > </p> > </abstract> > <reviewed_items> <!-- optional --> > <citation> <!-- optional --> > <author> <!-- optional, > repeatable --> > see above (author) > for detailed list of sub-fields > </author> > <contributor> <!-- > optional, repeatable --> > see above > (contributor) for detailed list of sub-fields > </contributor> > <title>string</title> <!-- > required --> > ><add_desc_data>string</add_desc_data> <!-- optional --> > <identifiers> <!-- optional --> > <identifier > type="(required) doi | dpubs | jstorURL | pii | sici | url | other" > other="">string</identifier> > </identifiers> > </citation> > </reviewed_items> > <related_item> <!-- optional --> > <label>string</label> <!-- > required, if <related_item> used --> > <citation> <!-- required if > <related_item> used --> > see above (reviewed_items) > for detailed list of sub-fields > </citation> > ><record_filename>string</record_filename> <!-- optional --> > </related_item> > <keywords> <!-- optional --> > <keyword lang="" rank="(required) > primary | secondary">string</keyword> <!-- required, if <keywords> > used, repeatable --> > </keywords> > <subjects> <!-- optional --> > <subject lang="" rank="(required) > primary | secondary" scheme="(required)">string</subject> <!-- > required, if <subjects> used, repeatable --> > </subjects> > <start_page>string</start_page> <!-- optional --> > <extent>string</extent> <!-- optional --> > <end_page>string</end_page> <!-- optional --> > <record_filename filetype="tif | mtif | gif > | jpg | pdf | ps | txt" other="" anchor="">string</record_filename> > <!-- either this or <record_filename_set> required --> > <record_filename_set setID="(required)"> > <set_item seq="" filename="" > filetype="tif | mtif | gif | jpg | pdf | ps | txt" n="" ftr=""/> > <!-- required if <record_filename_set> used --> > </record_filename_set> > </record> > </div> > </issue> ><journal_issue> > > >------------------------------------------------------------------------- >This SF.net email is sponsored by DB2 Express >Download DB2 Express C - the FREE version of DB2 express and take >control of your XML. No limits. Just data. Click to get it now. >http://sourceforge.net/powerbar/db2/ >_______________________________________________ >Dpubs-content mailing list >Dpu...@li... >https://lists.sourceforge.net/lists/listinfo/dpubs-content |
|
From: Larry W. <lr...@ps...> - 2007-06-08 17:42:24
|
Henry, Only one thing catches my eye, which is something I had to have clarified to me. The <div> field is optional, but once you start using it in an issue, you have to use it for all of your records in that issue. For example, you cannot use <div> for the first three records, and then have no <div> </div> for the remainder of the records. Also, and this is an even smaller point to make, you can leave the label="" off if you have no title for a <div>. All that happens is you get a horizontal rule across the issue with no title. We've used that here at Penn State when we have journal issues with sections that have titles and other sections that do not. --Larry Wentzel --Digital Preservation Coordinator --Penn State Libraries |
|
From: Henry G. <gri...@os...> - 2007-06-08 15:18:49
|
Hi All, As part of getting ready to upload our first issue of new content, I've been studying the requirements of the xml metadata file. To help me understand these requirements, I went through the schema definition (DPubS/htdocs/dtds/dpubs_journal_issue.xsd) and used it to produce an empty xml file containing all the possible containers the schema describes. I've also gone through and added comments about which fields and attributes are required, repeatable, etc. I'm sending it to the list in hopes that folks who know the schema better than I might tell me if I've made any glaring errors. I also thought others might find it helpful for documenting the process of preparing content. I'll both attach it and copy-paste it below. TGIF, Henry Griffy Ohio State University Libraries <?xml version="1.0" encoding="iso-8859-1"?> <dpubs_access_data version="1.0" xmlns:xsi=http://www.w3.org/2001/XMLSchema-instance xsi:noNamespaceSchemaLocation="dpubs_journal_issue.xsd"> <journal_issue> <!-- main container: contains <header> and <issue> --> <header> <!-- contains four elements, each required and non-repeatable --> <issue_identifier>title, volume, number, date</issue_identifier> <!-- required --> <timestamp>yyyy-mm-dd</timestamp> <!-- required --> <journal_id>aka, the authority</journal_id> <!-- required --> <contact> <!-- --> <contact_name>string</contact_name> <!-- --> <email>string@string</email> <!-- --> <phone>string</phone> <!-- --> </contact> </header> <issue> <!-- contains the bulk of the metadata: <issue_data> and _either_ <div> or <record> --> <issue_data> <!-- information that applies to the whole issue --> <identifiers> <!-- required --> <identifier type="doi | dpubs | jstorURL | pii | sici | url | other" other="">string</identifier> <!-- required, repeatable --> </identifiers> <series order="">integer btwn 1 and 10</series> <!-- optional --> <journal_vol_number>string</journal_vol_number> <!-- required --> <issue_title>string</issue_title> <!-- optional --> <issue_sortList_title sortKey="">string</issue_sortList_title> <!-- optional --> <issue_number label="">required</issue_number> <!-- required --> <issue_publ_date iso8601="yyyy-mm-dd" type="PRINT | online | other" other="">string</issue_publ_date> <!-- optional, repeatable --> <start_page>string</start_page> <!-- optional --> <end_page>string</end_page> <!-- optional --> <extent>string</extent> <!-- optional --> <editorial_board> <!-- optional --> <editor> <!-- required, if <editorial_board> is used; repeatable --> <editor_specialty>string</editor_specialty> <!-- optional --> <role>string</role> <name> <!-- required if <editor> is used --> <name_prefix>string</name_prefix> <!-- optional --> <given_name>string</given_name> <!-- required --> <surname>string</surname> <!-- required --> <name_suffix>string</name_suffix> <!-- optional --> <display_name>string</display_name> <!-- optional --> </name> <email>string@string</email> <!-- optional --> <web_page>string</web_page> <!-- optional --> <affiliation> <!-- optional, repeatable --> <label>string</label> <!-- optional --> <prof_title>string</prof_title> <!-- optional --> <department>string</department> <!-- optional --> <organization>string</organization> <!-- optional --> <address> <address_line>string</address_line> <!-- optional, repeatable --> </address> <phone>string</phone> <!-- optional --> <fax>string</fax> <!-- optional --> </affiliation> </editorial_board> </issue_data> <div label=""> <!-- optional, repeatable container for a group of records --> <record lang="" type="frontmatter | backmatter | ARTICLE | review | index | other"> <!-- required, repeatable --> <identifiers> <!-- required --> <identifier type="(required) doi | dpubs | jstorURL | pii | sici | url | other" other="">string</identifier> <!-- required --> </identifiers> <title lang="" type="MAIN | alternative"> <!-- required, repeatable can use either of the following sets of tags: basically formatting +/- list --> <b></b> or <i></i> or <u></u> or <sub></sub> or <sup></sup> <!-- optional, repeatable --> <ul><li><b></b><i></i><u></u><sub></sub><sup></sup></li></ul> <!-- optional, repeatable --> </title> <author> <!-- optional, repeatable --> <name> <!-- required if <author> is used --> see above (editor) for detailed list of sub-fields </name> <email>string</email> <!-- optional --> <web_page>string</web_page> <!-- optional --> <affiliation> <!-- optional, repeatable --> see above (editor) for detailed list of sub-fields </affiliation> </author> <contributor order=""> <!-- optional, repeatable --> <role>string</role> <!-- required, if <contributor> used --> <name> <!-- required if <contributor> is used --> see above (editor) for detailed list of sub-fields </name> <email>string</email> <!-- optional --> <web_page>string</web_page> <!-- optional --> <affiliation> <!-- optional, repeatable --> see above (editor) for detailed list of sub-fields </affiliation> </contributor> <abstract lang="default EN"> <!-- optional --> <p align="left | center | right"> <ul><li><b></b><i></i><u></u><sub></sub><sup></sup></li></ul> <!-- optional --> </p> </abstract> <reviewed_items> <!-- optional --> <citation> <!-- optional --> <author> <!-- optional, repeatable --> see above (author) for detailed list of sub-fields </author> <contributor> <!-- optional, repeatable --> see above (contributor) for detailed list of sub-fields </contributor> <title>string</title> <!-- required --> <add_desc_data>string</add_desc_data> <!-- optional --> <identifiers> <!-- optional --> <identifier type="(required) doi | dpubs | jstorURL | pii | sici | url | other" other="">string</identifier> </identifiers> </citation> </reviewed_items> <related_item> <!-- optional --> <label>string</label> <!-- required, if <related_item> used --> <citation> <!-- required if <related_item> used --> see above (reviewed_items) for detailed list of sub-fields </citation> <record_filename>string</record_filename> <!-- optional --> </related_item> <keywords> <!-- optional --> <keyword lang="" rank="(required) primary | secondary">string</keyword> <!-- required, if <keywords> used, repeatable --> </keywords> <subjects> <!-- optional --> <subject lang="" rank="(required) primary | secondary" scheme="(required)">string</subject> <!-- required, if <subjects> used, repeatable --> </subjects> <start_page>string</start_page> <!-- optional --> <extent>string</extent> <!-- optional --> <end_page>string</end_page> <!-- optional --> <record_filename filetype="tif | mtif | gif | jpg | pdf | ps | txt" other="" anchor="">string</record_filename> <!-- either this or <record_filename_set> required --> <record_filename_set setID="(required)"> <set_item seq="" filename="" filetype="tif | mtif | gif | jpg | pdf | ps | txt" n="" ftr=""/> <!-- required if <record_filename_set> used --> </record_filename_set> </record> </div> </issue> <journal_issue> |
|
From: Joshua S. <js...@co...> - 2007-06-04 20:05:05
|
Larry, If the content was submitted ( not loaded ) with the correct publication selected then you may be able to load it if all publications (all authorities) is selected. You should not be able to submit the content unless you have the correct publication (authority) selected. That is, you shouldn't be able to upload the ZIP package with "all authorities" selected. I think in the next release we have made things a little more strict as to what you can and cannot do. For now, it's best to always have the publication selected that you are working on. Josh Larry Wentzel wrote: > All, > > Just had an interesting situation occur. I loaded a number of > conference proceedings into DPubS through the Admin UI. However, due to > oddities in how my user account was set up, I did not have the > authorization to see/select the authority for the conference proceeding > (in this case, psu.wepan) which had been set up. The publication was > listed in the drop down menu (Penn State's Women in Engineering > Pennsylvania Network), so I did choose that. Lacking the correct > authority, I selected "All Authorities" and uploaded the files. As far > as I can determine, it worked fine. > > My question is, if I can upload files through the Admin UI without > selecting the correct authority, what is the need/value in having an > authority to select? > > --Larry Wentzel > --Digital Preservation Coordinator > --Penn State Libraries > > ------------------------------------------------------------------------- > This SF.net email is sponsored by DB2 Express > Download DB2 Express C - the FREE version of DB2 express and take > control of your XML. No limits. Just data. Click to get it now. > http://sourceforge.net/powerbar/db2/ > _______________________________________________ > Dpubs-content mailing list > Dpu...@li... > https://lists.sourceforge.net/lists/listinfo/dpubs-content |
|
From: Larry W. <lr...@ps...> - 2007-06-01 14:37:57
|
All, Just had an interesting situation occur. I loaded a number of conference proceedings into DPubS through the Admin UI. However, due to oddities in how my user account was set up, I did not have the authorization to see/select the authority for the conference proceeding (in this case, psu.wepan) which had been set up. The publication was listed in the drop down menu (Penn State's Women in Engineering Pennsylvania Network), so I did choose that. Lacking the correct authority, I selected "All Authorities" and uploaded the files. As far as I can determine, it worked fine. My question is, if I can upload files through the Admin UI without selecting the correct authority, what is the need/value in having an authority to select? --Larry Wentzel --Digital Preservation Coordinator --Penn State Libraries |
|
From: Joshua S. <js...@co...> - 2007-05-29 20:02:41
|
Henry, I think this is just a file permission problem. We suggest that you run apache, and administer the system, as one user (ex. dpubs). It sounds like you have a user set up called "dpubs" so I'll assume that's what you want to use. To bring things back to the way they should be you will want to chown all of the DPubS files. Go ahead and recursively chown the DPubS directory (as long as there is just DPubS stuff here): # cd /usr/local/ # chown -R dpubs:dpubs DPubS/ I'm also assuming dpubs is in group "dpubs" as well. The importer also needs to write to that XML file that you edited, the content you are trying to load. If that is not within /usr/local/DPubs/ make sure dpubs can read and write to the XML file and can write to the directory that this file is in. After that try to run the importer as user dpubs (you might need to sudo su - dpubs). If you hit that DB error while trying to import, go ahead and remove the Editorial DB file. This shouldn't cause any harm unless you are in production mode and want to keep track of submissions and loads. That DB file is here: # rm /usr/local/DPubS/Sercvices/Editorial/db/Editorial Josh Henry Griffy wrote: > Hi All, > > I've encountered an error message that isn't making sense to me, and I > hope someone on the list might help me understand what's not working. > > Some background. In order to test that the installation is working, > I've set up a dummy publication with the authority osu.blah. For > content, I've taken the contents of the demo.ph/001-001/ folder and > copied them to another folder called osu.blah/001-001/. I've edited the > ph_001_001.xml file minimally: changing <journal_id>demo.ph</journal_id> > to <journal_id>osu.blah</journal_id> and adding osublah to the beginning > of each of the <identifier type="pii"> tags. > > When I try to add this content using DPubS_Importer.pl in the > Services/Editorial/Batch folder, it returns an error. The precise error > depends on whether I execute the file under my own username or as a sudo > user called dpubs. > > When I try to run DPubS_Importer.pl as sudo dpubs, I get: > > Step 2: Check data routine failed for handler. > Error: 5: Failed to fix schema ref > > When I try to run DPubS_Importer.pl as my own username, I get a couple > of errors: > > First, I get one of these error statements per content file, with the > last two digits of the identifier in each statement incremented by one: > > touch: cannot touch `/usr/local/DPubS/tmp/identifiers/1180464989': > Permission denied > > Second, I get this error statement twice: > > DBD::SQLite::db do failed: attempt to write a readonly database(1) > at dbdimp.c line 402 at /usr/local/DPubS/Common/DB.pm line 124 > > I've searched for dbdimp.c but haven't found any trace of that file. > I've looked at the code of DB.pm, line 124, but it doesn't tell me much, > other than that DPubS seems to be trying to write to a database. > > Any help would be much welcome. > > Thanks, > Henry > > > Henry Griffy, Program Coordinator > Documentation and Training Unit > Scholarly Resources Integration Department > Technical Services Division > The Ohio State University Libraries - Ackerman Road > 600 Ackerman Rd., Rm. 5839, Columbus, OH 43202 > E-mail: gri...@os... <mailto:gri...@os...> > Ph: 614-247-4663; Fax: 614-292-2015 > > > ------------------------------------------------------------------------ > > ------------------------------------------------------------------------- > This SF.net email is sponsored by DB2 Express > Download DB2 Express C - the FREE version of DB2 express and take > control of your XML. No limits. Just data. Click to get it now. > http://sourceforge.net/powerbar/db2/ > > > ------------------------------------------------------------------------ > > _______________________________________________ > Dpubs-content mailing list > Dpu...@li... > https://lists.sourceforge.net/lists/listinfo/dpubs-content |
|
From: Henry G. <gri...@os...> - 2007-05-29 19:05:46
|
Hi All,
I've encountered an error message that isn't making sense to me, and
I hope someone on the list might help me understand what's not working.
Some background. In order to test that the installation is working,
I've set up a dummy publication with the authority osu.blah. For
content, I've taken the contents of the demo.ph/001-001/ folder and
copied them to another folder called osu.blah/001-001/. I've edited
the ph_001_001.xml file minimally: changing
<journal_id>demo.ph</journal_id> to <journal_id>osu.blah</journal_id>
and adding osublah to the beginning of each of the <identifier
type="pii"> tags.
When I try to add this content using DPubS_Importer.pl in the
Services/Editorial/Batch folder, it returns an error. The precise
error depends on whether I execute the file under my own username or
as a sudo user called dpubs.
When I try to run DPubS_Importer.pl as sudo dpubs, I get:
Step 2: Check data routine failed for handler.
Error: 5: Failed to fix schema ref
When I try to run DPubS_Importer.pl as my own username, I get a
couple of errors:
First, I get one of these error statements per content file, with the
last two digits of the identifier in each statement incremented by one:
touch: cannot touch `/usr/local/DPubS/tmp/identifiers/1180464989':
Permission denied
Second, I get this error statement twice:
DBD::SQLite::db do failed: attempt to write a readonly database(1) at
dbdimp.c line 402 at /usr/local/DPubS/Common/DB.pm line 124
I've searched for dbdimp.c but haven't found any trace of that
file. I've looked at the code of DB.pm, line 124, but it doesn't
tell me much, other than that DPubS seems to be trying to write to a database.
Any help would be much welcome.
Thanks,
Henry
Henry Griffy, Program Coordinator
Documentation and Training Unit
Scholarly Resources Integration Department
Technical Services Division
The Ohio State University Libraries - Ackerman Road
600 Ackerman Rd., Rm. 5839, Columbus, OH 43202
E-mail: <mailto:gri...@os...>gri...@os...
Ph: 614-247-4663; Fax: 614-292-2015 |