From: Carl B. <cbo...@gm...> - 2011-05-11 16:10:20
|
Hi Dryad team, Treebase team, Thanks Hilmar, here's an example: For instance, this Dryad entry<http://datadryad.org/handle/10255/dryad.8661> does not point to this treebase entry,<http://treebase.org/treebase-web/search/study/anyObjectAsRDF.rdf?namespacedGUID=TB2:S11266> Perhaps this is just because it is a recent entry? The treebase entry in this cahas the Dryad id under a tag I didn't expect: <tb:title.study xmlns:tb="http://treebase.org/terms#">dryad_8661</tb:title.study> Meanwhile this Dryad entry<http://datadryad.org/handle/10255/dryad.4939?show=full> does have the treebase identifer, but the treebase entry doesn't have the dryad identifier<http://treebase.org/treebase-web/search/study/anyObjectAsRDF.rdf?namespacedGUID=TB2:S84> . -Carl On Wed, May 11, 2011 at 7:04 AM, Hilmar Lapp <hl...@ne...> wrote: > Hi Carl - > > first off, the perfect place to ask these and related questions is the > Dryad Developers Google Group [1], which is public. I'm copying some of this > thread there. > > The DataONE API implementation in R is as far as I understand not pretty > and far from mature, but in principle functional. I'm copying Dave Vieglais, > who should be able to point you to the appropriate places in the DataONE svn > repository. > > Our handshaking mechanism with TreeBASE is in principle designed (or was in > my understanding) such that TreeBASE studyIDs and Dryad DOIs are harvested > and updated on the respective ends. (Completing a TreeBASE submission is > asynchronous from, and can take a lot longer than, completing the > "containing" Dryad submission.) If you have an example for where that > doesn't seem to have worked, can you post that here? > > Cheers, > > -hilmar > > [1] http://groups.google.com/group/dryad-dev > > On May 10, 2011, at 1:30 AM, Carl Boettiger wrote: > > Hi Ryan, > > I am working on implementing the TreeBASE API in R, and am implemented in > extending this to the DataDryad and DataONE APIs. (In case you're curious, > the project development is currently hosted on github<https://github.com/cboettig/treeBASE>. > Pretty straight forward, so far just done the PhyloWS side, hoping to do the > OAI-PMH side next). > > Todd suggests that there is already an implementation of the DataONE API in > R, which is great to hear. Do you know who's involved in this, and what > state it's in? I'd be happy to help test or whatnot. I've started looking > at what's available for the investigator-level API<http://mule1.dataone.org/ArchitectureDocs-current/apis/index.html>for DataONE; is this the current documentation? > > William Piel also forwarded you a question from my about getting the > treebase and dryad data better integrated -- the treebase metadata doesn't > currently include the dryad dois, and it seems the dryad dois don't include > the treebase study ids. Certainly this would be a pretty valuable to have, > though I suppose a work-around might be possible by matching other metadata > in the search? > > Anyway, nice to meet you, and look forward to hearing a bit more about what > directions you're heading with this. Is there an appropriate forum / > mailing list to discuss the APIs for Dryad and DataONE? > > Thanks much, > > Carl > > -- > Carl Boettiger > UC Davis > http://www.carlboettiger.info/ > > > -- > =========================================================== > : Hilmar Lapp -:- Durham, NC -:- informatics.nescent.org : > =========================================================== > > > > -- Carl Boettiger UC Davis http://www.carlboettiger.info/ |
From: William P. <wil...@ya...> - 2011-05-11 17:09:44
|
On May 11, 2011, at 12:10 PM, Carl Boettiger wrote: > Hi Dryad team, Treebase team, > > Thanks Hilmar, here's an example: > > For instance, this Dryad entry does not point to this treebase entry, Perhaps this is just because it is a recent entry? > The treebase entry in this cahas the Dryad id under a tag I didn't expect: <tb:title.study xmlns:tb="http://treebase.org/terms#">dryad_8661</tb:title.study> > > Meanwhile this Dryad entry does have the treebase identifer, but the treebase entry doesn't have the dryad identifier. > > -Carl Hi Carl, With respect the TreeBASE: <tb:title.study> is not the intended location for a dryad cross-reference -- it happens to be there because the title of the study is pre-populated with "dryad_8661", but the submitter can easily change this. Currently, TreeBASE exposes a DOI like so: <prism:doi xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/">10.1111/j.1469-8137.2011.03677.x</prism:doi> ... but that is for the doi to the publication, so we wouldn't want to use that for Dryad. On the other hand, I'd rather not do a schema change. But we have a field called "URL", which we could pre-populate with "http://dx.doi.org/10.5061/dryad.8661". This field is usually used for publications that don't have a doi, and instead have a particular url -- and it's rarely used. So one kluge (short of a schema change) is to pre-populate the url field with "http://dx.doi.org/10.5061/dryad.8661" and then expose that under the "prism:url" element. What would be a good element to use other than "prism:url" or "prism:doi"? bp |
From: Rutger V. <R....@re...> - 2011-05-12 12:41:28
|
> On the other hand, I'd rather not do a schema change. But we have a field > called "URL", which we could pre-populate with > "http://dx.doi.org/10.5061/dryad.8661". This field is usually used for > publications that don't have a doi, and instead have a particular url -- and > it's rarely used. So one kluge (short of a schema change) is to > pre-populate the url field with "http://dx.doi.org/10.5061/dryad.8661" and > then expose that under the "prism:url" element. What would be a good > element to use other than "prism:url" or "prism:doi"? Citation.url is a string so we can put whatever we like there in order to store it. As for the element to store it under, maybe: - dcterms:source, defined as "A related resource from which the described resource is derived. The described resource may be derived from the related resource in whole or in part. Recommended best practice is to identify the related resource by means of a string conforming to a formal identification system." (or a subclass thereof, e.g. tb.source.study.dryad) - dcterms:isReferencedBy, defined as "A related resource that references, cites, or otherwise points to the described resource.", or dcterms:references, depending on how you would qualify the relationship between the two. Or prism:isReferencedBy - dcterms:identifier, defined as "An unambiguous reference to the resource within a given context." (or a subclass thereof, e.g. tb.identifier.study.dryad) - prism:isVersionOf, defined as "The described resource is a version, edition, or adaptation of the referenced resource." To me, dcterms:source sounds like a sensible candidate. -- Dr. Rutger A. Vos School of Biological Sciences Philip Lyle Building, Level 4 University of Reading Reading, RG6 6BX, United Kingdom Tel: +44 (0) 118 378 7535 http://rutgervos.blogspot.com |
From: Hilmar L. <hl...@ne...> - 2011-05-12 13:05:53
|
Or perhaps simply dc:relation, or dcterms:isPartOf [1], which is actually a refinement of the former? I agree though that the relationship ought to be general enough that a DC or dcterms property will probably best facilitate use, and I also wouldn't kludge this - these become de-factor standards sooner than you want, and then you're stuck with them. -hilmar [1] http://dublincore.org/documents/dcmi-terms/#terms-isPartOf Sent with a tap. On May 12, 2011, at 8:41 AM, Rutger Vos <R....@re...> wrote: >> On the other hand, I'd rather not do a schema change. But we have a field >> called "URL", which we could pre-populate with >> "http://dx.doi.org/10.5061/dryad.8661". This field is usually used for >> publications that don't have a doi, and instead have a particular url -- and >> it's rarely used. So one kluge (short of a schema change) is to >> pre-populate the url field with "http://dx.doi.org/10.5061/dryad.8661" and >> then expose that under the "prism:url" element. What would be a good >> element to use other than "prism:url" or "prism:doi"? > > Citation.url is a string so we can put whatever we like there in order > to store it. As for the element to store it under, maybe: > > - dcterms:source, defined as "A related resource from which the > described resource is derived. The described resource may be derived > from the related resource in whole or in part. Recommended best > practice is to identify the related resource by means of a string > conforming to a formal identification system." (or a subclass thereof, > e.g. tb.source.study.dryad) > > - dcterms:isReferencedBy, defined as "A related resource that > references, cites, or otherwise points to the described resource.", or > dcterms:references, depending on how you would qualify the > relationship between the two. Or prism:isReferencedBy > > - dcterms:identifier, defined as "An unambiguous reference to the > resource within a given context." > (or a subclass thereof, e.g. tb.identifier.study.dryad) > > - prism:isVersionOf, defined as "The described resource is a version, > edition, or adaptation of the referenced resource." > > To me, dcterms:source sounds like a sensible candidate. > > -- > Dr. Rutger A. Vos > School of Biological Sciences > Philip Lyle Building, Level 4 > University of Reading > Reading, RG6 6BX, United Kingdom > Tel: +44 (0) 118 378 7535 > http://rutgervos.blogspot.com > > ------------------------------------------------------------------------------ > Achieve unprecedented app performance and reliability > What every C/C++ and Fortran developer should know. > Learn how Intel has extended the reach of its next-generation tools > to help boost performance applications - inlcuding clusters. > http://p.sf.net/sfu/intel-dev2devmay > _______________________________________________ > Treebase-devel mailing list > Tre...@li... > https://lists.sourceforge.net/lists/listinfo/treebase-devel |
From: Greenberg, J. <ja...@em...> - 2011-05-12 13:32:22
|
All -- I am not sure I've followed this conversation all the way, but Hilmar's interpretation of dcterms:isPartOf (or simple/unqualified DC w/dc:relation) makes good sense to me; and will help w/interoperabilty. W.r.t. PRISM, the Drayd 3.0 ap (application profile) has moved away from PRISM to BIBO, see: http://bibliontology.com/specification. There seems to be increased interest in and discussion of this scheme on various metadata lists. I am a tad wary about the overall upkeep (latest version, etc.).... but it is Sematic Web compatible, rdf, etc... Hope this helps. ~ jane -----Original Message----- From: dry...@go... [mailto:dry...@go...] On Behalf Of Hilmar Lapp Sent: Thursday, May 12, 2011 9:06 AM To: Rutger Vos Cc: William Piel; Tre...@li...; Vision, Todd J; DaveVieglais; Ryan Scherle; Dryad Developers Subject: [dryad-dev] Re: [Treebase-devel] Dryad API, advice on cyberinfastructure Or perhaps simply dc:relation, or dcterms:isPartOf [1], which is actually a refinement of the former? I agree though that the relationship ought to be general enough that a DC or dcterms property will probably best facilitate use, and I also wouldn't kludge this - these become de-factor standards sooner than you want, and then you're stuck with them. -hilmar [1] http://dublincore.org/documents/dcmi-terms/#terms-isPartOf Sent with a tap. On May 12, 2011, at 8:41 AM, Rutger Vos <R....@re...> wrote: >> On the other hand, I'd rather not do a schema change. But we have a >> field called "URL", which we could pre-populate with >> "http://dx.doi.org/10.5061/dryad.8661". This field is usually used >> for publications that don't have a doi, and instead have a particular >> url -- and it's rarely used. So one kluge (short of a schema change) >> is to pre-populate the url field with >> "http://dx.doi.org/10.5061/dryad.8661" and then expose that under the >> "prism:url" element. What would be a good element to use other than "prism:url" or "prism:doi"? > > Citation.url is a string so we can put whatever we like there in order > to store it. As for the element to store it under, maybe: > > - dcterms:source, defined as "A related resource from which the > described resource is derived. The described resource may be derived > from the related resource in whole or in part. Recommended best > practice is to identify the related resource by means of a string > conforming to a formal identification system." (or a subclass thereof, > e.g. tb.source.study.dryad) > > - dcterms:isReferencedBy, defined as "A related resource that > references, cites, or otherwise points to the described resource.", or > dcterms:references, depending on how you would qualify the > relationship between the two. Or prism:isReferencedBy > > - dcterms:identifier, defined as "An unambiguous reference to the > resource within a given context." > (or a subclass thereof, e.g. tb.identifier.study.dryad) > > - prism:isVersionOf, defined as "The described resource is a version, > edition, or adaptation of the referenced resource." > > To me, dcterms:source sounds like a sensible candidate. > > -- > Dr. Rutger A. Vos > School of Biological Sciences > Philip Lyle Building, Level 4 > University of Reading > Reading, RG6 6BX, United Kingdom > Tel: +44 (0) 118 378 7535 > http://rutgervos.blogspot.com > > ---------------------------------------------------------------------- > -------- Achieve unprecedented app performance and reliability What > every C/C++ and Fortran developer should know. > Learn how Intel has extended the reach of its next-generation tools to > help boost performance applications - inlcuding clusters. > http://p.sf.net/sfu/intel-dev2devmay > _______________________________________________ > Treebase-devel mailing list > Tre...@li... > https://lists.sourceforge.net/lists/listinfo/treebase-devel -- You received this message because you are subscribed to the Google Groups "dryad-dev" group. To post to this group, send email to dry...@go.... To unsubscribe from this group, send email to dry...@go.... For more options, visit this group at http://groups.google.com/group/dryad-dev?hl=en. |
From: William P. <wil...@ya...> - 2011-05-12 14:04:35
|
Is there a consensus, then, that TreeBASE should express the corresponding Dryad DOIs, prefixed with http://dx.doi.org/, using dcterms:isPartOf via both OAI and PhyloWS ? Or, for that matter, express the handle that Dryad labels dc:identifier.uri (e.g. http://hdl.handle.net/10255/dryad.15417). Meanwhile Dryad should express the corresponding TreeBASE URI also using dcterms:isPartOf ? A slight wrinkle is to make sure we are mutually cross-referencing the proper "level" of object. Dryad issues "Data Package" identifiers, which I guess is synonymous with a TreeBASE URI to a "study". These two identifiers point to the same package: dc:identifier: doi: 10.5061/dryad.tf48r dc:identifier.uri: http://hdl.handle.net/10255/dryad.15415 But in addition, Dryad issues identifiers for particular data sets. These two identifiers point to the same data set within the above package: dc:identifier: doi: 10.5061/dryad.tf48r/2 dc:identifier.uri: http://hdl.handle.net/10255/dryad.15417 There is no one-to-one equivalency between these data set identifiers and TreeBASE's URIs, because TreeBASE's URIs point to matrix objects and tree objects, which are all extracted from the same or different data set(s). Hence I think that for both TreeBASE and Dryad, the cross-referencing of identifiers should be at the level of "package" and "study" respectively. bp On May 12, 2011, at 9:32 AM, Greenberg, Jane wrote: > All -- I am not sure I've followed this conversation all the way, but Hilmar's interpretation of dcterms:isPartOf (or simple/unqualified DC w/dc:relation) makes good sense to me; and will help w/interoperabilty. > > W.r.t. PRISM, the Drayd 3.0 ap (application profile) has moved away from PRISM to BIBO, see: http://bibliontology.com/specification. There seems to be increased interest in and discussion of this scheme on various metadata lists. I am a tad wary about the overall upkeep (latest version, etc.).... but it is Sematic Web compatible, rdf, etc... > > Hope this helps. > > ~ jane > > -----Original Message----- > From: dry...@go... [mailto:dry...@go...] On Behalf Of Hilmar Lapp > Sent: Thursday, May 12, 2011 9:06 AM > To: Rutger Vos > Cc: William Piel; Tre...@li...; Vision, Todd J; DaveVieglais; Ryan Scherle; Dryad Developers > Subject: [dryad-dev] Re: [Treebase-devel] Dryad API, advice on cyberinfastructure > > Or perhaps simply dc:relation, or dcterms:isPartOf [1], which is actually a refinement of the former? > > I agree though that the relationship ought to be general enough that a DC or dcterms property will probably best facilitate use, and I also wouldn't kludge this - these become de-factor standards sooner than you want, and then you're stuck with them. > > -hilmar > > [1] http://dublincore.org/documents/dcmi-terms/#terms-isPartOf > > Sent with a tap. > > On May 12, 2011, at 8:41 AM, Rutger Vos <R....@re...> wrote: > >>> On the other hand, I'd rather not do a schema change. But we have a >>> field called "URL", which we could pre-populate with >>> "http://dx.doi.org/10.5061/dryad.8661". This field is usually used >>> for publications that don't have a doi, and instead have a particular >>> url -- and it's rarely used. So one kluge (short of a schema change) >>> is to pre-populate the url field with >>> "http://dx.doi.org/10.5061/dryad.8661" and then expose that under the >>> "prism:url" element. What would be a good element to use other than "prism:url" or "prism:doi"? >> >> Citation.url is a string so we can put whatever we like there in order >> to store it. As for the element to store it under, maybe: >> >> - dcterms:source, defined as "A related resource from which the >> described resource is derived. The described resource may be derived >> from the related resource in whole or in part. Recommended best >> practice is to identify the related resource by means of a string >> conforming to a formal identification system." (or a subclass thereof, >> e.g. tb.source.study.dryad) >> >> - dcterms:isReferencedBy, defined as "A related resource that >> references, cites, or otherwise points to the described resource.", or >> dcterms:references, depending on how you would qualify the >> relationship between the two. Or prism:isReferencedBy >> >> - dcterms:identifier, defined as "An unambiguous reference to the >> resource within a given context." >> (or a subclass thereof, e.g. tb.identifier.study.dryad) >> >> - prism:isVersionOf, defined as "The described resource is a version, >> edition, or adaptation of the referenced resource." >> >> To me, dcterms:source sounds like a sensible candidate. >> |
From: Hilmar L. <hl...@ne...> - 2011-05-12 21:34:38
|
The mutual cross-referencing at the level of data package and TB study makes sense. Please don't use the Dryad handles or handle URIs - the use of handles within Dryad is legacy and being phased out (except perhaps for individual files, which are not relevant here). If we use isPartOf for both references, then that suggests that the Dryad package and the TB study are semantically equivalent objects, which I think they are cleary not. I would think that Dryas would reference the TB study through a dcterms:hasPart property if TB references the Dryad package through dcterms:isPartOf. Or again the simpler dc.relation. -hilmar Sent with a tap. On May 12, 2011, at 10:04 AM, William Piel <wil...@ya...> wrote: > > Is there a consensus, then, that TreeBASE should express the corresponding Dryad DOIs, prefixed with http://dx.doi.org/, using dcterms:isPartOf via both OAI and PhyloWS ? > > Or, for that matter, express the handle that Dryad labels dc:identifier.uri (e.g. http://hdl.handle.net/10255/dryad.15417). > > Meanwhile Dryad should express the corresponding TreeBASE URI also using dcterms:isPartOf ? > > A slight wrinkle is to make sure we are mutually cross-referencing the proper "level" of object. > > Dryad issues "Data Package" identifiers, which I guess is synonymous with a TreeBASE URI to a "study". These two identifiers point to the same package: > > dc:identifier: doi: 10.5061/dryad.tf48r > dc:identifier.uri: http://hdl.handle.net/10255/dryad.15415 > > But in addition, Dryad issues identifiers for particular data sets. These two identifiers point to the same data set within the above package: > > dc:identifier: doi: 10.5061/dryad.tf48r/2 > dc:identifier.uri: http://hdl.handle.net/10255/dryad.15417 > > There is no one-to-one equivalency between these data set identifiers and TreeBASE's URIs, because TreeBASE's URIs point to matrix objects and tree objects, which are all extracted from the same or different data set(s). > > Hence I think that for both TreeBASE and Dryad, the cross-referencing of identifiers should be at the level of "package" and "study" respectively. > > bp > > > On May 12, 2011, at 9:32 AM, Greenberg, Jane wrote: > >> All -- I am not sure I've followed this conversation all the way, but Hilmar's interpretation of dcterms:isPartOf (or simple/unqualified DC w/dc:relation) makes good sense to me; and will help w/interoperabilty. >> >> W.r.t. PRISM, the Drayd 3.0 ap (application profile) has moved away from PRISM to BIBO, see: http://bibliontology.com/specification. There seems to be increased interest in and discussion of this scheme on various metadata lists. I am a tad wary about the overall upkeep (latest version, etc.).... but it is Sematic Web compatible, rdf, etc... >> >> Hope this helps. >> >> ~ jane >> >> -----Original Message----- >> From: dry...@go... [mailto:dry...@go...] On Behalf Of Hilmar Lapp >> Sent: Thursday, May 12, 2011 9:06 AM >> To: Rutger Vos >> Cc: William Piel; Tre...@li...; Vision, Todd J; DaveVieglais; Ryan Scherle; Dryad Developers >> Subject: [dryad-dev] Re: [Treebase-devel] Dryad API, advice on cyberinfastructure >> >> Or perhaps simply dc:relation, or dcterms:isPartOf [1], which is actually a refinement of the former? >> >> I agree though that the relationship ought to be general enough that a DC or dcterms property will probably best facilitate use, and I also wouldn't kludge this - these become de-factor standards sooner than you want, and then you're stuck with them. >> >> -hilmar >> >> [1] http://dublincore.org/documents/dcmi-terms/#terms-isPartOf >> >> Sent with a tap. >> >> On May 12, 2011, at 8:41 AM, Rutger Vos <R....@re...> wrote: >> >>>> On the other hand, I'd rather not do a schema change. But we have a >>>> field called "URL", which we could pre-populate with >>>> "http://dx.doi.org/10.5061/dryad.8661". This field is usually used >>>> for publications that don't have a doi, and instead have a particular >>>> url -- and it's rarely used. So one kluge (short of a schema change) >>>> is to pre-populate the url field with >>>> "http://dx.doi.org/10.5061/dryad.8661" and then expose that under the >>>> "prism:url" element. What would be a good element to use other than "prism:url" or "prism:doi"? >>> >>> Citation.url is a string so we can put whatever we like there in order >>> to store it. As for the element to store it under, maybe: >>> >>> - dcterms:source, defined as "A related resource from which the >>> described resource is derived. The described resource may be derived >>> from the related resource in whole or in part. Recommended best >>> practice is to identify the related resource by means of a string >>> conforming to a formal identification system." (or a subclass thereof, >>> e.g. tb.source.study.dryad) >>> >>> - dcterms:isReferencedBy, defined as "A related resource that >>> references, cites, or otherwise points to the described resource.", or >>> dcterms:references, depending on how you would qualify the >>> relationship between the two. Or prism:isReferencedBy >>> >>> - dcterms:identifier, defined as "An unambiguous reference to the >>> resource within a given context." >>> (or a subclass thereof, e.g. tb.identifier.study.dryad) >>> >>> - prism:isVersionOf, defined as "The described resource is a version, >>> edition, or adaptation of the referenced resource." >>> >>> To me, dcterms:source sounds like a sensible candidate. >>> > > ------------------------------------------------------------------------------ > Achieve unprecedented app performance and reliability > What every C/C++ and Fortran developer should know. > Learn how Intel has extended the reach of its next-generation tools > to help boost performance applications - inlcuding clusters. > http://p.sf.net/sfu/intel-dev2devmay > _______________________________________________ > Treebase-devel mailing list > Tre...@li... > https://lists.sourceforge.net/lists/listinfo/treebase-devel |
From: Carl B. <cbo...@gm...> - 2011-05-12 21:50:15
|
Hi Hilmar, William, List, Isn't Dryad already using dc:identifier to reference TreeBASE (i.e. this example <http://datadryad.org/handle/10255/dryad.4939?show=full>)? I'm sometimes puzzled in how to parse the returns when very different content is all included as dc:relation i.e. this example<http://www.datadryad.org/oai/request?verb=GetRecord&identifier=oai:datadryad.org:10255/dryad.1619&metadataPrefix=oai_dc>, dc:relation is sometimes a handle, sometimes a doi, a journal title, or a number I don't recognize. Is there anything more clever than a bunch of grep rules for distinguishing between these? Is there a way to get the URL of the actual data file (i.e. some csv file) through the dryad API? So far I can only return the website that contains the data file. Thanks much. Carl On Thu, May 12, 2011 at 2:34 PM, Hilmar Lapp <hl...@ne...> wrote: > The mutual cross-referencing at the level of data package and TB study makes > sense. Please don't use the Dryad handles or handle URIs - the use of > handles within Dryad is legacy and being phased out (except perhaps for > individual files, which are not relevant here). > If we use isPartOf for both references, then that suggests that the Dryad > package and the TB study are semantically equivalent objects, which I think > they are cleary not. I would think that Dryas would reference the TB study > through a dcterms:hasPart property if TB references the Dryad package > through dcterms:isPartOf. Or again the simpler dc.relation. > > -hilmar > Sent with a tap. > On May 12, 2011, at 10:04 AM, William Piel <wil...@ya...> wrote: > > > Is there a consensus, then, that TreeBASE should express the corresponding > Dryad DOIs, prefixed with http://dx.doi.org/, using dcterms:isPartOf via > both OAI and PhyloWS ? > Or, for that matter, express the handle that Dryad labels dc:identifier.uri > (e.g. http://hdl.handle.net/10255/dryad.15417). > Meanwhile Dryad should express the corresponding TreeBASE URI also > using dcterms:isPartOf ? > A slight wrinkle is to make sure we are mutually cross-referencing the > proper "level" of object. > Dryad issues "Data Package" identifiers, which I guess is synonymous with a > TreeBASE URI to a "study". These two identifiers point to the same package: > dc:identifier: doi: 10.5061/dryad.tf48r > dc:identifier.uri: http://hdl.handle.net/10255/dryad.15415 > But in addition, Dryad issues identifiers for particular data sets. These > two identifiers point to the same data set within the above package: > dc:identifier: doi: 10.5061/dryad.tf48r/2 > dc:identifier.uri: http://hdl.handle.net/10255/dryad.15417 > There is no one-to-one equivalency between these data set identifiers and > TreeBASE's URIs, because TreeBASE's URIs point to matrix objects and tree > objects, which are all extracted from the same or different data set(s). > Hence I think that for both TreeBASE and Dryad, the cross-referencing of > identifiers should be at the level of "package" and "study" respectively. > bp > > On May 12, 2011, at 9:32 AM, Greenberg, Jane wrote: > > All -- I am not sure I've followed this conversation all the way, but > Hilmar's interpretation of dcterms:isPartOf (or simple/unqualified DC > w/dc:relation) makes good sense to me; and will help w/interoperabilty. > > W.r.t. PRISM, the Drayd 3.0 ap (application profile) has moved away from > PRISM to BIBO, see: http://bibliontology.com/specification. There seems to > be increased interest in and discussion of this scheme on various metadata > lists. I am a tad wary about the overall upkeep (latest version, etc.).... > but it is Sematic Web compatible, rdf, etc... > > Hope this helps. > > ~ jane > > -----Original Message----- > From: dry...@go... [mailto:dry...@go...] On > Behalf Of Hilmar Lapp > Sent: Thursday, May 12, 2011 9:06 AM > To: Rutger Vos > Cc: William Piel; Tre...@li...; Vision, Todd J; > DaveVieglais; Ryan Scherle; Dryad Developers > Subject: [dryad-dev] Re: [Treebase-devel] Dryad API, advice on > cyberinfastructure > > Or perhaps simply dc:relation, or dcterms:isPartOf [1], which is actually a > refinement of the former? > > I agree though that the relationship ought to be general enough that a DC or > dcterms property will probably best facilitate use, and I also wouldn't > kludge this - these become de-factor standards sooner than you want, and > then you're stuck with them. > > -hilmar > > [1] http://dublincore.org/documents/dcmi-terms/#terms-isPartOf > > Sent with a tap. > > On May 12, 2011, at 8:41 AM, Rutger Vos <R....@re...> wrote: > > On the other hand, I'd rather not do a schema change. But we have a > > field called "URL", which we could pre-populate with > > "http://dx.doi.org/10.5061/dryad.8661". This field is usually used > > for publications that don't have a doi, and instead have a particular > > url -- and it's rarely used. So one kluge (short of a schema change) > > is to pre-populate the url field with > > "http://dx.doi.org/10.5061/dryad.8661" and then expose that under the > > "prism:url" element. What would be a good element to use other than > "prism:url" or "prism:doi"? > > Citation.url is a string so we can put whatever we like there in order > > to store it. As for the element to store it under, maybe: > > - dcterms:source, defined as "A related resource from which the > > described resource is derived. The described resource may be derived > > from the related resource in whole or in part. Recommended best > > practice is to identify the related resource by means of a string > > conforming to a formal identification system." (or a subclass thereof, > > e.g. tb.source.study.dryad) > > - dcterms:isReferencedBy, defined as "A related resource that > > references, cites, or otherwise points to the described resource.", or > > dcterms:references, depending on how you would qualify the > > relationship between the two. Or prism:isReferencedBy > > - dcterms:identifier, defined as "An unambiguous reference to the > > resource within a given context." > > (or a subclass thereof, e.g. tb.identifier.study.dryad) > > - prism:isVersionOf, defined as "The described resource is a version, > > edition, or adaptation of the referenced resource." > > To me, dcterms:source sounds like a sensible candidate. > > > ------------------------------------------------------------------------------ > Achieve unprecedented app performance and reliability > What every C/C++ and Fortran developer should know. > Learn how Intel has extended the reach of its next-generation tools > to help boost performance applications - inlcuding clusters. > http://p.sf.net/sfu/intel-dev2devmay > > _______________________________________________ > Treebase-devel mailing list > Tre...@li... > https://lists.sourceforge.net/lists/listinfo/treebase-devel > > ------------------------------------------------------------------------------ > Achieve unprecedented app performance and reliability > What every C/C++ and Fortran developer should know. > Learn how Intel has extended the reach of its next-generation tools > to help boost performance applications - inlcuding clusters. > http://p.sf.net/sfu/intel-dev2devmay > _______________________________________________ > Treebase-devel mailing list > Tre...@li... > https://lists.sourceforge.net/lists/listinfo/treebase-devel > > -- Carl Boettiger UC Davis http://www.carlboettiger.info/ |
From: Hilmar L. <hl...@ne...> - 2011-05-12 22:17:23
|
On May 12, 2011, at 5:50 PM, Carl Boettiger <cbo...@gm...> wrote: > Is there anything more clever than a bunch of grep rules for distinguishing between these? Well, you can pack the semantics into the property, or into the resources that they connect. Personally, I loath RDF models that have a baroque number of properties because typically you end up having to hardcode these into applications that try to do something meaningful with the RDF. I.e., there is no good way to infer or deduce much about the semantics of properties. Conversely, the Semantic Web and LOD already provide the conventions and standards For resources to describe themselves on the web, so while figuring out what it is that a property connects a subject to requires a HTTP GET on the resource URI, it is the recommended (and I think most extensible) way of doing this. (Though unfortunately in our case this doesn't quite hold - yet; however, Crossref is finally starting to implement LOD-compliant DOI resolution, and the question is when DataCite DOIs will play ball too.) > Is there a way to get the URL of the actual data file (i.e. some csv file) through the dryad API? So far I can only return the website that contains the data file. Yes, it's documented on the Data Access API page on the Dryad wiki that I referred to earlier. -hilmar Sent with a tap. |
From: Carl B. <cbo...@gm...> - 2011-05-12 23:30:07
|
On Thu, May 12, 2011 at 3:17 PM, Hilmar Lapp <hl...@ne...> wrote: > > On May 12, 2011, at 5:50 PM, Carl Boettiger <cbo...@gm...> wrote: > > > Is there anything more clever than a bunch of grep rules for > distinguishing between these? > > Well, you can pack the semantics into the property, or into the resources > that they connect. Personally, I loath RDF models that have a baroque number > of properties because typically you end up having to hardcode these into > applications that try to do something meaningful with the RDF. I.e., there > is no good way to infer or deduce much about the semantics of properties. > Conversely, the Semantic Web and LOD already provide the conventions and > standards For resources to describe themselves on the web, so while figuring > out what it is that a property connects a subject to requires a HTTP GET on > the resource URI, it is the recommended (and I think most extensible) way of > doing this. (Though unfortunately in our case this doesn't quite hold - yet; > however, Crossref is finally starting to implement LOD-compliant DOI > resolution, and the question is when DataCite DOIs will play ball too.) > I think most of that went over my head for the moment. > > > Is there a way to get the URL of the actual data file (i.e. some csv > file) through the dryad API? So far I can only return the website that > contains the data file. > > Yes, it's documented on the Data Access API page on the Dryad wiki that I > referred to earlier. > Thanks, that's excellent, sorry I didn't realize what it was before. That just worked fine for me given the dryad short from identifier to the data. I'm curious what the flow would look like from a Dryad identifier to a study, i.e. if that would go through METS or OAI-PMH to get the identifiers for each data file. i.e. If I followed the discussion earlier, treeBASE would be able to provide the short form dryad identifer to the study. It looks like I could enter that into METS as outlined in (2) on the wiki, extract the identifiers to the data, enter each those into METS, and proceed as on the wiki to get the files? I see I can check the MIMETYPE once I'm on the METS page for a datafile. I don't suppose I can query by MIMETYPE? Thanks for the help, just started learning my way around the api. -Carl > > -hilmar > > Sent with a tap. > -- Carl Boettiger UC Davis http://www.carlboettiger.info/ |
From: Ryan S. <rsc...@gm...> - 2011-05-13 10:01:13
|
On Fri, May 13, 2011 at 12:30 AM, Carl Boettiger <cbo...@gm...> wrote: > > |That just worked fine for me given the dryad short from identifier to the > data. I'm curious what the flow would look like from a Dryad identifier to > a study, i.e. if that would go through METS or OAI-PMH to get the > identifiers for each data file. i.e. If I followed the discussion earlier, > treeBASE would be able to provide the short form dryad identifer to the > study. It looks like I could enter that into METS as outlined in (2) on the > wiki, extract the identifiers to the data, enter each those into METS, and > proceed as on the wiki to get the files? Yes, you could do that. If you're interested in getting the files directly (without going through the Dryad package objects), you could use the DataONE API, which is still a bit rough, but focuses more on the data files. The documentation (also a bit rough) is here: http://code.google.com/p/dryad/wiki/D1RESTfulAPI > I see I can check the MIMETYPE once I'm on the METS page for a datafile. > I don't suppose I can query by MIMETYPE? > Not yet, but you can obtain the file extensions easily through the DataONE API: http://datadryad.org/mn/object |
From: Ryan S. <rsc...@gm...> - 2011-05-13 10:12:30
|
On Fri, May 13, 2011 at 11:01 AM, Ryan Scherle <rsc...@gm...> wrote: > > Not yet, but you can obtain the file extensions easily through the DataONE > API: > http://datadryad.org/mn/object > Oops, I forgot the DataONE API *does* support search by MIME type: http://datadryad.org/mn/object?objectFormat=text/plain -- Ryan |