From: <ant...@gm...> - 2007-11-17 23:48:07
jm wrote:
> Hi,
>
> I am trying to upgrade to 1.0.1b from alpha2. I think I am mostly
> done, but one thing is bothering me. I am only interested in text
> extraction (I don't care about numbers, dates etc.) from all sorts of
> files. Before, I just collected data from a number of fields:
>
> public static URI[] DATA_FIELDS = { DATA.title, DATA.creator,
>     DATA.subject, DATA.generator, DATA.keyword,
>     DATA.description, DATA.fullText };
>
> But now I see DATA is gone, and there is a bunch of classes in
> org\semanticdesktop\aperture\vocabulary\ that seem to do its work. But
> I don't know which ones I should use. Is there a succinct way to
> collect text from all places without going through each of the classes
> in that package?
>
> thank you in advance
> javi
>

This is the most fundamental change in this release. The entire data model has been redesigned. You can see the results at

http://www.semanticdesktop.org/ontologies/2007/01/19/nie/

or, better yet, look at the current version in the trunk, which is the one Aperture 1.0.1-beta is based on. It will be published at its proper place soon.

<http://dev.nepomuk.semanticdesktop.org/repos/tags/nie-tags/revision8/htmldocs/nie.html>
<http://dev.nepomuk.semanticdesktop.org/repos/tags/nie-tags/revision8/htmldocs/nfo.html>
<http://dev.nepomuk.semanticdesktop.org/repos/tags/nie-tags/revision8/htmldocs/nco.html>
<http://dev.nepomuk.semanticdesktop.org/repos/tags/nie-tags/revision8/htmldocs/nmo.html>
<http://dev.nepomuk.semanticdesktop.org/repos/tags/nie-tags/revision8/htmldocs/ncal.html>
<http://dev.nepomuk.semanticdesktop.org/repos/tags/nie-tags/revision8/htmldocs/nid3.html>
<http://dev.nepomuk.semanticdesktop.org/repos/tags/nie-tags/revision8/htmldocs/nexif.html>

They contain everything the old Aperture ontologies contained (DATA, VCARD, ICAL, ICALTZD, DCES, DCTERMS, DCTYPE) and much more, all gathered together in a coherent whole.

The simple properties you mentioned are all present in the main part, NIE:

    NIE.title
    NIE.subject
    NIE.generator
    NIE.description
    NIE.plainTextContent (was DATA.fullText)

The creator property has been moved to NCO, the Nepomuk Contact Ontology, where it is available as NCO.creator. Note that its value is no longer a string with the name. It points to a more complex structure: an instance of the RDF class nco:Contact. That instance is usually described by several triples, the most important being nco:fullname.

Unfortunately, this means you can't get the name of a creator through the simple RDFContainer API anymore. You need to dig it out of the RDFContainer with the Model API.

If you're unfamiliar with RDF and the above explanation seems confusing, let me know.

The question "How do I find out which properties are contained in the RDFContainer produced by Aperture component xxx?" has been recurring on this list with some regularity. We need to work on the documentation.

Antoni Mylka
ant...@gm...
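
To make the two-step creator lookup described above more concrete, here is a minimal, self-contained sketch in plain Java. It deliberately does not use the Aperture or RDF2Go classes: the Triple class, the TextCollector class, the sample data and the prefixed names written as strings ("nie:title", "nco:creator", ...) are all hypothetical stand-ins for illustration. The only point is the shape of the lookup: the NIE text properties hold literals directly, while nco:creator holds a reference to a contact node whose nco:fullname has to be fetched in a second step.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustration only: a toy triple list, NOT the Aperture/RDF2Go API.
class TextCollector {

    /** One RDF statement, reduced to three strings for this sketch. */
    static final class Triple {
        final String subject;
        final String predicate;
        final String object;

        Triple(String subject, String predicate, String object) {
            this.subject = subject;
            this.predicate = predicate;
            this.object = object;
        }
    }

    // Properties from the message whose values are plain text literals.
    static final List<String> TEXT_PROPERTIES = Arrays.asList(
            "nie:title", "nie:subject", "nie:generator",
            "nie:description", "nie:plainTextContent");

    /** Returns the objects of all triples matching (subject, predicate, *). */
    static List<String> objectsOf(List<Triple> triples, String subject, String predicate) {
        List<String> result = new ArrayList<>();
        for (Triple t : triples) {
            if (t.subject.equals(subject) && t.predicate.equals(predicate)) {
                result.add(t.object);
            }
        }
        return result;
    }

    /** Collects all text for one resource, including creator names. */
    static List<String> collectText(List<Triple> triples, String resource) {
        List<String> text = new ArrayList<>();
        // Step 1: properties that carry the text directly.
        for (String property : TEXT_PROPERTIES) {
            text.addAll(objectsOf(triples, resource, property));
        }
        // Step 2: nco:creator points at a contact node, not at a string,
        // so the name is one hop further, under nco:fullname.
        for (String contact : objectsOf(triples, resource, "nco:creator")) {
            text.addAll(objectsOf(triples, contact, "nco:fullname"));
        }
        return text;
    }

    /** Hypothetical sample data: one document with one creator. */
    static List<Triple> sampleTriples() {
        return Arrays.asList(
                new Triple("file:report.doc", "nie:title", "Quarterly report"),
                new Triple("file:report.doc", "nie:plainTextContent", "Sales grew."),
                new Triple("file:report.doc", "nco:creator", "_:contact1"),
                new Triple("_:contact1", "nco:fullname", "Jane Doe"));
    }

    public static void main(String[] args) {
        System.out.println(collectText(sampleTriples(), "file:report.doc"));
    }
}
```

In real code, step 1 would presumably go through the RDFContainer as before, with the NIE constants in place of the DATA ones, and step 2 through the underlying model; the exact calls depend on the Aperture and RDF2Go versions in use, so check them against the Javadoc.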