Re: [dotNetRDF-Develop] Integrating SPIN into dotnetrdf

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

Rob, Tom

After more reading I believe a dotnetrdf/SPIN/Fluent SPARQL would be an
excellent solution.  In fact I would think people would convert to C#
dotnetrdf due to this compiling story.

Strategy Questions:

Due to my knowledge, time constraints, and past experience I know if I
attempt SPIN dotnetrdf alone it will drag out and never complete.  Can you
let me know if any of the strategies below make any sense.  Both strategies
involve splitting the work with the understanding that we are all busy and
there is no real obligation.  Also I assume any knowledgeable new comer is
welcomed to contribute to the cause.  I believe the effort will obviously
benefit the community, but also provide a deep insight to the contributing
developers.

-Convert the Java TopBraid SPIN API <http://topbraid.org/spin/api/>  (Uses
Jena <http://jena.apache.org/>  interface) into C# starting with a
conversion tool like Tangible Java-
<http://www.tangiblesoftwaresolutions.com/Product_Details/Java_to_CSharp_Con
verter.html> >C# (I will purchase).  The tool will help with most of the
syntax, but there will be a lot of manual work.  The conversion will
obviously entail replacing the Jena <http://jena.apache.org/>  interface
with dotnetrdf.  The benefits of this strategy is we harness the
completeness, robustness, and future updates (re-port changes) of TopBraid
SPIN API.  The conversion should provide insights into the implementation
enabling custom tweaks.  It should also be possible to split the converted
Java files into 3 groups if you guys are up for the task.  I am in
preference of this solution primarily because we are essential creating a
SPIN inference engine based upon code from the people who invented SPIN.

-Follow your initial start at SPIN dotnetrdf and maybe use TopBraid SPIN API
as a reference (Difficult because approaches will diverge).  Rob are you a
take any small essential blocks and coordinate the overall effort?  Tom are
you able to help out, especially where Fluent SPARQL is utilized?  Again I
think the alternate strategy above has a lot of merit, but it must align
with Rob's vision.   No matter the strategy please look at the
implementation questions below to help clarify the high/low level picture.

Implementation Questions:

-Can you provide a high level overview of a SPIN dotnetrdf implementation so
I can ensure the final solution is acceptable.  Let's say you have the SPIN
rule for "adult rdfs:subclassof person" at the top-level (ie. owl:Thing) and
the user queries is Elvis a Adult.   Would dotnetrdf  SPIN spawn numerous
intermediate SPARQL queries to gather the important SPIN rules that must be
executed?  If RDF database is remote (ie. dbpedia) could this accumulative
delay become unacceptably (Over 3 seconds in my application).  Alternatively
is everything required by the user query obtained in 1 or 2 complex
intermediate queries which dotnetrdf SPIN creates and digests?  Basically if
you could paint the high level overview with an example I could then move to
the details.

-Can you provide a low-level overview of your SPIN dotnetrdf implementation
strategy.  I only partially understood you explanation below, which I know
will be a little clearer after looking at the code.  What is
spin-sparql-syntax.ttl and how does it fit into the puzzle?  What does it
mean to convert a query into SPIN RDF representation?  My naive picture is
that a user query is converted into internal query(s) to obtain SPIN RDF
rules, which are then executed by the new SPIN dotnetrdf engine.  Lastly
please explain a little more your view of turning a SPIN query into a query?

Thanks,

Kevin

From: Rob Vesse [mailto:rv...@do...] 
Sent: Friday, March 15, 2013 2:57 PM
To: Kevin
Cc: dotNetRDF Developer Discussion and Feature Request
Subject: Re: Integrating SPIN into dotnetrdf

Hey Kevin

Discussion inline:

From: Kevin <ke...@th...>
Date: Wednesday, March 13, 2013 7:40 PM
To: Rob Vesse <rv...@do...>
Subject: Integrating SPIN into dotnetrdf

Rob,

First thank you for your quality work you have done with the dotnetrdf
project.  I have seen a few different posts about your initiative to
integrate SPIN into dotnetrdf (ie.  SPIN Post
<http://answers.semanticweb.com/questions/537/experiences-with-spin> ).
After much reading on the subject it really seems that SPIN would really
propel/complement dotnetrdf.  I believe SPIN not only makes up for the
missing OWL inference (Via SPIN OWL-RL implementation), it also can expand
to suit the modelers imagination.  The fact that the rules are in SPARQL
makes for an unbeatable solution.  Should it matter my current effort
involves query a Virtuoso database (Some owl support) with dotnetrdf.  I
would really appreciate you taking a look at the questions below:

-Have you made any further progress on integrating SPIN into dotnetrdf?
Would you allow me to have the source code in its current state?  Could I
possibly be a contributor on this cause as I am not really equipped for the
full task?  In any case I would appreciate any source code which I could use
as a learning tool.

No I haven't had time to do anything on SPIN for a long time now.  I've been
primarily concentrating on getting core features stabilized such as the
SPARQL engine which are obviously fairly key to building stuff like SPIN on
top.

However I still don't have time to work on SPIN directly so if you want to
work on this please feel free, find the code in the mercurial repository at
https://bitbucket.org/dotnetrdf/dotnetrdf 

The previous and very minimal SPIN stub is under Libraries\Query\Spin,
create your own fork and then you can send pull requests as and when you
have something to 

The key things that need to be done to get the core of SPIN implemented are
as follows:

*	Update the current spin-sparql-syntax.ttl to a current version, it
likely doesn't represent the current version of the spec (this is primarily
a convenience reference for developers)
*	Finish the existing stubs for converting queries into their SPIN RDF
representation (see SpinSyntax.cs)
*	Write code to turn a RDF encoding of a SPIN query into a query

The middle one would be the easiest to start with since there is already
some partial stubs to get you started.

-From the available TopQuadrant documentation I have tried to deduce how
dotnetrdf might implement SPIN.  According to SPIN tutrial
<http://dallemang.typepad.com/my_weblog/2010/08/extending-owl-rl-.html> ,
TopBraid finds all SPIN inferecer rules and runs them when you hit play.
Would dotnetrdf SPIN inferencer only run the rules that are associated with
the class structure being queried?  Basically I am confused how dotnetrdf
decides when/how/which SPIN rules to run for a given query.

That's an implementation detail, we would control how and when rules get
run.  We need to get the basic implementation of SPIN done first before this
aspect of things gets implemented anyway.

-How much of SPIN could dotnetrdf possibly support.? It appears SPIN
contains Inference Rules, Constraint Checking, and ability to Isolate rules
for certain conditions.  Also the TopBraid tool seems to have  "User-Defined
SPARQL functions" and "SPIN Query Templates".  I imagine dotnetrdf would
have to keep up with any SPIN improvements.

All of those are supportable in some shape of form, until we have the core
of SPIN up and running we can't really implement those.  Most of those
features run on top of the SPIN core and so will ultimately just be
implementation details once we have a core to build upon.  User defined
SPARQL functions are basically just SPARQL queries that return a single
value and query templates are just parameterized queries both of which the
existing SPARQL engine is capable of supporting in one way or another.  So
it is just a case of exposing that functionality in the SPIN style.

Hope this is enough to get you started, if not please let us know,

Rob

Regards,

Kevin