From: Bryan T. <br...@sy...> - 2015-06-22 13:20:34
|
Ryan, Turn on the solutions trace in your log4j.properties. This will show you the solutions flowing out of each operator. You can look at this to see where the duplicate solutions are coming from. Please let me know what you find. Is it possible that you have a quads mode database and two copies of the data? This would then turn into a default graph query, but that should be the RDF merge of the named graphs and so it should also be distinct. The Explain of the query might also provide some insight. ## # Solutions trace (tab delimited file). Uncomment the next line to enable. #log4j.logger.com.bigdata.bop.engine.SolutionsLog=INFO,solutionsLog log4j.additivity.com.bigdata.bop.engine.SolutionsLog=false log4j.appender.solutionsLog=org.apache.log4j.ConsoleAppender #log4j.appender.solutionsLog=org.apache.log4j.FileAppender log4j.appender.solutionsLog.Threshold=ALL #log4j.appender.solutionsLog.File=solutions.csv #log4j.appender.solutionsLog.Append=true # I find that it is nicer to have this unbuffered since you can see what # is going on and to make sure that I have complete rule evaluation logs # on shutdown. #log4j.appender.solutionsLog.BufferedIO=false log4j.appender.solutionsLog.layout=org.apache.log4j.PatternLayout log4j.appender.solutionsLog.layout.ConversionPattern=SOLUTION:\t%m Bryan ---- Bryan Thompson Chief Scientist & Founder SYSTAP, LLC 4501 Tower Road Greensboro, NC 27410 br...@sy... http://blazegraph.com http://blog.bigdata.com <http://bigdata.com> http://mapgraph.io Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new technology to use GPUs to accelerate data-parallel graph analytics. CONFIDENTIALITY NOTICE: This email and its contents and attachments are for the sole use of the intended recipient(s) and are confidential or proprietary to SYSTAP. Any unauthorized review, use, disclosure, dissemination or copying of this email or its contents or attachments is prohibited. If you have received this communication in error, please notify the sender by reply email and permanently delete all copies of the email and its contents and attachments. On Mon, Jun 22, 2015 at 8:51 AM, ryan <ry...@os...> wrote: > Hi List, > > When I load the attached TTL into a blank blazegraph instance (1.5.1), I > get different results for these two queries (they differ only in the > "DISTINCT" keyword): > > > > SELECT DISTINCT ?People WHERE { > ?People a <http://semoss.org/ontologies/Person> > <http://semoss.org/ontologies/Person> . > } > > SELECT ?People WHERE { > ?People a <http://semoss.org/ontologies/Person> > <http://semoss.org/ontologies/Person> . > } > > > > The first query gives three results; the second gives six. All the Persons > in the query are unique, so I expect both queries to return identical > results. > > > > Puzzling addition: when I add a second clause to the DISTINCT query, I get > the expected results. > > > > SELECT DISTINCT ?People WHERE { > > ?People a <http://semoss.org/ontologies/Person> . > > ?People ?p ?o . > > } > > > > I can come up with no explanation for this problem. I'd welcome any > insights. > > > > Thanks, > > > > ry > > > > -- > > I would never want to leave this country > > Where roads are fast and knowledge easy > > --Kitchens of Distinction > > > > > ------------------------------------------------------------------------------ > Monitor 25 network devices or servers for free with OpManager! > OpManager is web-based network management software that monitors > network devices and physical & virtual servers, alerts via email & sms > for fault. Monitor 25 devices for free with no restriction. Download now > http://ad.doubleclick.net/ddm/clk/292181274;119417398;o > _______________________________________________ > Bigdata-developers mailing list > Big...@li... > https://lists.sourceforge.net/lists/listinfo/bigdata-developers > > |