From: Mike P. <mi...@sy...> - 2013-11-14 13:06:24

Seems interesting. Give me a call if you want to talk about it.

From: Jeremy J Carroll <jj...@sy...>
Date: Wednesday, November 13, 2013 8:48 PM
To: Big...@li...
Subject: Re: [Bigdata-developers] analysis of 770 and 773: cardinality of ?a p* ?b

[Jeremy's proposal of 2013-11-14 01:49, quoted in full; see the next message.]

From: Jeremy J C. <jj...@sy...> - 2013-11-14 01:49:07

Here is a proposal for the values returned by ALPP getEstimatedCardinality() where lowerBound() == 0.

Calculate the result from the single child, as with the code before my commit 7442. Then, if lowerBound() == 0:

- if one end is bound, add 1 to the result;
- if two ends are bound, add 1 to the result if the two ends are equal, otherwise 0;
- if both ends are unbound, add a large number to the result, where the large number should ideally be the number of non-literal nodes in the context, maybe using:

    StatementPatternNode sp = alpp.get(0).get(0);
    final IV<?, ?> c = getIV(sp.c(), exogenousBindings);
    long card = db.getAccessPath(null, null, null, c, null).rangeCount(false);

i.e. attempt to address the issues by improving the estimate of the cardinality in the relevant cases.

I will think about how to make appropriate test cases … feels like using the optimizer test case pattern from com.bigdata.rdf.sparql.ast.optimizers.TestAll.

If this looks acceptable I can have a shot tomorrow ...

Jeremy J Carroll
Principal Architect
Syapse, Inc.

On Nov 13, 2013, at 5:30 PM, Jeremy J Carroll <jj...@sy...> wrote:

[Jeremy's message of 2013-11-14 01:30, quoted in full; see the next message.]
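
For concreteness, a sketch of the rule proposed above, written against the accessor names that appear elsewhere in the thread (lowerBound(), left(), right(), subgroup()). The child-estimate call and estimatedNodesInContext() are hypothetical stand-ins (the latter for the rangeCount lookup sketched above); this illustrates the proposal, not committed code:

    public long getEstimatedCardinality(final StaticOptimizer opt) {
        // Start from the single child's estimate, as before commit 7442.
        long card = subgroup().getEstimatedCardinality(opt); // assumed accessor
        if (lowerBound() == 0) {
            final boolean leftIsVar = left() instanceof VarNode;
            final boolean rightIsVar = right() instanceof VarNode;
            if (!leftIsVar && !rightIsVar) {
                // Two bound ends: the zero-length path matches iff they are equal.
                if (left().equals(right())) card += 1;
            } else if (!leftIsVar || !rightIsVar) {
                // Exactly one bound end: the zero-length path adds one solution.
                card += 1;
            } else {
                // Both ends unbound: notionally every non-literal node in the
                // context matches; approximate via a range count on the context.
                card += estimatedNodesInContext(); // hypothetical helper
            }
        }
        return card;
    }
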
From: Jeremy J C. <jj...@sy...> - 2013-11-14 01:30:41

My commit 7442 introduced some problems while solving

https://sourceforge.net/apps/trac/bigdata/ticket/739

My commit concerned zero-length property paths, where the query in trac 739 was misbehaving because a zlpp needs to be run last … the actual estimate could be the number of items in the current graph context, but I put Long.MAX_VALUE (in commit 7442, which should be visible here: https://github.com/jeremycarroll/bigdata/commit/9f93a2b752bbfcee84f0e8c1047d9a17fcf6223f ).

This had an unintended side effect of marking such ALPPs as not reorderable, because:

    public boolean isReorderable() {
        final long estCard = getEstimatedCardinality(null);
        return estCard >= 0 && estCard < Long.MAX_VALUE;
    }

On my machine I seem to be doing better on the examples from 770, 773 and 739 using the (definitely hacky):

    public long getEstimatedCardinality(StaticOptimizer opt) {
        final JoinGroupNode group = subgroup();
        /*
         * If lowerBound() is zero, and both ?s and ?o are variables, then
         * we (notionally) match any subject or object in the triple store,
         * see:
         *
         * http://www.w3.org/TR/2013/REC-sparql11-query-20130321/#defn_evalPP_ZeroOrOnePath
         *
         * Despite this not being implemented, the optimizer does better
         * knowing this correctly.
         */
        if (lowerBound() == 0 && left() instanceof VarNode && right() instanceof VarNode) {
            return Long.MAX_VALUE/2;
        }
        ….

Jeremy J Carroll
Principal Architect
Syapse, Inc.

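To see why the both-ends-unbound case is pathological for the estimate, consider the shape in the thread's subject line, ?a p* ?b: the zero-length component alone notionally matches every node in the graph. A hypothetical example (the ex:p predicate and prefix are assumptions, not from the thread):

    // Hypothetical worst case for the estimate: with both ends unbound,
    // the zero-length path matches every node before ex:p is even consulted.
    final String bothEndsUnbound =
        "PREFIX ex: <http://example.org/>\n" +
        "SELECT ?a ?b WHERE { ?a ex:p* ?b }";
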
From: Jeremy J C. <jj...@sy...> - 2013-11-13 23:00:48

I added the following ticket:

https://sourceforge.net/apps/trac/bigdata/ticket/773

I am now going through the various tickets and trying to build test cases etc. Hoping to spend at least the rest of the week on bigdata, hopefully some of next week too.

Jeremy J Carroll
Principal Architect
Syapse, Inc.

On Nov 12, 2013, at 8:06 AM, Mike Personick <mi...@sy...> wrote:

[Mike's message of 2013-11-12 16:07 and the earlier thread, quoted in full; see below.]

From: Bryan T. <br...@sy...> - 2013-11-12 17:46:27

The maven repository (http://www.systap.com/maven) is now back up and snapshots are now being published again.

Thanks,
Bryan

From: Jeremy J C. <jj...@sy...> - 2013-11-12 16:37:38

Sorry, I am deep in pulling my system together - hope to surface soon, but probably not today.

I feel that the issue is that the static optimizer is misordering ….
Jeremy J Carroll
Principal Architect
Syapse, Inc.
On Nov 12, 2013, at 8:06 AM, Mike Personick <mi...@sy...> wrote:
[Mike's message of 2013-11-12 16:07 and the earlier thread, quoted in full; see below.]

From: Mike P. <mi...@sy...> - 2013-11-12 16:07:12

I suspect in the 1s run the subClassOf* operator is running only once with ?c1 unbound, then is being hash joined against ?x rdf:type ?c1. In the 8s run bindings for ?c1 are flowing into the rdfs:subClassOf* operator and the fixed point has to be done more than once. Just my hunch.

On 11/8/13 8:14 PM, "Jeremy J Carroll" <jj...@sy...> wrote:
[Jeremy's message of 2013-11-09 01:14 and the earlier thread, quoted in full; see below.]

From: Mike P. <mi...@sy...> - 2013-11-11 12:46:33

Can you send me the logging output from ArbitraryLengthPathOp for the 8s vs 1s cases with rdfs:subClassOf?

    log4j.logger.com.bigdata.bop.paths.ArbitraryLengthPathOp=ALL

On 11/8/13 8:14 PM, "Jeremy J Carroll" <jj...@sy...> wrote:
[Jeremy's message of 2013-11-09 01:14 and the earlier thread, quoted in full; see below.]

From: Jeremy J C. <jj...@sy...> - 2013-11-09 01:14:23

I kept drilling …

The actual comparison was as follows:

    ?x rdf:type / rdfs:subClassOf* ?c

8 s

    ?x rdf:type ?c1 .
    ?c1 rdfs:subClassOf* ?c .

8 s

    ?x rdf:type ?c1 .
    { ?c1 rdfs:subClassOf* ?c . }

1 s

Yes, there was a lot else going on in a large query, but that was what it came down to. Replacing the { ?c1 rdfs:subClassOf* ?c . } with syapse:optimizedSubClassOf shaved a further 0.1 s off and didn't need the { }.

My attempt to use named solution sets failed miserably: I think new trac item 771 is the root cause of that failure.

Jeremy J Carroll
Principal Architect
Syapse, Inc.

On Nov 8, 2013, at 4:12 PM, Jeremy J Carroll <jj...@sy...> wrote:
[Jeremy's message of 2013-11-09 00:12 and the earlier thread, quoted in full; see below.]
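
Mike's reply of 2013-11-12 16:07 (above) suggests why the braced variant wins: the sub-group isolates the path, so the subClassOf* operator runs once with ?c1 unbound and the result is hash-joined. For reference, a sketch of the fast variant written out as a complete query string; the prefixes and the SELECT projection are assumptions, since only the fragments appear in the thread:

    // Illustrative only: the ~1 s formulation, with the property path
    // isolated in its own join group.
    final String fastVariant =
        "PREFIX rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#>\n" +
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n" +
        "SELECT ?x ?c WHERE {\n" +
        "  ?x rdf:type ?c1 .\n" +
        "  { ?c1 rdfs:subClassOf* ?c . }\n" + // the isolated join group
        "}";
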
From: Jeremy J C. <jj...@sy...> - 2013-11-09 00:12:48

My data was incorrect.

The performance gain for memoization is much less than I had said.

I think maybe I actually hit an issue to do with

    ?x rdf:type / rdfs:subClassOf* ?c

as opposed to

    ?x rdf:type ?c1 .
    ?c1 rdfs:subClassOf* ?c .

This would have to implicate the static optimization phase … I will try and get a clear test case.

On my experiments today there is some gain with memoization, but it seems to be in the 10% area, and very hard to achieve using solution sets, where renaming variables so that sub-components can be combined for more complex effects is just difficult.

Jeremy J Carroll
Principal Architect
Syapse, Inc.

On Nov 8, 2013, at 12:50 PM, Bryan Thompson <br...@sy...> wrote:

[Bryan's message of 2013-11-08 20:51, quoted in full; see the next message.]

From: Bryan T. <br...@sy...> - 2013-11-08 20:51:29

There are complexities (related to the MVCC semantics) for memoization with invalidation. For example, invalidation should not be applied to concurrent queries (nor to concurrent writers if using read/write tx) when an update would change the memoized result, e.g., for subClassOf. Truth maintenance does handle this correctly since the maintenance is done within the same connection as the update - in fact, truth maintenance can be thought of as a pre-commit protocol.

Truth maintenance does have some limits. First, there needs to be a single writer for truth maintenance, so it does not work with read/write tx updates (but read-only tx queries are fine). Second, truth maintenance does not work with quads. For quads, the issue is simply the pattern to be applied when combining existing assertions in the different named graphs, and where to put those assertions. This has been discussed a bit in the past. One simple pattern is to draw from all graphs and put the assertions into some designated graph (or perhaps the "null" graph). Another pattern is to identify some relationship among the named graphs (e.g., graphs X, Y and Z are ontologies that are used to compute inferences in the other graphs, which are "data"), or even to perform inference solely within a given named graph.

I think that a narrow application of truth maintenance for specific closures combined with ALPPs might work quite well.

Thanks,
Bryan

On 11/8/13 11:50 AM, "Jeremy J Carroll" <jj...@sy...> wrote:

[Jeremy's original message of 2013-11-08; it is quoted in full in the last message of this thread.]

From: Bryan T. <br...@sy...> - 2013-11-08 18:55:14

Yes, that is a not-yet-implemented concept (the ISPO[] parameters to allow you to control the life cycle and backing data structure for the named solution sets).

Bryan

From: Jeremy Carroll <jj...@sy...>
Date: Friday, November 8, 2013 1:49 PM
To: Big...@li...
Subject: [Bigdata-developers] solution sets - transient ??

[Jeremy's question of 2013-11-08 18:49, quoted in full; see the next message.]

From: Jeremy J C. <jj...@sy...> - 2013-11-08 18:49:34
|
Hi

I am looking at both the code and the documentation for solution sets (concerning my subClassOf* issue), and there seems to be a discrepancy. Specifically:

https://sourceforge.net/apps/mediawiki/bigdata/index.php?title=SPARQL_Update#CREATE_SOLUTIONS

documents the existence of query hints giving "control over the how the named solution set is provisioned, including whether it is transient or persistent, the life cycle of the named solution set, etc.", whereas com.bigdata.rdf.sparql.ast.ssets.SolutionSetManager._create(String, ISPO[]) says:

/**
 * Create iff it does not exist.
 *
 * @param solutionSet
 *            The name.
 * @param params
 *            The configuration parameters.
 *
 * @return A solution set with NOTHING written on it.
 *
 * TODO ISPO[] params is ignored (you can not configure for a BTree
 * or HTree index for the solutions with a specified set of join
 * variables for the index).
 */
private SolutionSetStream _create(final String fqn, final ISPO[] params)

I take it that the TODO comment and the code are correct? (Bryan, if you grant me write access on the Wiki, I am happy to correct the documentation to indicate this as a possible future extension.)

Jeremy J Carroll
Principal Architect
Syapse, Inc.
|
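For context, the wiki's description suggests usage along the following lines. The %-prefixed set name and the INSERT INTO / INCLUDE spellings are reconstructed from memory of the bigdata SPARQL extensions and should be treated as assumptions rather than verified syntax.

PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

# Materialize the solutions of a query under a named solution set ...
INSERT INTO %subClassClosure
SELECT ?sub ?super
WHERE { ?sub rdfs:subClassOf* ?super }

# ... and join against it later from another query:
SELECT ?object ?super
WHERE {
  ?object a ?sub .
  INCLUDE %subClassClosure .
}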
|
From: Bryan T. <br...@sy...> - 2013-11-08 17:11:10
|
You would want to use a custom inference model that has just the exact rules you need. Look at FullClosure or FastClosure, and at InferenceEngine.Options.

Bryan

On 11/8/13 12:03 PM, "Jeremy J Carroll" <jj...@sy...> wrote:
>
> Hmmm - actually I should try enabling truth maintenance at some minimal
> level and see what happens
>
> Jeremy J Carroll
> Principal Architect
> Syapse, Inc.
>
> On Nov 8, 2013, at 8:50 AM, Jeremy J Carroll <jj...@sy...> wrote:
>
>> [...]
|
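A sketch of what wiring in such a minimal model might look like. This is not tested code: the option constants (BigdataSail.Options.TRUTH_MAINTENANCE, AbstractTripleStore.Options.CLOSURE_CLASS) are recalled from the 1.2.x codebase and may differ, and com.example.SubClassOnlyClosure stands for a hypothetical BaseClosure subclass that registers only the subClassOf rules.

import java.util.Properties;

import com.bigdata.rdf.sail.BigdataSail;
import com.bigdata.rdf.store.AbstractTripleStore;

public class MinimalTruthMaintenanceSetup {

    public static void main(final String[] args) throws Exception {

        final Properties props = new Properties();

        // Where the journal lives (path is arbitrary).
        props.setProperty(BigdataSail.Options.FILE, "/tmp/minimal-tm.jnl");

        // Enable truth maintenance in the SAIL.
        props.setProperty(BigdataSail.Options.TRUTH_MAINTENANCE, "true");

        // Swap in a hypothetical closure program that registers only the
        // subClassOf rules (rdfs9/rdfs11) rather than the full
        // FastClosure/FullClosure rule sets.
        props.setProperty(AbstractTripleStore.Options.CLOSURE_CLASS,
                "com.example.SubClassOnlyClosure");

        final BigdataSail sail = new BigdataSail(props);
        sail.initialize();
        try {
            // ... load data; inferences are maintained incrementally ...
        } finally {
            sail.shutDown();
        }
    }
}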
|
From: Bryan T. <br...@sy...> - 2013-11-08 17:07:52
|
Running just that specific rule could help out a lot. This is an interesting direction, and not one we have experimented with. The only downside is that there needs to be an awareness on the developer's side of what is materialized and what is not. It would be nicer to rewrite the property path out of the query when it is known to be materialized.

Bryan

On 11/8/13 12:03 PM, "Jeremy J Carroll" <jj...@sy...> wrote:
>
> Hmmm - actually I should try enabling truth maintenance at some minimal
> level and see what happens
>
> Jeremy J Carroll
> Principal Architect
> Syapse, Inc.
>
> On Nov 8, 2013, at 8:50 AM, Jeremy J Carroll <jj...@sy...> wrote:
>
>> [...]
|
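Concretely, such a rewrite would turn Jeremy's query [A] into his query [B] without the application having to know about the materialization. Shown as complete queries for clarity; syapse: is Jeremy's application namespace, and the prefix binding here is assumed.

PREFIX rdf:    <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs:   <http://www.w3.org/2000/01/rdf-schema#>
PREFIX syapse: <http://example.org/syapse#>   # placeholder URI

# [A] as written: the ALPP is evaluated at query time.
SELECT ?object ?class
WHERE { ?object rdf:type/rdfs:subClassOf* ?class }

# [B] what the hypothetical rewrite would emit when the closure is
# known to be materialized.
SELECT ?object ?class
WHERE { ?object rdf:type/syapse:optimizedSubClassOf ?class }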
|
From: Jeremy J C. <jj...@sy...> - 2013-11-08 17:04:06
|
Hmmm - actually I should try enabling truth maintenance at some minimal level and see what happens.

Jeremy J Carroll
Principal Architect
Syapse, Inc.

On Nov 8, 2013, at 8:50 AM, Jeremy J Carroll <jj...@sy...> wrote:
> [...]
|
|
From: Jeremy J C. <jj...@sy...> - 2013-11-08 17:03:11
|
Starting mid next week, I am aiming to spend 3 to 5 days on bigdata work [after I have wrapped up the 'normal user' part of my Syapse port, before moving on to the 'admin user' part. I need to complete the application-level materialization (syapse:optimizedSubClassOf) discussed in my previous e-mail, and then do some moderately extensive tidying up].

Of the trac items currently assigned to me, my current priority is:

- 767 MINUS
- 769 update and alpp (needed for any sensible code for actually generating syapse:optimizedSubClassOf); I suspect 768 quads and alpp can be done at the same time
- 758 bds:search (maybe interacts with #581 Full text search AST optimizer does not work with nested subqueries)
- 759 multiple filters interfere

Jeremy J Carroll
Principal Architect
Syapse, Inc.
|
|
From: Jeremy J C. <jj...@sy...> - 2013-11-08 16:51:05
|
This message is highlighting a high-level issue to do with ALPPs versus materialized versions of the same query.

Yesterday I finished porting the final piece of the Syapse application's "normal user" functionality from our legacy knowledge base to bigdata. This piece was the faceted browser, which has a heavy dependency on some typing functionality: partial queries that I was writing as

[A] ?object rdf:type / rdfs:subClassOf * ?class

(this is a very small part of a big query that populates every cell of a faceted browse page)

The performance of the initial cut was very significantly lower than that of the legacy system. I got a big boost by pulling in a recent change from Mike, but even so I was not in the right ballpark.

On analysis the issue seemed to come down to the rdfs:subClassOf * expressions, and I can meet my performance expectations by materializing the reflexive transitive closure of this property, so that the query becomes

[B] ?object rdf:type / syapse:optimizedSubClassOf ?class

(approximately: I got a factor of 10 from Mike's changes and a further factor of maybe 5 from materializing)

The architectural question is: should the ALPP code actually do a materialization (which would need to be invalidated on update), probably controlled by an optimization hint, or by counting (e.g., if we evaluate rdfs:subClassOf * sufficiently frequently compared with the updates, then we should materialize)?

If it did, I imagine that the performance of the initial query [A] could approach that of the optimized query [B].

Arguments against (other than time and prioritization) are:
- this optimization is better done by the end user (as I am doing), where it can be guided by application knowledge (which is true for me: syapse:optimizedSubClassOf is strictly less than rdfs:subClassOf *, e.g. it is only reflexive on classes, and only on those classes that I care about in the sort of query I am supporting)
- the cache invalidation is also hard to get right in a general setting, whereas application-level knowledge can make cache invalidation trivial (in the Syapse application any change to the ontology is a pretty rare admin function, and we can invalidate all ontological caches on every change without any issue)

The argument for is that this is otherwise an improvement that is conceptually straightforward.

Jeremy J Carroll
Principal Architect
Syapse, Inc.
|
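For concreteness, the application-level materialization described above can be expressed as a SPARQL UPDATE run on each (rare) ontology change. This is a sketch under assumptions: the syapse: URI is a placeholder, the class restriction mirrors the "only reflexive on classes" point, and (per the later message on trac 769) running property paths inside UPDATE did not yet work at the time.

PREFIX rdfs:   <http://www.w3.org/2000/01/rdf-schema#>
PREFIX syapse: <http://example.org/syapse#>   # placeholder URI

# Rebuild the materialized closure from scratch on each ontology change.
DELETE WHERE { ?sub syapse:optimizedSubClassOf ?super };

INSERT { ?sub syapse:optimizedSubClassOf ?super }
WHERE {
  ?sub a rdfs:Class .          # only reflexive on classes
  ?sub rdfs:subClassOf* ?super .
}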
|
From: Jeremy J C. <jj...@sy...> - 2013-11-08 16:35:34
|
This is great! It is definitely going to help me know (nearly as quickly as Bryan) when I introduce regression issues!

Thanks

Jeremy J Carroll
Principal Architect
Syapse, Inc.

On Nov 6, 2013, at 6:52 PM, Peter Ansell <ans...@gm...> wrote:
> Bryan,
>
> It is visible now.
>
> Thanks,
>
> Peter
>
> On 7 November 2013 09:46, Bryan Thompson <br...@sy...> wrote:
>> Please try again. I made the jobs visible to anonymous users.
>> Bryan
>>
>> On 11/6/13 5:27 PM, "Peter Ansell" <ans...@gm...> wrote:
>>
>>> http://ci.bigdata.com:8080
|
|
From: Bryan T. <br...@sy...> - 2013-11-07 13:16:02
|
I believe that the problem for Igor et al. was that the jetty instance was limiting the request size, but I do not believe that the request limit was as little as 3500 bytes. I see this question as being more about the correct semantics of SPARQL UPDATE protocol requests.

Bryan

From: Mike Personick <mi...@sy...>
Date: Thursday, November 7, 2013 8:12 AM
To: Bryan Thompson <br...@sy...>, "Big...@li..." <Big...@li...>
Subject: Re: [Bigdata-developers] Fwd: [bigdata - Help] RE: Sparql Update Request Refactor

Isn't this the same problem that Igor et al. were having – SPARQL UPDATE requests that were too big? And they switched to a shared file system to get around it?

[...]
|
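If jetty's limit is in play, the relevant knob for URL-encoded POST bodies is the container's form-content size. A sketch for an embedded deployment: ContextHandler.setMaxFormContentSize is the stock jetty API (this is not taken from the bigdata NanoSparqlServer code itself), and the 10 MB figure is arbitrary.

import org.eclipse.jetty.server.Server;
import org.eclipse.jetty.servlet.ServletContextHandler;

public class LargeUpdateServer {

    public static void main(final String[] args) throws Exception {
        final Server server = new Server(8080);
        final ServletContextHandler context =
                new ServletContextHandler(ServletContextHandler.SESSIONS);
        // Raise the limit on URL-encoded POST bodies (SPARQL UPDATE sent
        // as application/x-www-form-urlencoded); the jetty default is small.
        context.setMaxFormContentSize(10 * 1024 * 1024);
        server.setHandler(context);
        server.start();
        server.join();
    }
}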
|
From: Mike P. <mi...@sy...> - 2013-11-07 13:13:44
|
Isn't this the same problem that Igor et al. were having – SPARQL UPDATE requests that were too big? And they switched to a shared file system to get around it?

From: Bryan Thompson <br...@sy...>
Date: Wednesday, November 6, 2013 8:29 PM
To: "Big...@li..." <Big...@li...>
Subject: [Bigdata-developers] Fwd: [bigdata - Help] RE: Sparql Update Request Refactor

Any comment on this forum post?

Bryan

Begin forwarded message:

[...]
|
|
From: Peter A. <ans...@gm...> - 2013-11-07 02:52:32
|
Bryan,

It is visible now.

Thanks,

Peter

On 7 November 2013 09:46, Bryan Thompson <br...@sy...> wrote:
> Please try again. I made the jobs visible to anonymous users.
> Bryan
>
> On 11/6/13 5:27 PM, "Peter Ansell" <ans...@gm...> wrote:
>
>> http://ci.bigdata.com:8080
|
|
From: Bryan T. <br...@sy...> - 2013-11-07 00:29:38
|
Any comment on this forum post?

Bryan

Begin forwarded message:

From: SourceForge.net <no...@so...>
Date: November 4, 2013 at 5:36:02 AM EST
To: SourceForge.net <no...@so...>
Subject: [bigdata - Help] RE: Sparql Update Request Refactor

Read and respond to this message at:
https://sourceforge.net/projects/bigdata/forums/forum/676946/topic/8801178
By: feugen24

Hi, I'm a bit confused. Both in the bigdata docs and in the sample code you provided, the update parameter is in the *query* string of the request, but the SPARQL spec says "Client requests for this operation must include exactly one SPARQL update request string (parameter name: update)" as a POST parameter, not a query parameter. As can be seen in the spec's table "2.2 update operation", the query string parameters are "none" or "graph uris". Also, none of the examples have "update" in the query string.

Also, I am working with version 1.2.3, and from http://www.w3.org/TR/sparql11-protocol/#update-bindings-http-examples I can't get "3.2.2 UPDATE using POST directly" to work; it returns the error "Content-Type not recognized as RDF: application/sparql-update". From that ticket it seems it was fixed in 1.2.2. "Update via POST with URL-encoded parameters" works OK, so for now I'll use that version.
|
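For reference, the two request forms that the SPARQL 1.1 Protocol actually sanctions for UPDATE, shown against a hypothetical /bigdata/sparql endpoint; in neither case does the update text belong in the URL query string.

# "update via POST directly" (the form reported failing on 1.2.x):
POST /bigdata/sparql HTTP/1.1
Host: example.org
Content-Type: application/sparql-update

INSERT DATA { <http://example.org/s> <http://example.org/p> <http://example.org/o> }

# "update via URL-encoded POST" (the form reported working):
POST /bigdata/sparql HTTP/1.1
Host: example.org
Content-Type: application/x-www-form-urlencoded

update=INSERT%20DATA%20%7B%20%3Chttp%3A%2F%2Fexample.org%2Fs%3E%20%3Chttp%3A%2F%2Fexample.org%2Fp%3E%20%3Chttp%3A%2F%2Fexample.org%2Fo%3E%20%7D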
|
From: Bryan T. <br...@sy...> - 2013-11-06 22:48:00
|
Please try again. I made the jobs visible to anonymous users.

Bryan

On 11/6/13 5:27 PM, "Peter Ansell" <ans...@gm...> wrote:

> http://ci.bigdata.com:8080
|
|
From: Peter A. <ans...@gm...> - 2013-11-06 22:27:42
|
Hi Bryan,

It doesn't look like the job is publicly visible right now.

Cheers,

Peter

On 7 November 2013 07:53, Bryan Thompson <br...@sy...> wrote:
> Sorry, make that http://ci.bigdata.com:8080. The runs are currently about
> 1-1/2 hours each. The last one failed on a zookeeper cleanup. The current
> CI run should go to completion.
> Thanks,
> Bryan
>
> From: Bryan Thompson <br...@sy...>
> Date: Wednesday, November 6, 2013 3:49 PM
> To: "Big...@li..." <Big...@li...>
> Subject: [Bigdata-developers] ci.bigdata.com
>
> We are standing up CI on a node (http://ci.bigdata.com) that will be visible
> to everyone (read-only). Hopefully this will provide added transparency. I
> am still working through the configuration of this service, but it is very
> close to delivering good builds.
>
> Once CI is running smoothly on EC2, I will look at how to export the maven
> artifacts generated by CI. I believe that we will be able to do this
> through a plug-in, but that may mean that the maven artifact location will
> change. I will look at this more tomorrow.
>
> Thanks,
> Bryan
|