|
From: Bryan T. <br...@sy...> - 2010-02-22 23:38:36
|
I would like to solicit some input on the question of whether the primary index for the quad store should be SPOC (it is today) or CSPO. There has been some discussion on this issue in the past. I am raising the issue again in the light of discussions where an entire context corresponding to a relatively large collection of statements is to be dropped, e.g., wikipedia when mapped onto a single context, and when eventual consistency is being used for the secondary indices (that is, we handle conflict resolution on the primary statement index, e.g., SPOC, and then have a restart safe protocol guaranteeing eventual updates on the secondary statement indices). I have come around to the opinion that mapping that much data onto a single context is generally wrong. The information would be more readily managed by mapping it onto a set of contexts corresponding to individual wikipedia entries, each of which was then associated with the source using statements about that context. Thoughts? Bryan |