|
From: Bryan T. <br...@sy...> - 2011-05-16 13:06:44
|
I am preparing release 0.84.0 from the trunk. The release notes are inline below. This release has several important bug fixes and includes a new feature for fast reverse provenance lookup. We will be doing another release very soon off the quads branch which will include the RWStore (10B+ triples on a single machine) and full native query evaluation. After that, our attention will turn to SPARQL 1.1 and analytic query support. I'll send out another message once this release has been posted. Thanks, Bryan This is a bigdata (R) release. This release is capable of loading 1B triples in under one hour on a 15 node cluster. JDK 1.6 is required. See [1] for instructions on installing bigdata(R), [2] for the javadoc and [3] and [4] for news, questions, and the latest developments. For more information about SYSTAP, LLC and bigdata, see [5]. Please note that we recommend checking out the code from SVN using the tag for this release. The code will build automatically under eclipse. You can also build the code using the ant script. The cluster installer requires the use of the ant script. You can checkout this release from the following URL: https://bigdata.svn.sourceforge.net/svnroot/bigdata/branches/BIGDATA_RELEASE_0_84_0 New features: - Inlining provenance metadata into the statement indices and fast reverse lookup of provenance metadata using statement identifiers (SIDs). Significant bug fixes: - The journal size could double in some cases following a restart due to a type in the WORMStrategy constructor. See https://sourceforge.net/apps/trac/bigdata/ticket/236 - Fixed a concurrency hole in the commit protocol for the Journal which could result in a concurrent modification to the B+Tree during the commit protocol. - Fixed a problem in the abort protocol for the BigdataSail: - Fixed a problem where the BigdataSail would permit the same thread to obtain more than one UNISOLATED connection: See https://sourceforge.net/apps/trac/bigdata/ticket/278 See https://sourceforge.net/apps/trac/bigdata/ticket/284 See https://sourceforge.net/apps/trac/bigdata/ticket/288 See https://sourceforge.net/apps/trac/bigdata/ticket/289 The road map [3] for the next releases include: - Single machine data storage to 10B+ triples; - 100% native SPARQL evaluation with lots of query optimizations; - High-volume analytic query workloads and SPARQL 1.1 query, including aggregations; - Simplified deployment, configuration, and administration for clusters. - High availability for the journal and the cluster; For more information, please see the following links: [1] https://sourceforge.net/apps/mediawiki/bigdata/index.php?title=Main_Page [2] https://sourceforge.net/apps/mediawiki/bigdata/index.php?title=GettingStarted [3] https://sourceforge.net/apps/mediawiki/bigdata/index.php?title=Roadmap [2] http://www.bigdata.com/bigdata/docs/api/ [3] http://sourceforge.net/projects/bigdata/ [4] http://www.bigdata.com/blog [5] http://www.systap.com/bigdata.htm About bigdata: Bigdata(r) is a horizontally-scaled, general purpose storage and computing fabric for ordered data (B+Trees), designed to operate on either a single server or a cluster of commodity hardware. Bigdata(r) uses dynamically partitioned key-range shards in order to remove any realistic scaling limits - in principle, bigdata(r) may be deployed on 10s, 100s, or even thousands of machines and new capacity may be added incrementally without requiring the full reload of all data. The bigdata(r) RDF database supports RDFS and OWL Lite reasoning, high-level query (SPARQL), and datum level provenance. |