From: Bryan T. <br...@sy...> - 2015-04-22 14:26:04

I would wait on this. There will not (should not) be any intermediate commits so what we need to do is log the allocators (and the shadow allocators used during group commit for unisolated index operations).

@Martyn: Can you suggest some logging that might capture what is happening with the allocators during the load before Andreas retries this operation?

Thanks,
Bryan
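
Pending Martyn's suggestions, a minimal sketch of one way to surface allocator activity is to raise the log level for the RWStore before re-running the load. The logger name below is an assumption inferred from the com.bigdata.rwstore.RWStore properties quoted later in this thread, and the log4j.properties path is a placeholder for your deployment:

    # Hypothetical sketch: raise RWStore logging so allocator activity shows up in
    # the server log during the load. The logger name and the log4j.properties
    # path are assumptions; adjust both to your installation.
    echo 'log4j.logger.com.bigdata.rwstore.RWStore=DEBUG' >> /path/to/log4j.properties
    # Restart Tomcat so the change takes effect before re-running the LOAD.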
From: Andreas K. <ka...@bs...> - 2015-04-22 12:32:41

There were no other concurrent queries. Just the one SPARQL LOAD. I have deleted the file in the meantime (after a bit of cleaning I had ~60GB, so the disk was full at that size). If I can run DumpJournal without a commit, I can easily re-run the Load up to the java.io.IOException thrown by the full disk.

Currently I have restarted the LOAD. I will wait until it breaks down (about 1h) and try to run DumpJournal on it.

Andreas
From: Bryan T. <br...@sy...> - 2015-04-22 12:03:43

Were you running any other operations concurrently against the database? Other updates or queries?

In general, it is helpful to get the metadata about the allocators and root blocks. However, from what you have written, it sounds like you terminated the process when the disk space filled up. In this case there would only be the original root blocks and no commit points recorded on the journal.

If you still have the file, can you run DumpJournal on it and send the output? The -pages option is not required in this case since we are only interested in the root blocks and allocators.

Thanks,
Bryan
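
A minimal sketch of the DumpJournal invocation requested above, assuming the journal path from Andreas's namespace properties quoted later in this thread; the jar name and classpath are placeholders that depend on how Blazegraph was built or deployed:

    # Sketch only: dump the root blocks and allocators of the journal (no -pages
    # needed here). The classpath/jar name is an assumption; point it at your
    # Blazegraph/Bigdata installation.
    java -cp bigdata.jar com.bigdata.journal.DumpJournal \
        /var/lib/bigdata/bigdata.jnl > dumpjournal.txt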
From: Andreas K. <ka...@bs...> - 2015-04-22 11:58:11

That was a newly created journal. I simply stopped tomcat, deleted bigdata.jnl and restarted.

Andreas
From: Bryan T. <br...@sy...> - 2015-04-22 11:46:29

Was the data loaded into a new and empty journal or into a pre-existing journal? If the latter, what size was the journal and what data were in it?

Thanks,
Bryan
From: Bryan T. <br...@sy...> - 2015-04-22 10:58:10

Andreas,

What command did you use to load the data set? I.e., SPARQL update "Load" or something else?

Thanks,
Bryan

On Wednesday, April 22, 2015, Andreas Kahl <ka...@bs...> wrote:

> Hello everyone,
>
> I recently updated to the current revision (f4c63e5) of Blazegraph from Git
> and tried to load a dataset into the updated webapp. With Bigdata 1.4.0 this
> resulted in a journal of ~18GB. Now the process was cancelled because the
> disk was full - the journal was beyond 50GB for the same file with the same
> settings. The only difference was that I activated GroupCommit.
>
> The dataset can be downloaded here:
> http://datendienst.dnb.de/cgi-bin/mabit.pl?cmd=fetch&userID=opendata&pass=opendata&mabheft=GND.rdf.gz
> Please find the settings used to load the file below.
>
> Do I have a misconfiguration, or is there a bug eating all disk space?
>
> Best regards
> Andreas
>
> Namespace properties:
> curl -H "Accept: text/plain" http://localhost:8080/bigdata/namespace/gnd/properties
>
> #Wed Apr 22 11:35:31 CEST 2015
> com.bigdata.namespace.kb.spo.com.bigdata.btree.BTree.branchingFactor=700
> com.bigdata.relation.container=gnd
> com.bigdata.rwstore.RWStore.smallSlotType=1024
> com.bigdata.journal.AbstractJournal.bufferMode=DiskRW
> com.bigdata.journal.AbstractJournal.file=/var/lib/bigdata/bigdata.jnl
> com.bigdata.rdf.store.AbstractTripleStore.vocabularyClass=de.bsb_muenchen.bigdata.vocab.B3KatVocabulary
> com.bigdata.journal.AbstractJournal.initialExtent=209715200
> com.bigdata.rdf.store.AbstractTripleStore.textIndex=true
> com.bigdata.btree.BTree.branchingFactor=700
> com.bigdata.rdf.store.AbstractTripleStore.axiomsClass=com.bigdata.rdf.axioms.NoAxioms
> com.bigdata.rdf.sail.isolatableIndices=false
> com.bigdata.service.AbstractTransactionService.minReleaseAge=1
> com.bigdata.rdf.sail.bufferCapacity=2000
> com.bigdata.rdf.sail.truthMaintenance=false
> com.bigdata.rdf.sail.namespace=gnd
> com.bigdata.relation.class=com.bigdata.rdf.store.LocalTripleStore
> com.bigdata.rdf.store.AbstractTripleStore.quads=false
> com.bigdata.journal.AbstractJournal.writeCacheBufferCount=500
> com.bigdata.search.FullTextIndex.fieldsEnabled=false
> com.bigdata.relation.namespace=gnd
> com.bigdata.journal.Journal.groupCommit=true
> com.bigdata.btree.writeRetentionQueue.capacity=10000
> com.bigdata.rdf.sail.BigdataSail.bufferCapacity=2000
> com.bigdata.rdf.store.AbstractTripleStore.statementIdentifiers=false
From: Andreas K. <ka...@bs...> - 2015-04-22 10:55:07

Bryan,

yes, I used this command:

curl -d"update=LOAD <file:///srv/feed-dateien/DNBLOD/GND.rdf.gz>;" -d"namespace=gnd" -d"monitor=true" http://localhost:8080/bigdata/sparql

Best Regards
Andreas
From: Lee K. <le...@sw...> - 2015-04-22 10:04:19

Hi Bryan,

Yes, the AST in the test is supposed to be for the query:

select (count(*) as ?c) where {
  select * where {
    select * where { ?s ?p ?o }
  } limit 21 offset 0
}

Thanks
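
For reference, a sketch of the same query with the wildcard projections expanded by hand into ?s ?p ?o, which is the rewrite that the expected AST in Lee's unit test (quoted later in this thread) describes; the endpoint URL is the one used elsewhere in this thread. This is only an illustration of the intended expansion, not a workaround, since users submit their own queries:

    # Sketch: the intended query with "*" expanded to explicit variables, sent to
    # the SPARQL endpoint used elsewhere in this thread (adjust the URL as needed).
    curl -G http://localhost:8080/bigdata/sparql \
      --data-urlencode 'query=
        SELECT (COUNT(*) AS ?c) {
          SELECT ?s ?p ?o {
            SELECT ?s ?p ?o WHERE { ?s ?p ?o }
          } LIMIT 21 OFFSET 0
        }'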
From: Bryan T. <br...@sy...> - 2015-04-21 18:53:30

Lee,

I can replicate the problem with your query (as given above) against the SPARQL end point.

Can you state the SPARQL that you are trying to model with this unit test? It does not appear to be quite the same as your SPARQL query above. I would like to make sure that it is being translated correctly into the AST. I can then look at the expected AST, work backwards, and see if I believe that the test shows the problem.

Thanks,
Bryan

On Tue, Apr 21, 2015 at 11:07 AM, Lee Kitching <le...@sw...> wrote:

> Hi Bryan,
>
> We allow users to enter their own SPARQL queries and wrap them to do things
> like pagination, so unfortunately we cannot just re-write our queries to do
> the expansion manually.
>
> I applied the fix detailed in the ticket and it fixes the issue for the query
> I provided, however it fails to rewrite the following query:
>
> SELECT (COUNT(*) as ?c) {
>   SELECT * {
>     SELECT * WHERE { ?s ?p ?o }
>   } LIMIT 21 OFFSET 0
> }
>
> I attempted to debug the issue, and it seems to re-write the * projection in
> the inner-most subquery but not the subquery with the limit and offset. I
> created a test based on the existing tests:
>
> public void test_wildcardProjectionOptimizer03() {
>
>     /*
>      * Note: DO NOT share structures in this test!!!!
>      */
>     final IBindingSet[] bsets = new IBindingSet[] {};
>
>     // The source AST.
>     final QueryRoot given = new QueryRoot(QueryType.SELECT);
>     {
>         final SubqueryRoot selectQuery = new SubqueryRoot(QueryType.SELECT);
>         {
>             final JoinGroupNode whereClause1 = new JoinGroupNode();
>             final StatementPatternNode spoPattern = new StatementPatternNode(new VarNode("s"), new VarNode("p"), new VarNode("o"), null, Scope.DEFAULT_CONTEXTS);
>             whereClause1.addChild(spoPattern);
>
>             final ProjectionNode p = new ProjectionNode();
>             p.addProjectionVar(new VarNode("*"));
>             selectQuery.setProjection(p);
>             selectQuery.setWhereClause(whereClause1);
>         }
>
>         final SubqueryRoot sliceQuery = new SubqueryRoot(QueryType.SELECT);
>         {
>             final ProjectionNode p = new ProjectionNode();
>             p.addProjectionVar(new VarNode("*"));
>             sliceQuery.setProjection(p);
>
>             final JoinGroupNode whereClause = new JoinGroupNode();
>             whereClause.addChild(selectQuery);
>
>             sliceQuery.setSlice(new SliceNode(0, 21));
>         }
>
>         final FunctionNode countNode = new FunctionNode(
>                 FunctionRegistry.COUNT,
>                 Collections.EMPTY_MAP,
>                 new VarNode("*"));
>
>         final ProjectionNode countProjection = new ProjectionNode();
>         countProjection.addProjectionExpression(new AssignmentNode(new VarNode("c"), countNode));
>
>         JoinGroupNode countWhere = new JoinGroupNode();
>         countWhere.addChild(sliceQuery);
>
>         given.setProjection(countProjection);
>         given.setWhereClause(countWhere);
>     }
>
>     final QueryRoot expected = new QueryRoot(QueryType.SELECT);
>     {
>         final SubqueryRoot selectQuery = new SubqueryRoot(QueryType.SELECT);
>         {
>             final JoinGroupNode whereClause1 = new JoinGroupNode();
>             final StatementPatternNode spoPattern = new StatementPatternNode(new VarNode("s"), new VarNode("p"), new VarNode("o"), null, Scope.DEFAULT_CONTEXTS);
>             whereClause1.addChild(spoPattern);
>
>             final ProjectionNode p = new ProjectionNode();
>             p.addProjectionVar(new VarNode("s"));
>             p.addProjectionVar(new VarNode("p"));
>             p.addProjectionVar(new VarNode("o"));
>             selectQuery.setProjection(p);
>             selectQuery.setWhereClause(whereClause1);
>         }
>
>         final SubqueryRoot sliceQuery = new SubqueryRoot(QueryType.SELECT);
>         {
>             final ProjectionNode p = new ProjectionNode();
>             p.addProjectionVar(new VarNode("s"));
>             p.addProjectionVar(new VarNode("p"));
>             p.addProjectionVar(new VarNode("o"));
>
>             sliceQuery.setProjection(p);
>
>             final JoinGroupNode whereClause = new JoinGroupNode();
>             whereClause.addChild(selectQuery);
>
>             sliceQuery.setSlice(new SliceNode(0, 21));
>         }
>
>         final FunctionNode countNode = new FunctionNode(
>                 FunctionRegistry.COUNT,
>                 Collections.EMPTY_MAP,
>                 new VarNode("*"));
>
>         final ProjectionNode countProjection = new ProjectionNode();
>         countProjection.addProjectionExpression(new AssignmentNode(new VarNode("c"), countNode));
>
>         JoinGroupNode countWhere = new JoinGroupNode();
>         countWhere.addChild(sliceQuery);
>
>         expected.setProjection(countProjection);
>         expected.setWhereClause(countWhere);
>     }
>
>     final IASTOptimizer rewriter = new ASTWildcardProjectionOptimizer();
>
>     final IQueryNode actual = rewriter.optimize(null/* AST2BOpContext */,
>             given/* queryNode */, bsets);
>
>     assertSameAST(expected, actual);
>
> }
>
> however I am having some problems running the tests locally so I don't know
> if it accurately models the situation.
>
> Thanks
>
> On Mon, Apr 20, 2015 at 9:05 PM, Bryan Thompson <br...@sy...> wrote:
>
>> Lee,
>>
>> I've updated the ticket with the code changes and the test changes. Please
>> try this out and let me know if you have any problems.
>>
>> Thanks,
>> Bryan
>>
>> On Mon, Apr 20, 2015 at 1:20 PM, Lee Kitching <le...@sw...> wrote:
>>
>>> Hi,
>>>
>>> We are currently evaluating using Blazegraph as our RDF database and have
>>> run into the issue described at http://trac.bigdata.com/ticket/757.
>>> The below query causes the AssertionError to be thrown:
>>>
>>> SELECT (COUNT(*) as ?c) {
>>>   SELECT ?uri ?graph where {
>>>     {
>>>       SELECT * WHERE {
>>>         GRAPH ?graph {
>>>           ?uri a <http://object> .
>>>           ?uri <http://purl.org/dc/terms/title> ?title .
>>>         }
>>>         MINUS {
>>>           ?uri a <http://other>
>>>         }
>>>       }
>>>       ORDER BY ?title
>>>     }
>>>   }
>>> }
>>>
>>> Some debugging shows that the error is caused by the
>>> ASTWildcardProjectionOptimizer failing to recurse into the subqueries to
>>> rewrite the * projection. However, this recursion is implemented in the
>>> BOpUtility.postOrderIterator(BOp) method - this method uses the argIterator
>>> to find child operators and therefore only visits children for nodes with
>>> an arity > 0.
>>>
>>> The root query node for the above query has an empty 'args' collection and
>>> all the associated components of the top-level query are stored in the
>>> annotations map. It looks like the iterator should search through the
>>> annotations rather than the args for query nodes.
>>>
>>> As there are a lot of implementations of the BOp interface, it seems that
>>> changing the postOrderIterator2(BOp) method is unlikely to be the correct
>>> fix. It seems that either the AST query nodes should override the arity()
>>> function to return the count of the annotations map, or the
>>> ASTWildcardProjectionOptimizer should use its own iterator for the nodes of
>>> the query. The latter option would be the least impactful change, but I am
>>> not familiar enough with the codebase to understand the correct fix.
>>>
>>> Any help in resolving the issue would be appreciated.
From: Lee K. <le...@sw...> - 2015-04-21 15:07:52
|
Hi Bryan, We allow users to enter their own SPARQL queries and wrap them to do things like pagination so unfortunately we cannot just re-write our queries to do the expansion manually. I applied the fix detailed in the ticket and it fixes the for the query I provided, however it fails to rewrite the following query: SELECT (COUNT(*) as ?c) { SELECT * { SELECT * WHERE { ?s ?p ?o } } LIMIT 21 OFFSET 0 } I attempted to debug the issue, and it seems to re-write the * projection in the inner-most subquery but not the subquery with the limit and offset. I created a test based on the existing tests: public void test_wildcardProjectionOptimizer03() { /* * Note: DO NOT share structures in this test!!!! */ final IBindingSet[] bsets = new IBindingSet[] {}; // The source AST. final QueryRoot given = new QueryRoot(QueryType.SELECT); { final SubqueryRoot selectQuery = new SubqueryRoot(QueryType.SELECT); { final JoinGroupNode whereClause1 = new JoinGroupNode(); final StatementPatternNode spoPattern = new StatementPatternNode(new VarNode("s"), new VarNode("p"), new VarNode("o"), null, Scope.DEFAULT_CONTEXTS); whereClause1.addChild(spoPattern); final ProjectionNode p = new ProjectionNode(); p.addProjectionVar(new VarNode("*")); selectQuery.setProjection(p); selectQuery.setWhereClause(whereClause1); } final SubqueryRoot sliceQuery = new SubqueryRoot(QueryType.SELECT); { final ProjectionNode p = new ProjectionNode(); p.addProjectionVar(new VarNode("*")); sliceQuery.setProjection(p); final JoinGroupNode whereClause = new JoinGroupNode(); whereClause.addChild(selectQuery); sliceQuery.setSlice(new SliceNode(0, 21)); } final FunctionNode countNode = new FunctionNode( FunctionRegistry.COUNT, Collections.EMPTY_MAP, new VarNode("*")); final ProjectionNode countProjection = new ProjectionNode(); countProjection.addProjectionExpression(new AssignmentNode(new VarNode("c"), countNode)); JoinGroupNode countWhere = new JoinGroupNode(); countWhere.addChild(sliceQuery); given.setProjection(countProjection); given.setWhereClause(countWhere); } final QueryRoot expected = new QueryRoot(QueryType.SELECT); { final SubqueryRoot selectQuery = new SubqueryRoot(QueryType.SELECT); { final JoinGroupNode whereClause1 = new JoinGroupNode(); final StatementPatternNode spoPattern = new StatementPatternNode(new VarNode("s"), new VarNode("p"), new VarNode("o"), null, Scope.DEFAULT_CONTEXTS); whereClause1.addChild(spoPattern); final ProjectionNode p = new ProjectionNode(); p.addProjectionVar(new VarNode("s")); p.addProjectionVar(new VarNode("p")); p.addProjectionVar(new VarNode("o")); selectQuery.setProjection(p); selectQuery.setWhereClause(whereClause1); } final SubqueryRoot sliceQuery = new SubqueryRoot(QueryType.SELECT); { final ProjectionNode p = new ProjectionNode(); p.addProjectionVar(new VarNode("s")); p.addProjectionVar(new VarNode("p")); p.addProjectionVar(new VarNode("o")); sliceQuery.setProjection(p); final JoinGroupNode whereClause = new JoinGroupNode(); whereClause.addChild(selectQuery); sliceQuery.setSlice(new SliceNode(0, 21)); } final FunctionNode countNode = new FunctionNode( FunctionRegistry.COUNT, Collections.EMPTY_MAP, new VarNode("*")); final ProjectionNode countProjection = new ProjectionNode(); countProjection.addProjectionExpression(new AssignmentNode(new VarNode("c"), countNode)); JoinGroupNode countWhere = new JoinGroupNode(); countWhere.addChild(sliceQuery); expected.setProjection(countProjection); expected.setWhereClause(countWhere); } final IASTOptimizer rewriter = new ASTWildcardProjectionOptimizer(); final 
IQueryNode actual = rewriter.optimize(null/* AST2BOpContext */, given/* queryNode */, bsets); assertSameAST(expected, actual); } however I am having some problems running the tests locally so I don't know if it accurately models the situation. Thanks On Mon, Apr 20, 2015 at 9:05 PM, Bryan Thompson <br...@sy...> wrote: > Lee, > > I've updated the ticket with the code changes and the test changes. Please > try this out and let me know if you have any problems. > > Thanks, > Bryan > > ---- > Bryan Thompson > Chief Scientist & Founder > SYSTAP, LLC > 4501 Tower Road > Greensboro, NC 27410 > br...@sy... > http://blazegraph.com > http://blog.bigdata.com <http://bigdata.com> > http://mapgraph.io > > Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance > graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints > APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new > technology to use GPUs to accelerate data-parallel graph analytics. > > CONFIDENTIALITY NOTICE: This email and its contents and attachments are > for the sole use of the intended recipient(s) and are confidential or > proprietary to SYSTAP. Any unauthorized review, use, disclosure, > dissemination or copying of this email or its contents or attachments is > prohibited. If you have received this communication in error, please notify > the sender by reply email and permanently delete all copies of the email > and its contents and attachments. > > On Mon, Apr 20, 2015 at 1:20 PM, Lee Kitching <le...@sw...> wrote: > >> Hi, >> >> We are currently evaluating using Blazegraph as our rdf database and have >> run in the issue described at http://trac.bigdata.com/ticket/757. >> The below query causes the AssertionError to be thrown: >> >> SELECT (COUNT(*) as ?c) { >> SELECT ?uri ?graph where { >> { >> SELECT * WHERE { >> GRAPH ?graph { >> ?uri a <http://object> . >> ?uri <http://purl.org/dc/terms/title> ?title . >> } >> MINUS { >> ?uri a <http://other> >> } >> } >> ORDER BY ?title >> } >> } >> } >> >> Some debugging shows that the error is caused by the >> ASTWildcardProjectionOptimizer failing to recurse into the subqueries to >> rewrite the * projection. However this recursion is implemented in the >> BOpUtility.postOrderIterator(BOp) method - this method uses the argIterator >> to >> find child operators and therefore only visits children for nodes with an >> arity > 0. >> >> The root query node for the above query has an empty 'args' collection >> and all the associated components of the top-level query are stored in the >> annotations map. It looks like the iterator should search through the >> annotations rather than the args for query nodes. >> >> As there are a lot of implementations of the BOp interface, it seems that >> changing the postOrderIterator2(BOp) method is unlikely to be the correct >> fix. It seems that either the AST query nodes should override the arity() >> function to return the count of the annotations map, or the >> ASTWildcardProjectionOptimizer should use its own iterator for the nodes of >> the query. The latter option would be the least impactful change but I am >> not familiar with the codebase to understand the correct fix. >> >> Any help in resolving the issue would be appreciated. 
>> >> >> ------------------------------------------------------------------------------ >> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT >> Develop your own process in accordance with the BPMN 2 standard >> Learn Process modeling best practices with Bonita BPM through live >> exercises >> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- >> event?utm_ >> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF >> _______________________________________________ >> Bigdata-developers mailing list >> Big...@li... >> https://lists.sourceforge.net/lists/listinfo/bigdata-developers >> >> > |
From: Bryan T. <br...@sy...> - 2015-04-20 20:05:28
|
Lee, I've updated the ticket with the code changes and the test changes. Please try this out and let me know if you have any problems. Thanks, Bryan ---- Bryan Thompson Chief Scientist & Founder SYSTAP, LLC 4501 Tower Road Greensboro, NC 27410 br...@sy... http://blazegraph.com http://blog.bigdata.com <http://bigdata.com> http://mapgraph.io Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new technology to use GPUs to accelerate data-parallel graph analytics. CONFIDENTIALITY NOTICE: This email and its contents and attachments are for the sole use of the intended recipient(s) and are confidential or proprietary to SYSTAP. Any unauthorized review, use, disclosure, dissemination or copying of this email or its contents or attachments is prohibited. If you have received this communication in error, please notify the sender by reply email and permanently delete all copies of the email and its contents and attachments. On Mon, Apr 20, 2015 at 1:20 PM, Lee Kitching <le...@sw...> wrote: > Hi, > > We are currently evaluating using Blazegraph as our rdf database and have > run in the issue described at http://trac.bigdata.com/ticket/757. > The below query causes the AssertionError to be thrown: > > SELECT (COUNT(*) as ?c) { > SELECT ?uri ?graph where { > { > SELECT * WHERE { > GRAPH ?graph { > ?uri a <http://object> . > ?uri <http://purl.org/dc/terms/title> ?title . > } > MINUS { > ?uri a <http://other> > } > } > ORDER BY ?title > } > } > } > > Some debugging shows that the error is caused by the > ASTWildcardProjectionOptimizer failing to recurse into the subqueries to > rewrite the * projection. However this recursion is implemented in the > BOpUtility.postOrderIterator(BOp) method - this method uses the argIterator > to > find child operators and therefore only visits children for nodes with an > arity > 0. > > The root query node for the above query has an empty 'args' collection and > all the associated components of the top-level query are stored in the > annotations map. It looks like the iterator should search through the > annotations rather than the args for query nodes. > > As there are a lot of implementations of the BOp interface, it seems that > changing the postOrderIterator2(BOp) method is unlikely to be the correct > fix. It seems that either the AST query nodes should override the arity() > function to return the count of the annotations map, or the > ASTWildcardProjectionOptimizer should use its own iterator for the nodes of > the query. The latter option would be the least impactful change but I am > not familiar with the codebase to understand the correct fix. > > Any help in resolving the issue would be appreciated. > > > ------------------------------------------------------------------------------ > BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT > Develop your own process in accordance with the BPMN 2 standard > Learn Process modeling best practices with Bonita BPM through live > exercises > http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- > event?utm_ > source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF > _______________________________________________ > Bigdata-developers mailing list > Big...@li... > https://lists.sourceforge.net/lists/listinfo/bigdata-developers > > |
From: Bryan T. <br...@sy...> - 2015-04-20 19:14:50
|
Lee,

How much of a blocker is this? You could replace * with the variables in the given query by hand. Or are your queries being generated such that you have less control over this?

You could certainly try the following. It will also recurse into the annotations.

@SuppressWarnings("unchecked")
final Iterator<QueryBase> itr = (Iterator<QueryBase>) new Striterator(
        BOpUtility.postOrderIteratorWithAnnotations((BOp) queryRoot.getWhereClause())
        ).addTypeFilter(QueryBase.class);

Thanks,
Bryan

----
Bryan Thompson
Chief Scientist & Founder
SYSTAP, LLC
4501 Tower Road
Greensboro, NC 27410
br...@sy...
http://blazegraph.com
http://blog.bigdata.com <http://bigdata.com>
http://mapgraph.io

Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new technology to use GPUs to accelerate data-parallel graph analytics.

CONFIDENTIALITY NOTICE: This email and its contents and attachments are for the sole use of the intended recipient(s) and are confidential or proprietary to SYSTAP. Any unauthorized review, use, disclosure, dissemination or copying of this email or its contents or attachments is prohibited. If you have received this communication in error, please notify the sender by reply email and permanently delete all copies of the email and its contents and attachments.

On Mon, Apr 20, 2015 at 1:20 PM, Lee Kitching <le...@sw...> wrote:

> Hi,
>
> We are currently evaluating using Blazegraph as our rdf database and have
> run in the issue described at http://trac.bigdata.com/ticket/757.
> The below query causes the AssertionError to be thrown:
>
> SELECT (COUNT(*) as ?c) {
>   SELECT ?uri ?graph where {
>     {
>       SELECT * WHERE {
>         GRAPH ?graph {
>           ?uri a <http://object> .
>           ?uri <http://purl.org/dc/terms/title> ?title .
>         }
>         MINUS {
>           ?uri a <http://other>
>         }
>       }
>       ORDER BY ?title
>     }
>   }
> }
>
> Some debugging shows that the error is caused by the
> ASTWildcardProjectionOptimizer failing to recurse into the subqueries to
> rewrite the * projection. However this recursion is implemented in the
> BOpUtility.postOrderIterator(BOp) method - this method uses the argIterator
> to find child operators and therefore only visits children for nodes with an
> arity > 0.
>
> The root query node for the above query has an empty 'args' collection and
> all the associated components of the top-level query are stored in the
> annotations map. It looks like the iterator should search through the
> annotations rather than the args for query nodes.
>
> As there are a lot of implementations of the BOp interface, it seems that
> changing the postOrderIterator2(BOp) method is unlikely to be the correct
> fix. It seems that either the AST query nodes should override the arity()
> function to return the count of the annotations map, or the
> ASTWildcardProjectionOptimizer should use its own iterator for the nodes of
> the query. The latter option would be the least impactful change but I am
> not familiar with the codebase to understand the correct fix.
>
> Any help in resolving the issue would be appreciated.
> > > ------------------------------------------------------------------------------ > BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT > Develop your own process in accordance with the BPMN 2 standard > Learn Process modeling best practices with Bonita BPM through live > exercises > http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- > event?utm_ > source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF > _______________________________________________ > Bigdata-developers mailing list > Big...@li... > https://lists.sourceforge.net/lists/listinfo/bigdata-developers > > |
From: Lee K. <le...@sw...> - 2015-04-20 18:28:19
|
Hi,

We are currently evaluating using Blazegraph as our RDF database and have run into the issue described at http://trac.bigdata.com/ticket/757. The query below causes the AssertionError to be thrown:

SELECT (COUNT(*) as ?c) {
  SELECT ?uri ?graph where {
    {
      SELECT * WHERE {
        GRAPH ?graph {
          ?uri a <http://object> .
          ?uri <http://purl.org/dc/terms/title> ?title .
        }
        MINUS {
          ?uri a <http://other>
        }
      }
      ORDER BY ?title
    }
  }
}

Some debugging shows that the error is caused by the ASTWildcardProjectionOptimizer failing to recurse into the subqueries to rewrite the * projection. This recursion is implemented in the BOpUtility.postOrderIterator(BOp) method, which uses the argIterator to find child operators and therefore only visits children for nodes with an arity > 0.

The root query node for the above query has an empty 'args' collection, and all the associated components of the top-level query are stored in the annotations map. It looks like the iterator should search through the annotations rather than the args for query nodes.

As there are a lot of implementations of the BOp interface, it seems that changing the postOrderIterator2(BOp) method is unlikely to be the correct fix. It seems that either the AST query nodes should override the arity() function to return the count of the annotations map, or the ASTWildcardProjectionOptimizer should use its own iterator for the nodes of the query. The latter option would be the least impactful change, but I am not familiar enough with the codebase to know the correct fix.

Any help in resolving the issue would be appreciated. |
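For what it is worth, the shape of the traversal described above can be sketched without the bigdata classes. This is only a conceptual illustration — the AstNode and method names below are hypothetical placeholders, not the BOp/BOpUtility API — but it shows why a post-order walk that only follows the ordered args never reaches a subquery hung off an annotation, and what a walk that also descends into annotation values looks like. The postOrderIteratorWithAnnotations call suggested elsewhere in this thread corresponds to the second variant.

import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Consumer;

// Hypothetical stand-in for an AST node: ordered children ("args") plus named
// annotations, some of whose values are themselves nodes (e.g. a nested subquery).
final class AstNode {
    final String label;
    final List<AstNode> args = new ArrayList<>();
    final Map<String, Object> annotations = new HashMap<>();
    AstNode(final String label) { this.label = label; }
}

final class AstTraversal {

    // Post-order walk over the args only: a node whose children live in its
    // annotations (arity == 0) is treated as a leaf, which matches the behaviour
    // reported for the wildcard projection optimizer.
    static void postOrderArgsOnly(final AstNode node, final Consumer<AstNode> visitor) {
        for (final AstNode child : node.args) {
            postOrderArgsOnly(child, visitor);
        }
        visitor.accept(node);
    }

    // Post-order walk that also descends into annotation values which are nodes,
    // so subqueries stored as annotations are visited and can be rewritten.
    static void postOrderWithAnnotations(final AstNode node, final Consumer<AstNode> visitor) {
        for (final AstNode child : node.args) {
            postOrderWithAnnotations(child, visitor);
        }
        for (final Object value : node.annotations.values()) {
            if (value instanceof AstNode) {
                postOrderWithAnnotations((AstNode) value, visitor);
            }
        }
        visitor.accept(node);
    }
}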
From: Brad B. <be...@sy...> - 2015-04-20 13:06:55
|
Alex, I wanted to pass along this thread as well: http://sourceforge.net/p/bigdata/discussion/676946/thread/156e8e70/?limit=25#2edb. Thanks, --Brad On Wed, Apr 15, 2015 at 9:02 AM, Alex Jouravlev < al...@bu...> wrote: > Hi Brad, > > Good question - why do I want to work directly? For starters, because I > don't know Sesame and didn't know why do I need another system. Plus, I am > mainly in the investigation stage. I wanted to try to do the following: > > - Try to work with temporal knowledge either using the fourth value, > or using RDR. In one of the projects, I need to run a reasoning at > particular time. > - I wanted to emulate the situation when several data modules are > independently version controlled. Say I convert an excel sheet into a set > of quads. Then the sheet is replaced. > - I wanted to experiment your truth maintenance mode. More > specifically, I want to be able to know the impact of new data. > - I want to see how you record inference rules, and how I can edit > them. > - Everything you write about inference with quads > > Also, I am piloting one project that should include a quad store (for > semantic integration of metadata), stanford NLP and some generation of > diagrams. I thought if I have it embedded, it would be easier to develop. > > Cheers, > > Alex > > Alex Jouravlev > Director, Business Abstraction Pty Ltd > Phone: +61-(2)-8003-4830 > Mobile: +61-4-0408-3258 > Web: http://www.businessabstraction.com > LinkedIn: http://au.linkedin.com/in/alexjouravlev/ > > On Wed, Apr 15, 2015 at 10:30 PM, Brad Bebee <be...@sy...> wrote: > >> Alex, >> >> One aspect to consider here is the ultimate scalability of your >> application. Currently, the HA and scale-out features are only support >> through integration with the REST API [1]. If you think you will need >> those features, then you should definitely go with the REST API. There is >> a Java client for that [2] that could be used in Scala with relatively >> little effort. You can implement Stored Queries and other features on the >> server-side to achieve high performance [3]. The upcoming 1.5.2 release at >> the end of this month will have transaction support at the REST and SAIL >> layers [4]. >> >> Did you have features in particular you wanted to use outside of the SAIL >> API? >> >> Thanks, --Brad >> >> [1] http://wiki.blazegraph.com/wiki/index.php/NanoSparqlServer#REST_API >> [2] http://wiki.blazegraph.com/wiki/index.php/JettyHttpClient >> [3] http://wiki.blazegraph.com/wiki/index.php/StoredQuery >> [4] http://trac.bigdata.com/ticket/1156 >> >> On Tue, Apr 14, 2015 at 7:43 PM, Alex Jouravlev < >> al...@bu...> wrote: >> >>> Hi everyone, >>> >>> What would be the first steps to start using bigdata in the full mode, >>> embedded into a scala/Lift application? I don't want to use Sesame layer >>> as, if I understand correctly, that would strip out a lot of interesting >>> features. >>> >>> <http://au.linkedin.com/in/alexjouravlev/> >>> >>> Alx >>> >>> >>> ------------------------------------------------------------------------------ >>> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT >>> Develop your own process in accordance with the BPMN 2 standard >>> Learn Process modeling best practices with Bonita BPM through live >>> exercises >>> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- >>> event?utm_ >>> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF >>> _______________________________________________ >>> Bigdata-developers mailing list >>> Big...@li... 
>>> https://lists.sourceforge.net/lists/listinfo/bigdata-developers >>> >>> >> >> >> -- >> _______________ >> Brad Bebee >> Managing Partner >> SYSTAP, LLC >> e: be...@sy... >> m: 202.642.7961 >> f: 571.367.5000 >> w: www.systap.com >> >> Blazegraph™ <http://www.blazegraph.com> is our ultra high-performance >> graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints >> APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new >> technology to use GPUs to accelerate data-parallel graph analytics. >> >> CONFIDENTIALITY NOTICE: This email and its contents and attachments are >> for the sole use of the intended recipient(s) and are confidential or >> proprietary to SYSTAP, LLC. Any unauthorized review, use, disclosure, >> dissemination or copying of this email or its contents or attachments is >> prohibited. If you have received this communication in error, please notify >> the sender by reply email and permanently delete all copies of the email >> and its contents and attachments. >> > > -- _______________ Brad Bebee Managing Partner SYSTAP, LLC e: be...@sy... m: 202.642.7961 f: 571.367.5000 w: www.systap.com Blazegraph™ <http://www.blazegraph.com> is our ultra high-performance graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new technology to use GPUs to accelerate data-parallel graph analytics. CONFIDENTIALITY NOTICE: This email and its contents and attachments are for the sole use of the intended recipient(s) and are confidential or proprietary to SYSTAP, LLC. Any unauthorized review, use, disclosure, dissemination or copying of this email or its contents or attachments is prohibited. If you have received this communication in error, please notify the sender by reply email and permanently delete all copies of the email and its contents and attachments. |
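For readers weighing the REST API route discussed above: the SPARQL endpoint can be exercised with nothing but the JDK, and the Java client referenced in [2] wraps the same protocol more conveniently. A minimal sketch — the host, the "kb" namespace, and the query are placeholder assumptions, not values taken from this thread:

import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.URL;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class SparqlQueryExample {
    public static void main(String[] args) throws Exception {
        // Placeholder endpoint; substitute your own host and namespace.
        final String endpoint = "http://localhost:9999/bigdata/namespace/kb/sparql";
        final String query = "SELECT * WHERE { ?s ?p ?o } LIMIT 10";

        // Standard SPARQL protocol: GET with the query passed as a URL parameter.
        final URL url = new URL(endpoint + "?query="
                + URLEncoder.encode(query, StandardCharsets.UTF_8.name()));
        final HttpURLConnection conn = (HttpURLConnection) url.openConnection();
        conn.setRequestMethod("GET");
        conn.setRequestProperty("Accept", "application/sparql-results+json");

        try (BufferedReader in = new BufferedReader(
                new InputStreamReader(conn.getInputStream(), StandardCharsets.UTF_8))) {
            String line;
            while ((line = in.readLine()) != null) {
                System.out.println(line);
            }
        } finally {
            conn.disconnect();
        }
    }
}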
From: Bryan T. <br...@sy...> - 2015-04-16 15:30:13
|
The transaction management API is logically scoped to the database, not the namespace. Therefore we need to pass in the RemoteRepositoryManager object to the BigdataSailRemoteRepository so we can access the transaction management API.

I just want to bring this up since there is ongoing confusion in the semantics of classes that attempt to simplify life for the "just one namespace" users (blueprints, sail factory). That simplification is still possible of course, but internally we need a RemoteRepositoryManager (which currently extends RemoteRepository). That extended subclass provides the "default" namespace.

I think it would make sense to make the RemoteRepository a parallel class and then always use a method to obtain a RemoteRepository for either the default namespace or a specific namespace:

public RemoteRepository getRepositoryForDefaultNamespace() // proposed method
public RemoteRepository getRepositoryForNamespace(final String namespace) // existing method

See http://wiki.blazegraph.com/wiki/index.php/NanoSparqlServer#Transaction_Management for the proposed transaction management API.

Bryan |
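A sketch of how the split described above might look from the caller's side. Only the two accessor names come from the proposal; the surrounding interfaces and the transaction placeholder are hypothetical illustrations, not the actual class hierarchy:

// Illustrative only: shows the scoping, not the real API.
interface RemoteRepository {
    // Per-namespace operations (queries, updates) would live here.
}

interface RemoteRepositoryManager {
    // Database-wide concerns, e.g. the transaction management API, belong to
    // the manager. The return type is a placeholder, not a real signature.
    Object beginTransactionPlaceholder();

    RemoteRepository getRepositoryForDefaultNamespace();          // proposed method
    RemoteRepository getRepositoryForNamespace(String namespace); // existing method
}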
From: Bryan T. <br...@sy...> - 2015-04-15 23:38:44
|
We do not (yet) support RDF*/SPARQL* (aka RDR) for quads. However, it is easy enough to model quads. <<:bob foaf:age 25>> inGraph <:graph1> This would give you something that had a concept of a named graph container (:graph1) but used only the indices for a triple store (SPO, POS, OSP) rather than the 6 indices for a quad store. Since the namespace would be in a triples mode, it would not understand the GRAPH ?g syntax in SPARQL. (The equivalent semantics could be obtained by writing an RDR triple pattern to pull out the inGraph statements, but you need to watch out for how you handle the same statement appearing in more than one graph. The "default graph" semantics say that the only the distinct triples are visited by the triple pattern. This is in contrast to named graph queries in which the distinct quads are visited. More or less. The actual definition has to do with the RDF merge of the triples in the named graphs that are visible to the query and is all pretty complex.) However, if you have a bunch of statements that naturally can be clustered into "graphs" and you want the metadata to be about the graph and not the individual statements in that graph, then just use quads. We do have RDR support for quads on the roadmap. Thanks, Bryan ---- Bryan Thompson Chief Scientist & Founder SYSTAP, LLC 4501 Tower Road Greensboro, NC 27410 br...@sy... http://blazegraph.com http://blog.bigdata.com <http://bigdata.com> http://mapgraph.io Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new technology to use GPUs to accelerate data-parallel graph analytics. CONFIDENTIALITY NOTICE: This email and its contents and attachments are for the sole use of the intended recipient(s) and are confidential or proprietary to SYSTAP. Any unauthorized review, use, disclosure, dissemination or copying of this email or its contents or attachments is prohibited. If you have received this communication in error, please notify the sender by reply email and permanently delete all copies of the email and its contents and attachments. On Wed, Apr 15, 2015 at 4:55 PM, Max Ott <ma...@ni...> wrote: > Hi, > > While the RDR extension would seem to fit my use case very well, it seems > to be limited to a single statement. What I would need is being able to > describe contexts (as in the fourth element of a quad store). Something > along the lines of: > > @prefix : <http://bigdata.com> . > @prefix foaf: <http://xmlns.com/foaf/0.1/> . > @prefix dct: <http://purl.org/dc/elements/1.1/> . > > <<:bob foaf:age 25; foaf:name "Bobby".>> dct:creator < > http://example.com/crawlers#c1> ; > dct:source <http://example.net/bobby.html> . > > Unfortunately, that doesn't seem to be supported. Or am I missing > something? > > Do I just need to forget about RDR in this case and figure out on how to > do this with named graphs and quads? It's really the query side which I'm > focusing now. > > Any help would be greatly appreciated. 
> -max > > > ------------------------------------------------------------------------------ > BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT > Develop your own process in accordance with the BPMN 2 standard > Learn Process modeling best practices with Bonita BPM through live > exercises > http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- > event?utm_ > source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF > _______________________________________________ > Bigdata-developers mailing list > Big...@li... > https://lists.sourceforge.net/lists/listinfo/bigdata-developers > > |
From: Jeremy J C. <jj...@sy...> - 2015-04-15 21:24:31
|
I will update later with a different experiment in which medium size (1000 or so triples) INSERTS (with no DELETE) are replaced by LOAD GRAPH calls Jeremy > On Apr 14, 2015, at 9:58 AM, Jeremy J Carroll <jj...@sy...> wrote: > > I found a CONSTRUCT and LOAD much more performant than a DELETE/INSERT, and was wondering why, and whether there is anything new (to me) about the blazegraph architecture that I should understand. > > ===== > > > I had a graph for which I wished to rename almost all URIs. > The graph had about 3M triples > I was working in AWS on > > I constructed a temporary graph with a rename mapping > and then tried the following update query: > > DELETE { > GRAPH <%(abox)s> { > ?oldS ?oldP ?oldO > } > } > INSERT { > GRAPH <%(abox)s> { > ?newS ?newP ?newO > } > } > WHERE { > graph <%(abox)s> { > ?oldS ?oldP ?oldO > } > GRAPH <x-eg:temporary-graph> { > ?oldS <x-eg:replaced-by> ?newS > } > GRAPH <x-eg:temporary-graph> { > ?oldP <x-eg:replaced-by> ?newP > } > { > GRAPH <x-eg:temporary-graph> { > ?oldO <x-eg:replaced-by> ?newO > } > } UNION { > graph <%(abox)s> { > ?oldS ?oldP ?oldO > } > FILTER ( isLiteral(?oldO) ) > BIND ( ?oldO as ?newO ) > } > } > > > > where <%(abox)s> is a variable > > > At the point where we perform this query we have exclusive access to the blaze graph process. > > It took over 4 hours, with approx. the first hour showing some change in the query execution stats, and then the last 3 hours showing no change in the stats (the status page in the NSS display is not very useful with these update queries). > After 4 hours I got bored. Cancel did not work. So I killed blazegraph and restarted. > > I then rewrote the code as follows. > > > I wrote a construct query: > > CONSTRUCT { > ?newS ?newP ?newO > } > WHERE { > graph <%(abox)s> { > ?oldS ?oldP ?oldO > } > GRAPH <x-eg:temporary-graph> { > ?oldS <x-eg:replaced-by> ?newS > } > GRAPH <x-eg:temporary-graph> { > ?oldP <x-eg:replaced-by> ?newP > } > { > GRAPH <x-eg:temporary-graph> { > ?oldO <x-eg:replaced-by> ?newO > } > } UNION { > graph <%(abox)s> { > ?oldS ?oldP ?oldO > } > FILTER ( isLiteral(?oldO) ) > BIND ( ?oldO as ?newO ) > } > } > > this created a temporary file. > > I replaced the DELETE part with > > DROP GRAPH <%(abox)s> > > and the INSERT with > > LOAD <file://%(tmpfile)s> INTO GRAPH <%(abox)s> > > ==== > > > The rewritten code took only a few minutes (less than 5 in total) > I was expecting some improvement, but not as much as I saw. > > My understanding is that each of the three operations is atomic and isolated, but I lost the guarantee linking the three (which I did not need since I had exclusive lock at a higher level). > > Was it the atomicity that cost so much? > > Jeremy > > > > > > > > |
From: Max O. <ma...@ni...> - 2015-04-15 20:55:07
|
Hi,

While the RDR extension would seem to fit my use case very well, it seems to be limited to a single statement. What I would need is to be able to describe contexts (as in the fourth element of a quad store). Something along the lines of:

@prefix : <http://bigdata.com> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
@prefix dct: <http://purl.org/dc/elements/1.1/> .

<<:bob foaf:age 25; foaf:name "Bobby".>> dct:creator <http://example.com/crawlers#c1> ;
    dct:source <http://example.net/bobby.html> .

Unfortunately, that doesn't seem to be supported. Or am I missing something?

Do I just need to forget about RDR in this case and figure out how to do this with named graphs and quads? It's really the query side I'm focusing on now.

Any help would be greatly appreciated.

-max |
From: Max O. <ma...@ni...> - 2015-04-15 20:39:37
|
Wonderful. Got it to work now. Thanks, -max On Thu, Apr 16, 2015 at 12:33 AM Brad Bebee <be...@sy...> wrote: > Max, > > There were two issues, which are below. Thank you for pointing out the > Wiki issue. It's been updated and corrected: > http://wiki.blazegraph.com/wiki/index.php/Reification_Done_Right#Reification_Done_Right > . Let us know how it works for you. > > Thanks, --Brad > > 1. The dublin core name space needed to be declared: > > @prefix : <http://bigdata.com> . > @prefix foaf: <http://xmlns.com/foaf/0.1/ > > > > @prefix dct: <http://purl.org/dc/elements/1.1/> . > > :bob foaf:name "Bob" . > <<:bob foaf:age 23>> dct:creator <http://example.com/crawlers#c1> ; > dct:source <http://example.net/homepage-listing.html> . > > 2. The RDR content-type needed to be specified in the CURL request. > > curl -D -H 'Content-Type: application/x-turtle-RDR' --upload-file bob.tut > -X POST ' > http://localhost:9999/bigdata/namespace/test/sparql/test/sparql?context-uri=http://example.org/tbox > ' > > > > On Wed, Apr 15, 2015 at 10:04 AM, Bryan Thompson <br...@sy...> wrote: > >> The RDR specific mime types are also defined here: >> http://wiki.blazegraph.com/wiki/index.php/Reification_Done_Right#RDF_Data_Interchange >> >> Thanks, >> Bryan >> >> ---- >> Bryan Thompson >> Chief Scientist & Founder >> SYSTAP, LLC >> 4501 Tower Road >> Greensboro, NC 27410 >> br...@sy... >> http://blazegraph.com >> http://blog.bigdata.com <http://bigdata.com> >> http://mapgraph.io >> >> Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance >> graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints >> APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new >> technology to use GPUs to accelerate data-parallel graph analytics. >> >> CONFIDENTIALITY NOTICE: This email and its contents and attachments are >> for the sole use of the intended recipient(s) and are confidential or >> proprietary to SYSTAP. Any unauthorized review, use, disclosure, >> dissemination or copying of this email or its contents or attachments is >> prohibited. If you have received this communication in error, please notify >> the sender by reply email and permanently delete all copies of the email >> and its contents and attachments. >> >> On Wed, Apr 15, 2015 at 10:03 AM, Bryan Thompson <br...@sy...> >> wrote: >> >>> Max, >>> >>> I've fixed the wiki. >>> >>> The problem is the Content-Type header. I have modified the servlet to >>> report back the Content-Type header in the error message. >>> >>> You are probably specifying Content-Type=application/x-turtle. When I >>> use this value I get the same error message. >>> >>> You need to specify *application/x-turtle-RDR* per >>> http://wiki.blazegraph.com/wiki/index.php/NanoSparqlServer#MIME_Types >>> >>> Thanks, >>> Bryan >>> ---- >>> Bryan Thompson >>> Chief Scientist & Founder >>> SYSTAP, LLC >>> 4501 Tower Road >>> Greensboro, NC 27410 >>> br...@sy... >>> http://blazegraph.com >>> http://blog.bigdata.com <http://bigdata.com> >>> http://mapgraph.io >>> >>> Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance >>> graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints >>> APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new >>> technology to use GPUs to accelerate data-parallel graph analytics. >>> >>> CONFIDENTIALITY NOTICE: This email and its contents and attachments >>> are for the sole use of the intended recipient(s) and are confidential or >>> proprietary to SYSTAP. 
Any unauthorized review, use, disclosure, >>> dissemination or copying of this email or its contents or attachments is >>> prohibited. If you have received this communication in error, please notify >>> the sender by reply email and permanently delete all copies of the email >>> and its contents and attachments. >>> >>> On Tue, Apr 14, 2015 at 10:00 PM, Max Ott <ma...@ni...> wrote: >>> >>>> Hi, >>>> >>>> I have trying to get the RDR extension working as this is the feature >>>> I'm most interested in. So I copied a simple statement from the web site >>>> into a file called 'bob.tut' >>>> >>>> @prefix : <http://bigdata.com> . >>>> @prefix foaf: <http://xmlns.com/foaf/0.1/> . >>>> >>>> :bob foaf:name "Bob" . >>>> <<:bob foaf:age 23>> dct:creator <http://example.com/crawlers#c1> ; >>>> dct:source < >>>> http://example.net/homepage-listing.html> . >>>> >>>> (BTW, the trailing dot on line 2 and ‘;’ on line 5 is missing in the >>>> wiki writeup). >>>> >>>> I then created a namespace “test” with RDR selected which gave me the >>>> following service description: >>>> >>>> …. >>>> <endpoint xmlns="http://www.w3.org/ns/sparql-service-description#" >>>> rdf:resource="http://localhost:9999/bigdata/namespace/test/sparql”/> >>>> … >>>> <feature xmlns=" >>>> http://www.w3.org/ns/sparql-service-description#" rdf:resource=" >>>> http://www.bigdata.com/rdf#/features/KB/Mode/Sids"/> >>>> <feature xmlns=" >>>> http://www.w3.org/ns/sparql-service-description#" rdf:resource=" >>>> http://www.bigdata.com/rdf#/features/KB/TruthMaintenance"/> >>>> >>>> If I now try to upload ‘bob’, I get the following error: >>>> >>>> $ curl -D- -H 'Content-Type: text/turtle' --upload-file bob.tut -X POST >>>> ' >>>> http://localhost:9999/bigdata/namespace/test/sparql/test/sparql?context-uri=http://example.org/tbox' >>>> >>>> HTTP/1.1 100 Continue >>>> >>>> HTTP/1.1 500 Server Error >>>> Content-Type: text/plain >>>> Transfer-Encoding: chunked >>>> Server: Jetty(9.2.3.v20140905) >>>> >>>> INSERT-WITH-BODY: baseURI= >>>> http://192.168.59.3:9999/bigdata/namespace/test/sparql, context-uri=[ >>>> http://example.org/tbox] >>>> java.util.concurrent.ExecutionException: >>>> org.openrdf.rio.RDFParseException: IRI included an unencoded space: '32' >>>> [line 5] >>>> …. >>>> >>>> Looks like the RDF parser didn’t get the memo that I’m sending RDR >>>> data, but I have no idea what I’m doing wrong, nor apparently what I'm >>>> doing - period :) >>>> >>>> Any help would be greatly appreciated. >>>> >>>> Cheers, >>>> -max >>>> >>>> >>>> >>>> ------------------------------------------------------------------------------ >>>> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT >>>> Develop your own process in accordance with the BPMN 2 standard >>>> Learn Process modeling best practices with Bonita BPM through live >>>> exercises >>>> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- >>>> event?utm_ >>>> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF >>>> _______________________________________________ >>>> Bigdata-developers mailing list >>>> Big...@li... 
>>>> https://lists.sourceforge.net/lists/listinfo/bigdata-developers >>>> >>>> >>> >> >> >> ------------------------------------------------------------------------------ >> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT >> Develop your own process in accordance with the BPMN 2 standard >> Learn Process modeling best practices with Bonita BPM through live >> exercises >> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- >> event?utm_ >> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF >> _______________________________________________ >> Bigdata-developers mailing list >> Big...@li... >> https://lists.sourceforge.net/lists/listinfo/bigdata-developers >> >> > > > -- > _______________ > Brad Bebee > Managing Partner > SYSTAP, LLC > e: be...@sy... > m: 202.642.7961 > f: 571.367.5000 > w: www.systap.com > > > Blazegraph™ <http://www.blazegraph.com> is our ultra high-performance > graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints > APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new > technology to use GPUs to accelerate data-parallel graph analytics. > > CONFIDENTIALITY NOTICE: This email and its contents and attachments are > for the sole use of the intended recipient(s) and are confidential or > proprietary to SYSTAP, LLC. Any unauthorized review, use, disclosure, > dissemination or copying of this email or its contents or attachments is > prohibited. If you have received this communication in error, please notify > the sender by reply email and permanently delete all copies of the email > and its contents and attachments. > |
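The same fix, for anyone driving the endpoint from Java rather than curl — a minimal JDK-only sketch. The file name (bob.tut), the "test" namespace, and the context URI are taken from the thread above; the single (non-doubled) endpoint path is assumed from the service description quoted earlier. The only point being illustrated is the RDR-aware Content-Type header:

import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.file.Files;
import java.nio.file.Paths;

public class RdrUploadExample {
    public static void main(String[] args) throws Exception {
        // Endpoint and context URI as discussed in the thread above.
        final URL url = new URL("http://localhost:9999/bigdata/namespace/test/sparql"
                + "?context-uri=http://example.org/tbox");
        final HttpURLConnection conn = (HttpURLConnection) url.openConnection();
        conn.setRequestMethod("POST");
        // The important part: the RDR MIME type, not plain text/turtle.
        conn.setRequestProperty("Content-Type", "application/x-turtle-RDR");
        conn.setDoOutput(true);

        try (OutputStream out = conn.getOutputStream()) {
            out.write(Files.readAllBytes(Paths.get("bob.tut")));
        }
        System.out.println("HTTP " + conn.getResponseCode());
        conn.disconnect();
    }
}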
From: Bryan T. <br...@sy...> - 2015-04-15 15:09:14
|
You can run any given test suite using "ant -DtestName=<classname> junit". I would generally suggest the appropriate TestAll class near what you have touched. Your pain will be greatly reduced once we get #168 (mavenization) a little further along. This is moving along pretty quickly right now and should be in 1.5.2 a little later this month. Bryan ---- Bryan Thompson Chief Scientist & Founder SYSTAP, LLC 4501 Tower Road Greensboro, NC 27410 br...@sy... http://blazegraph.com http://blog.bigdata.com <http://bigdata.com> http://mapgraph.io Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new technology to use GPUs to accelerate data-parallel graph analytics. CONFIDENTIALITY NOTICE: This email and its contents and attachments are for the sole use of the intended recipient(s) and are confidential or proprietary to SYSTAP. Any unauthorized review, use, disclosure, dissemination or copying of this email or its contents or attachments is prohibited. If you have received this communication in error, please notify the sender by reply email and permanently delete all copies of the email and its contents and attachments. On Wed, Apr 15, 2015 at 10:58 AM, Nikolas Everett <ni...@gm...> wrote: > The tests I ran pass. I imagine you should squash the commits into one if > possible. > > I'm not sure how to run all the tests with ant. Its always my complaint > with ant :) I have to look through the build.xml to find the command to run > tests. I couldn't find it in this case. > > I'll give it a more thorough shakedown this afternoon - backporting it and > getting some more use out of it. > > Nik > > On Wed, Apr 15, 2015 at 10:46 AM, Bryan Thompson <br...@sy...> wrote: > >> I've attempted the merge to master. It looks ok for me locally and I have >> committed the changes. Can you verify that it looks good for you? >> >> Thanks, >> Bryan >> >> ---- >> Bryan Thompson >> Chief Scientist & Founder >> SYSTAP, LLC >> 4501 Tower Road >> Greensboro, NC 27410 >> br...@sy... >> http://blazegraph.com >> http://blog.bigdata.com <http://bigdata.com> >> http://mapgraph.io >> >> Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance >> graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints >> APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new >> technology to use GPUs to accelerate data-parallel graph analytics. >> >> CONFIDENTIALITY NOTICE: This email and its contents and attachments are >> for the sole use of the intended recipient(s) and are confidential or >> proprietary to SYSTAP. Any unauthorized review, use, disclosure, >> dissemination or copying of this email or its contents or attachments is >> prohibited. If you have received this communication in error, please notify >> the sender by reply email and permanently delete all copies of the email >> and its contents and attachments. >> >> On Wed, Apr 15, 2015 at 10:30 AM, Nikolas Everett <ni...@gm...> >> wrote: >> >>> Sourceforge wasn't letting me update the merge request in any way I'm >>> used to so I made another one: >>> https://sourceforge.net/p/bigdata/git/merge-requests/2/ >>> >>> I spent some time poking around documentation but didn't look too hard >>> so I'm sure I'm doing it wrong. If you know the right way please let me >>> know. 
>>> >>> The trouble was that when I tried to push a new commit to the branch for >>> my merge request sourceforge rejected it. It looks like sourceforge had >>> amended my commit bringing its branch out of whack with my local branch. >>> That combined with sourceforge being slow and not seeing the way to comment >>> on individual lines is getting me grumpy about sourceforge merge requests. >>> >>> Nik >>> >>> On Tue, Apr 14, 2015 at 12:44 PM, Nikolas Everett <ni...@gm...> >>> wrote: >>> >>>> I didn't! I suspect sourceforge isn't emailing me. I'll have a look >>>> at it this afternoon. >>>> >>>> Nik >>>> >>>> On Tue, Apr 14, 2015 at 12:29 PM, Bryan Thompson <br...@sy...> >>>> wrote: >>>> >>>>> Nik, >>>>> >>>>> Did you see the feedback on that merge request? >>>>> >>>>> Thanks, >>>>> Bryan >>>>> >>>>> ---- >>>>> Bryan Thompson >>>>> Chief Scientist & Founder >>>>> SYSTAP, LLC >>>>> 4501 Tower Road >>>>> Greensboro, NC 27410 >>>>> br...@sy... >>>>> http://blazegraph.com >>>>> http://blog.bigdata.com <http://bigdata.com> >>>>> http://mapgraph.io >>>>> >>>>> Blazegraph™ <http://www.blazegraph.com/> is our ultra >>>>> high-performance graph database that supports both RDF/SPARQL and >>>>> Tinkerpop/Blueprints APIs. MapGraph™ <http://www.systap.com/mapgraph> is >>>>> our disruptive new technology to use GPUs to accelerate data-parallel graph >>>>> analytics. >>>>> >>>>> CONFIDENTIALITY NOTICE: This email and its contents and attachments >>>>> are for the sole use of the intended recipient(s) and are confidential or >>>>> proprietary to SYSTAP. Any unauthorized review, use, disclosure, >>>>> dissemination or copying of this email or its contents or attachments is >>>>> prohibited. If you have received this communication in error, please notify >>>>> the sender by reply email and permanently delete all copies of the email >>>>> and its contents and attachments. >>>>> >>>>> On Mon, Apr 13, 2015 at 11:09 AM, Nikolas Everett <ni...@gm...> >>>>> wrote: >>>>> >>>>>> I filed 1179 <http://trac.bigdata.com/ticket/1179#ticket> a couple >>>>>> of days ago and got together a patch this morning then read >>>>>> http://wiki.blazegraph.com/wiki/index.php/Contributors and saw that >>>>>> I should email here about it. So here goes: >>>>>> >>>>>> I want inline URIs to have the same power that inline values do. >>>>>> Right now inline values can be modified on the way into and out of the >>>>>> index. Inline URIs can only be modified on the way into the index. That >>>>>> leaves you pretty limited in what you can represent with them. Its OK if >>>>>> you can design the URI scheme around the limitation but we can't. >>>>>> >>>>>> I've put my patch as a gist >>>>>> <https://gist.github.com/nik9000/1dc5248f6035349ca2c1> and I've >>>>>> created a "merge request" >>>>>> <https://sourceforge.net/p/bigdata/git/merge-requests/1/> for it. It >>>>>> gives the InlineURIHandler that built the inline URI a crack at building >>>>>> the localName from the inlined value. The cost is an extra HashMap lookup >>>>>> whenever the uri is un-inlined and the default behavior is unchanged. It >>>>>> doesn't change the on disk representation either. 
>>>>>> >>>>>> Nik >>>>>> >>>>>> >>>>>> >>>>>> ------------------------------------------------------------------------------ >>>>>> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT >>>>>> Develop your own process in accordance with the BPMN 2 standard >>>>>> Learn Process modeling best practices with Bonita BPM through live >>>>>> exercises >>>>>> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- >>>>>> event?utm_ >>>>>> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF >>>>>> _______________________________________________ >>>>>> Bigdata-developers mailing list >>>>>> Big...@li... >>>>>> https://lists.sourceforge.net/lists/listinfo/bigdata-developers >>>>>> >>>>>> >>>>> >>>> >>> >> > |
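Since the inline-URI proposal appears only deep in the quoting above, here is a conceptual restatement. The names below are hypothetical — this is not the InlineURIHandler API — but it captures the asymmetry being described: inlining maps a URI's localName to a compact value, and the proposed change also lets the same handler turn that value back into the localName when the URI is materialized from the index.

import java.util.Optional;

// Hypothetical two-way codec between a URI localName and an inline value.
interface InlineLocalNameCodec {
    // Parse the localName into a compact inline value, if this handler applies.
    Optional<Long> toInlineValue(String localName);

    // Reconstruct the localName from the inline value when the URI is read back
    // out of the index (the direction that was previously fixed by the framework).
    String fromInlineValue(long value);
}

// Example: URIs of the form <namespace>item-00042 inlined as the number 42.
final class PaddedItemCodec implements InlineLocalNameCodec {
    @Override
    public Optional<Long> toInlineValue(final String localName) {
        if (!localName.startsWith("item-")) {
            return Optional.empty();
        }
        try {
            return Optional.of(Long.parseLong(localName.substring("item-".length())));
        } catch (NumberFormatException e) {
            return Optional.empty();
        }
    }

    @Override
    public String fromInlineValue(final long value) {
        // Without a custom decode step, the round trip would lose the zero padding.
        return String.format("item-%05d", value);
    }
}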
From: Bryan T. <br...@sy...> - 2015-04-15 15:05:58
|
Max, I've fixed the wiki. The problem is the Content-Type header. I have modified the servlet to report back the Content-Type header in the error message. You are probably specifying Content-Type=application/x-turtle. When I use this value I get the same error message. You need to specify *application/x-turtle-RDR* per http://wiki.blazegraph.com/wiki/index.php/NanoSparqlServer#MIME_Types Thanks, Bryan ---- Bryan Thompson Chief Scientist & Founder SYSTAP, LLC 4501 Tower Road Greensboro, NC 27410 br...@sy... http://blazegraph.com http://blog.bigdata.com <http://bigdata.com> http://mapgraph.io Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new technology to use GPUs to accelerate data-parallel graph analytics. CONFIDENTIALITY NOTICE: This email and its contents and attachments are for the sole use of the intended recipient(s) and are confidential or proprietary to SYSTAP. Any unauthorized review, use, disclosure, dissemination or copying of this email or its contents or attachments is prohibited. If you have received this communication in error, please notify the sender by reply email and permanently delete all copies of the email and its contents and attachments. On Tue, Apr 14, 2015 at 10:00 PM, Max Ott <ma...@ni...> wrote: > Hi, > > I have trying to get the RDR extension working as this is the feature I'm > most interested in. So I copied a simple statement from the web site into a > file called 'bob.tut' > > @prefix : <http://bigdata.com> . > @prefix foaf: <http://xmlns.com/foaf/0.1/> . > > :bob foaf:name "Bob" . > <<:bob foaf:age 23>> dct:creator <http://example.com/crawlers#c1> ; > dct:source <http://example.net/homepage-listing.html> > . > > (BTW, the trailing dot on line 2 and ‘;’ on line 5 is missing in the wiki > writeup). > > I then created a namespace “test” with RDR selected which gave me the > following service description: > > …. > <endpoint xmlns="http://www.w3.org/ns/sparql-service-description#" > rdf:resource="http://localhost:9999/bigdata/namespace/test/sparql”/> > … > <feature xmlns="http://www.w3.org/ns/sparql-service-description#" > rdf:resource="http://www.bigdata.com/rdf#/features/KB/Mode/Sids"/> > <feature xmlns="http://www.w3.org/ns/sparql-service-description#" > rdf:resource="http://www.bigdata.com/rdf#/features/KB/TruthMaintenance"/> > > If I now try to upload ‘bob’, I get the following error: > > $ curl -D- -H 'Content-Type: text/turtle' --upload-file bob.tut -X POST ' > http://localhost:9999/bigdata/namespace/test/sparql/test/sparql?context-uri=http://example.org/tbox' > > HTTP/1.1 100 Continue > > HTTP/1.1 500 Server Error > Content-Type: text/plain > Transfer-Encoding: chunked > Server: Jetty(9.2.3.v20140905) > > INSERT-WITH-BODY: baseURI= > http://192.168.59.3:9999/bigdata/namespace/test/sparql, context-uri=[ > http://example.org/tbox] > java.util.concurrent.ExecutionException: > org.openrdf.rio.RDFParseException: IRI included an unencoded space: '32' > [line 5] > …. > > Looks like the RDF parser didn’t get the memo that I’m sending RDR data, > but I have no idea what I’m doing wrong, nor apparently what I'm doing - > period :) > > Any help would be greatly appreciated. 
> > Cheers, > -max > > > > ------------------------------------------------------------------------------ > BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT > Develop your own process in accordance with the BPMN 2 standard > Learn Process modeling best practices with Bonita BPM through live > exercises > http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- > event?utm_ > source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF > _______________________________________________ > Bigdata-developers mailing list > Big...@li... > https://lists.sourceforge.net/lists/listinfo/bigdata-developers > > |
From: Nikolas E. <ni...@gm...> - 2015-04-15 14:59:24
|
The tests I ran pass. I imagine you should squash the commits into one if possible. I'm not sure how to run all the tests with ant. Its always my complaint with ant :) I have to look through the build.xml to find the command to run tests. I couldn't find it in this case. I'll give it a more thorough shakedown this afternoon - backporting it and getting some more use out of it. Nik On Wed, Apr 15, 2015 at 10:46 AM, Bryan Thompson <br...@sy...> wrote: > I've attempted the merge to master. It looks ok for me locally and I have > committed the changes. Can you verify that it looks good for you? > > Thanks, > Bryan > > ---- > Bryan Thompson > Chief Scientist & Founder > SYSTAP, LLC > 4501 Tower Road > Greensboro, NC 27410 > br...@sy... > http://blazegraph.com > http://blog.bigdata.com <http://bigdata.com> > http://mapgraph.io > > Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance > graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints > APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new > technology to use GPUs to accelerate data-parallel graph analytics. > > CONFIDENTIALITY NOTICE: This email and its contents and attachments are > for the sole use of the intended recipient(s) and are confidential or > proprietary to SYSTAP. Any unauthorized review, use, disclosure, > dissemination or copying of this email or its contents or attachments is > prohibited. If you have received this communication in error, please notify > the sender by reply email and permanently delete all copies of the email > and its contents and attachments. > > On Wed, Apr 15, 2015 at 10:30 AM, Nikolas Everett <ni...@gm...> > wrote: > >> Sourceforge wasn't letting me update the merge request in any way I'm >> used to so I made another one: >> https://sourceforge.net/p/bigdata/git/merge-requests/2/ >> >> I spent some time poking around documentation but didn't look too hard so >> I'm sure I'm doing it wrong. If you know the right way please let me know. >> >> The trouble was that when I tried to push a new commit to the branch for >> my merge request sourceforge rejected it. It looks like sourceforge had >> amended my commit bringing its branch out of whack with my local branch. >> That combined with sourceforge being slow and not seeing the way to comment >> on individual lines is getting me grumpy about sourceforge merge requests. >> >> Nik >> >> On Tue, Apr 14, 2015 at 12:44 PM, Nikolas Everett <ni...@gm...> >> wrote: >> >>> I didn't! I suspect sourceforge isn't emailing me. I'll have a look at >>> it this afternoon. >>> >>> Nik >>> >>> On Tue, Apr 14, 2015 at 12:29 PM, Bryan Thompson <br...@sy...> >>> wrote: >>> >>>> Nik, >>>> >>>> Did you see the feedback on that merge request? >>>> >>>> Thanks, >>>> Bryan >>>> >>>> ---- >>>> Bryan Thompson >>>> Chief Scientist & Founder >>>> SYSTAP, LLC >>>> 4501 Tower Road >>>> Greensboro, NC 27410 >>>> br...@sy... >>>> http://blazegraph.com >>>> http://blog.bigdata.com <http://bigdata.com> >>>> http://mapgraph.io >>>> >>>> Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance >>>> graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints >>>> APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive >>>> new technology to use GPUs to accelerate data-parallel graph analytics. >>>> >>>> CONFIDENTIALITY NOTICE: This email and its contents and attachments >>>> are for the sole use of the intended recipient(s) and are confidential or >>>> proprietary to SYSTAP. 
Any unauthorized review, use, disclosure, >>>> dissemination or copying of this email or its contents or attachments is >>>> prohibited. If you have received this communication in error, please notify >>>> the sender by reply email and permanently delete all copies of the email >>>> and its contents and attachments. >>>> >>>> On Mon, Apr 13, 2015 at 11:09 AM, Nikolas Everett <ni...@gm...> >>>> wrote: >>>> >>>>> I filed 1179 <http://trac.bigdata.com/ticket/1179#ticket> a couple of >>>>> days ago and got together a patch this morning then read >>>>> http://wiki.blazegraph.com/wiki/index.php/Contributors and saw that I >>>>> should email here about it. So here goes: >>>>> >>>>> I want inline URIs to have the same power that inline values do. >>>>> Right now inline values can be modified on the way into and out of the >>>>> index. Inline URIs can only be modified on the way into the index. That >>>>> leaves you pretty limited in what you can represent with them. Its OK if >>>>> you can design the URI scheme around the limitation but we can't. >>>>> >>>>> I've put my patch as a gist >>>>> <https://gist.github.com/nik9000/1dc5248f6035349ca2c1> and I've >>>>> created a "merge request" >>>>> <https://sourceforge.net/p/bigdata/git/merge-requests/1/> for it. It >>>>> gives the InlineURIHandler that built the inline URI a crack at building >>>>> the localName from the inlined value. The cost is an extra HashMap lookup >>>>> whenever the uri is un-inlined and the default behavior is unchanged. It >>>>> doesn't change the on disk representation either. >>>>> >>>>> Nik >>>>> >>>>> >>>>> >>>>> ------------------------------------------------------------------------------ >>>>> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT >>>>> Develop your own process in accordance with the BPMN 2 standard >>>>> Learn Process modeling best practices with Bonita BPM through live >>>>> exercises >>>>> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- >>>>> event?utm_ >>>>> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF >>>>> _______________________________________________ >>>>> Bigdata-developers mailing list >>>>> Big...@li... >>>>> https://lists.sourceforge.net/lists/listinfo/bigdata-developers >>>>> >>>>> >>>> >>> >> > |
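A minimal sketch of the inline-URI round trip described in the patch above. The class and method names here (InlineLocalNameCodec, localNameToValue, valueToLocalName) are illustrative assumptions, not the actual InlineURIHandler hook added by the merge request; the point is only to show the two directions the patch is about: localName to inlined value on the way into the index, and inlined value back to localName on the way out.

    import java.math.BigInteger;

    // Sketch of an inline URI codec that works in both directions.
    public class InlineLocalNameCodec {

        // Namespace owned by this codec; hypothetical example value.
        private final String namespace = "http://example.org/item/";

        // Into the index: localName -> compact inlined value, or null if the
        // localName does not fit the scheme (fall back to non-inline storage).
        public BigInteger localNameToValue(final String localName) {
            try {
                return new BigInteger(localName);
            } catch (NumberFormatException e) {
                return null;
            }
        }

        // Out of the index: inlined value -> localName. This is the direction
        // the patch adds for URIs; without it the original localName cannot be
        // reconstructed when the URI is un-inlined.
        public String valueToLocalName(final BigInteger value) {
            return value.toString();
        }

        // Round trip: the full URI survives inlining and un-inlining.
        public static void main(final String[] args) {
            final InlineLocalNameCodec codec = new InlineLocalNameCodec();
            final BigInteger inlined = codec.localNameToValue("42");
            System.out.println(codec.namespace + codec.valueToLocalName(inlined));
        }
    }

The "extra HashMap lookup" mentioned in the patch description corresponds to finding the handler that owns the namespace before valueToLocalName can be called on the way out; the default behavior and on-disk representation are unchanged.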
From: Bryan T. <br...@sy...> - 2015-04-15 14:47:29
|
I've attempted the merge to master. It looks ok for me locally and I have committed the changes. Can you verify that it looks good for you? Thanks, Bryan ---- Bryan Thompson Chief Scientist & Founder SYSTAP, LLC 4501 Tower Road Greensboro, NC 27410 br...@sy... http://blazegraph.com http://blog.bigdata.com <http://bigdata.com> http://mapgraph.io Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new technology to use GPUs to accelerate data-parallel graph analytics. CONFIDENTIALITY NOTICE: This email and its contents and attachments are for the sole use of the intended recipient(s) and are confidential or proprietary to SYSTAP. Any unauthorized review, use, disclosure, dissemination or copying of this email or its contents or attachments is prohibited. If you have received this communication in error, please notify the sender by reply email and permanently delete all copies of the email and its contents and attachments. On Wed, Apr 15, 2015 at 10:30 AM, Nikolas Everett <ni...@gm...> wrote: > Sourceforge wasn't letting me update the merge request in any way I'm used > to so I made another one: > https://sourceforge.net/p/bigdata/git/merge-requests/2/ > > I spent some time poking around documentation but didn't look too hard so > I'm sure I'm doing it wrong. If you know the right way please let me know. > > The trouble was that when I tried to push a new commit to the branch for > my merge request sourceforge rejected it. It looks like sourceforge had > amended my commit bringing its branch out of whack with my local branch. > That combined with sourceforge being slow and not seeing the way to comment > on individual lines is getting me grumpy about sourceforge merge requests. > > Nik > > On Tue, Apr 14, 2015 at 12:44 PM, Nikolas Everett <ni...@gm...> > wrote: > >> I didn't! I suspect sourceforge isn't emailing me. I'll have a look at >> it this afternoon. >> >> Nik >> >> On Tue, Apr 14, 2015 at 12:29 PM, Bryan Thompson <br...@sy...> >> wrote: >> >>> Nik, >>> >>> Did you see the feedback on that merge request? >>> >>> Thanks, >>> Bryan >>> >>> ---- >>> Bryan Thompson >>> Chief Scientist & Founder >>> SYSTAP, LLC >>> 4501 Tower Road >>> Greensboro, NC 27410 >>> br...@sy... >>> http://blazegraph.com >>> http://blog.bigdata.com <http://bigdata.com> >>> http://mapgraph.io >>> >>> Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance >>> graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints >>> APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new >>> technology to use GPUs to accelerate data-parallel graph analytics. >>> >>> CONFIDENTIALITY NOTICE: This email and its contents and attachments >>> are for the sole use of the intended recipient(s) and are confidential or >>> proprietary to SYSTAP. Any unauthorized review, use, disclosure, >>> dissemination or copying of this email or its contents or attachments is >>> prohibited. If you have received this communication in error, please notify >>> the sender by reply email and permanently delete all copies of the email >>> and its contents and attachments. 
>>> >>> On Mon, Apr 13, 2015 at 11:09 AM, Nikolas Everett <ni...@gm...> >>> wrote: >>> >>>> I filed 1179 <http://trac.bigdata.com/ticket/1179#ticket> a couple of >>>> days ago and got together a patch this morning then read >>>> http://wiki.blazegraph.com/wiki/index.php/Contributors and saw that I >>>> should email here about it. So here goes: >>>> >>>> I want inline URIs to have the same power that inline values do. Right >>>> now inline values can be modified on the way into and out of the index. >>>> Inline URIs can only be modified on the way into the index. That leaves you >>>> pretty limited in what you can represent with them. Its OK if you can >>>> design the URI scheme around the limitation but we can't. >>>> >>>> I've put my patch as a gist >>>> <https://gist.github.com/nik9000/1dc5248f6035349ca2c1> and I've >>>> created a "merge request" >>>> <https://sourceforge.net/p/bigdata/git/merge-requests/1/> for it. It >>>> gives the InlineURIHandler that built the inline URI a crack at building >>>> the localName from the inlined value. The cost is an extra HashMap lookup >>>> whenever the uri is un-inlined and the default behavior is unchanged. It >>>> doesn't change the on disk representation either. >>>> >>>> Nik >>>> >>>> >>>> >>>> ------------------------------------------------------------------------------ >>>> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT >>>> Develop your own process in accordance with the BPMN 2 standard >>>> Learn Process modeling best practices with Bonita BPM through live >>>> exercises >>>> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- >>>> event?utm_ >>>> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF >>>> _______________________________________________ >>>> Bigdata-developers mailing list >>>> Big...@li... >>>> https://lists.sourceforge.net/lists/listinfo/bigdata-developers >>>> >>>> >>> >> > |
From: Brad B. <be...@sy...> - 2015-04-15 14:33:23
|
Max, There were two issues, which are below. Thank you for pointing out the Wiki issue. It's been updated and corrected: http://wiki.blazegraph.com/wiki/index.php/Reification_Done_Right#Reification_Done_Right . Let us know how it works for you. Thanks, --Brad 1. The Dublin Core namespace needed to be declared: @prefix : <http://bigdata.com> . @prefix foaf: <http://xmlns.com/foaf/0.1/> . @prefix dct: <http://purl.org/dc/elements/1.1/> . :bob foaf:name "Bob" . <<:bob foaf:age 23>> dct:creator <http://example.com/crawlers#c1> ; dct:source <http://example.net/homepage-listing.html> . 2. The RDR Content-Type needed to be specified in the curl request. curl -D- -H 'Content-Type: application/x-turtle-RDR' --upload-file bob.tut -X POST ' http://localhost:9999/bigdata/namespace/test/sparql/test/sparql?context-uri=http://example.org/tbox ' On Wed, Apr 15, 2015 at 10:04 AM, Bryan Thompson <br...@sy...> wrote: > The RDR specific mime types are also defined here: > http://wiki.blazegraph.com/wiki/index.php/Reification_Done_Right#RDF_Data_Interchange > > Thanks, > Bryan > > ---- > Bryan Thompson > Chief Scientist & Founder > SYSTAP, LLC > 4501 Tower Road > Greensboro, NC 27410 > br...@sy... > http://blazegraph.com > http://blog.bigdata.com <http://bigdata.com> > http://mapgraph.io > > Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance > graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints > APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new > technology to use GPUs to accelerate data-parallel graph analytics. > > CONFIDENTIALITY NOTICE: This email and its contents and attachments are > for the sole use of the intended recipient(s) and are confidential or > proprietary to SYSTAP. Any unauthorized review, use, disclosure, > dissemination or copying of this email or its contents or attachments is > prohibited. If you have received this communication in error, please notify > the sender by reply email and permanently delete all copies of the email > and its contents and attachments. > > On Wed, Apr 15, 2015 at 10:03 AM, Bryan Thompson <br...@sy...> wrote: > >> Max, >> >> I've fixed the wiki. >> >> The problem is the Content-Type header. I have modified the servlet to >> report back the Content-Type header in the error message. >> >> You are probably specifying Content-Type=application/x-turtle. When I >> use this value I get the same error message. >> >> You need to specify *application/x-turtle-RDR* per >> http://wiki.blazegraph.com/wiki/index.php/NanoSparqlServer#MIME_Types >> >> Thanks, >> Bryan >> ---- >> Bryan Thompson >> Chief Scientist & Founder >> SYSTAP, LLC >> 4501 Tower Road >> Greensboro, NC 27410 >> br...@sy... >> http://blazegraph.com >> http://blog.bigdata.com <http://bigdata.com> >> http://mapgraph.io >> >> Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance >> graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints >> APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new >> technology to use GPUs to accelerate data-parallel graph analytics. >> >> CONFIDENTIALITY NOTICE: This email and its contents and attachments are >> for the sole use of the intended recipient(s) and are confidential or >> proprietary to SYSTAP. Any unauthorized review, use, disclosure, >> dissemination or copying of this email or its contents or attachments is >> prohibited. 
If you have received this communication in error, please notify >> the sender by reply email and permanently delete all copies of the email >> and its contents and attachments. >> >> On Tue, Apr 14, 2015 at 10:00 PM, Max Ott <ma...@ni...> wrote: >> >>> Hi, >>> >>> I have trying to get the RDR extension working as this is the feature >>> I'm most interested in. So I copied a simple statement from the web site >>> into a file called 'bob.tut' >>> >>> @prefix : <http://bigdata.com> . >>> @prefix foaf: <http://xmlns.com/foaf/0.1/> . >>> >>> :bob foaf:name "Bob" . >>> <<:bob foaf:age 23>> dct:creator <http://example.com/crawlers#c1> ; >>> dct:source < >>> http://example.net/homepage-listing.html> . >>> >>> (BTW, the trailing dot on line 2 and ‘;’ on line 5 is missing in the >>> wiki writeup). >>> >>> I then created a namespace “test” with RDR selected which gave me the >>> following service description: >>> >>> …. >>> <endpoint xmlns="http://www.w3.org/ns/sparql-service-description#" >>> rdf:resource="http://localhost:9999/bigdata/namespace/test/sparql”/> >>> … >>> <feature xmlns="http://www.w3.org/ns/sparql-service-description#" >>> rdf:resource="http://www.bigdata.com/rdf#/features/KB/Mode/Sids"/> >>> <feature xmlns="http://www.w3.org/ns/sparql-service-description#" >>> rdf:resource="http://www.bigdata.com/rdf#/features/KB/TruthMaintenance >>> "/> >>> >>> If I now try to upload ‘bob’, I get the following error: >>> >>> $ curl -D- -H 'Content-Type: text/turtle' --upload-file bob.tut -X POST ' >>> http://localhost:9999/bigdata/namespace/test/sparql/test/sparql?context-uri=http://example.org/tbox' >>> >>> HTTP/1.1 100 Continue >>> >>> HTTP/1.1 500 Server Error >>> Content-Type: text/plain >>> Transfer-Encoding: chunked >>> Server: Jetty(9.2.3.v20140905) >>> >>> INSERT-WITH-BODY: baseURI= >>> http://192.168.59.3:9999/bigdata/namespace/test/sparql, context-uri=[ >>> http://example.org/tbox] >>> java.util.concurrent.ExecutionException: >>> org.openrdf.rio.RDFParseException: IRI included an unencoded space: '32' >>> [line 5] >>> …. >>> >>> Looks like the RDF parser didn’t get the memo that I’m sending RDR data, >>> but I have no idea what I’m doing wrong, nor apparently what I'm doing - >>> period :) >>> >>> Any help would be greatly appreciated. >>> >>> Cheers, >>> -max >>> >>> >>> >>> ------------------------------------------------------------------------------ >>> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT >>> Develop your own process in accordance with the BPMN 2 standard >>> Learn Process modeling best practices with Bonita BPM through live >>> exercises >>> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- >>> event?utm_ >>> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF >>> _______________________________________________ >>> Bigdata-developers mailing list >>> Big...@li... >>> https://lists.sourceforge.net/lists/listinfo/bigdata-developers >>> >>> >> > > > ------------------------------------------------------------------------------ > BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT > Develop your own process in accordance with the BPMN 2 standard > Learn Process modeling best practices with Bonita BPM through live > exercises > http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- > event?utm_ > source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF > _______________________________________________ > Bigdata-developers mailing list > Big...@li... 
> https://lists.sourceforge.net/lists/listinfo/bigdata-developers > > -- _______________ Brad Bebee Managing Partner SYSTAP, LLC e: be...@sy... m: 202.642.7961 f: 571.367.5000 w: www.systap.com Blazegraph™ <http://www.blazegraph.com> is our ultra high-performance graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new technology to use GPUs to accelerate data-parallel graph analytics. CONFIDENTIALITY NOTICE: This email and its contents and attachments are for the sole use of the intended recipient(s) and are confidential or proprietary to SYSTAP, LLC. Any unauthorized review, use, disclosure, dissemination or copying of this email or its contents or attachments is prohibited. If you have received this communication in error, please notify the sender by reply email and permanently delete all copies of the email and its contents and attachments. |
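The same Content-Type fix can also be applied from plain JDK code rather than curl. This is only a sketch: the class name PostRdrFile is made up, and the endpoint, file name, and MIME type are simply the ones that appear in this thread (the .../namespace/test/sparql endpoint from the service description, bob.tut, and application/x-turtle-RDR).

    import java.io.IOException;
    import java.io.OutputStream;
    import java.net.HttpURLConnection;
    import java.net.URL;
    import java.nio.file.Files;
    import java.nio.file.Paths;

    // Sketch: POST a Turtle-RDR file to the NanoSparqlServer endpoint with the
    // Content-Type that selects the RDR-aware parser, mirroring the curl command above.
    public class PostRdrFile {
        public static void main(final String[] args) throws IOException {
            final byte[] body = Files.readAllBytes(Paths.get("bob.tut"));
            final URL url = new URL(
                    "http://localhost:9999/bigdata/namespace/test/sparql"
                    + "?context-uri=http://example.org/tbox");
            final HttpURLConnection con = (HttpURLConnection) url.openConnection();
            con.setRequestMethod("POST");
            // Without this header the plain Turtle parser is chosen and the
            // << ... >> statements fail with an RDFParseException.
            con.setRequestProperty("Content-Type", "application/x-turtle-RDR");
            con.setDoOutput(true);
            try (OutputStream out = con.getOutputStream()) {
                out.write(body);
            }
            System.out.println("HTTP " + con.getResponseCode());
        }
    }

Sending the same bytes with Content-Type text/turtle or application/x-turtle reproduces the RDFParseException shown earlier in the thread, which is how the missing header shows up in practice.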
From: Nikolas E. <ni...@gm...> - 2015-04-15 14:31:24
|
Sourceforge wasn't letting me update the merge request in any way I'm used to so I made another one: https://sourceforge.net/p/bigdata/git/merge-requests/2/ I spent some time poking around documentation but didn't look too hard so I'm sure I'm doing it wrong. If you know the right way please let me know. The trouble was that when I tried to push a new commit to the branch for my merge request sourceforge rejected it. It looks like sourceforge had amended my commit bringing its branch out of whack with my local branch. That combined with sourceforge being slow and not seeing the way to comment on individual lines is getting me grumpy about sourceforge merge requests. Nik On Tue, Apr 14, 2015 at 12:44 PM, Nikolas Everett <ni...@gm...> wrote: > I didn't! I suspect sourceforge isn't emailing me. I'll have a look at > it this afternoon. > > Nik > > On Tue, Apr 14, 2015 at 12:29 PM, Bryan Thompson <br...@sy...> wrote: > >> Nik, >> >> Did you see the feedback on that merge request? >> >> Thanks, >> Bryan >> >> ---- >> Bryan Thompson >> Chief Scientist & Founder >> SYSTAP, LLC >> 4501 Tower Road >> Greensboro, NC 27410 >> br...@sy... >> http://blazegraph.com >> http://blog.bigdata.com <http://bigdata.com> >> http://mapgraph.io >> >> Blazegraph™ <http://www.blazegraph.com/> is our ultra high-performance >> graph database that supports both RDF/SPARQL and Tinkerpop/Blueprints >> APIs. MapGraph™ <http://www.systap.com/mapgraph> is our disruptive new >> technology to use GPUs to accelerate data-parallel graph analytics. >> >> CONFIDENTIALITY NOTICE: This email and its contents and attachments are >> for the sole use of the intended recipient(s) and are confidential or >> proprietary to SYSTAP. Any unauthorized review, use, disclosure, >> dissemination or copying of this email or its contents or attachments is >> prohibited. If you have received this communication in error, please notify >> the sender by reply email and permanently delete all copies of the email >> and its contents and attachments. >> >> On Mon, Apr 13, 2015 at 11:09 AM, Nikolas Everett <ni...@gm...> >> wrote: >> >>> I filed 1179 <http://trac.bigdata.com/ticket/1179#ticket> a couple of >>> days ago and got together a patch this morning then read >>> http://wiki.blazegraph.com/wiki/index.php/Contributors and saw that I >>> should email here about it. So here goes: >>> >>> I want inline URIs to have the same power that inline values do. Right >>> now inline values can be modified on the way into and out of the index. >>> Inline URIs can only be modified on the way into the index. That leaves you >>> pretty limited in what you can represent with them. Its OK if you can >>> design the URI scheme around the limitation but we can't. >>> >>> I've put my patch as a gist >>> <https://gist.github.com/nik9000/1dc5248f6035349ca2c1> and I've created >>> a "merge request" >>> <https://sourceforge.net/p/bigdata/git/merge-requests/1/> for it. It >>> gives the InlineURIHandler that built the inline URI a crack at building >>> the localName from the inlined value. The cost is an extra HashMap lookup >>> whenever the uri is un-inlined and the default behavior is unchanged. It >>> doesn't change the on disk representation either. 
>>> >>> Nik >>> >>> >>> >>> ------------------------------------------------------------------------------ >>> BPM Camp - Free Virtual Workshop May 6th at 10am PDT/1PM EDT >>> Develop your own process in accordance with the BPMN 2 standard >>> Learn Process modeling best practices with Bonita BPM through live >>> exercises >>> http://www.bonitasoft.com/be-part-of-it/events/bpm-camp-virtual- >>> event?utm_ >>> source=Sourceforge_BPM_Camp_5_6_15&utm_medium=email&utm_campaign=VA_SF >>> _______________________________________________ >>> Bigdata-developers mailing list >>> Big...@li... >>> https://lists.sourceforge.net/lists/listinfo/bigdata-developers >>> >>> >> > |