- assigned_to: nobody --> jeromesimeon
A user of the Galax Debian package reported that Galax can segfault when invoked on very large XML files (the reported size is about 1.1 GB).
For your convenience, the full text of the Debian bug report follows. The most up-to-date information on the Debian side can always be found at http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=576553
The use of a very large XML file (1.1 GB in my case) can cause a segfault
in galax. The backtrace doesn't seem to be useful to me, but then again I don't
know OCaml. Here's how to reproduce the issue:
$ galax-run -context-item test.rdf noop.xq
noop.xq can be an empty file. Hence, I suspect that no XQuery-related code
is affected, but rather just the XML parser used for the context item.
The memory consumption goes up to about 160 MB before the segfault. In comparison,
it is about 5.2 GB for loading the document with xsltproc, xmllint or xmlstarlet
which thrashes my system a bit but otherwise works fine on the document.
In case it works with other large documents, the specific one was pulled from:
http://glue.cefriel.it/glue/Service-Finder-Data/
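To check whether the crash depends on that particular document or only on file size, a synthetic file of similar size could be generated and fed to the same command. The following is a minimal Python sketch, not part of the original report; the function name `write_synthetic_xml`, the element names `root` and `item`, and the output file name are all made up for illustration, and it is not known whether such a file triggers the same segfault.

```python
def write_synthetic_xml(path, target_bytes):
    """Stream a well-formed XML document of at least target_bytes to path.

    The element names and payload are arbitrary placeholders; only the
    resulting file size matters for this test.
    """
    record = '  <item id="{}">synthetic payload for parser stress test</item>\n'
    written = 0
    # newline="\n" keeps the byte count equal to the character count (ASCII only).
    with open(path, "w", encoding="utf-8", newline="\n") as out:
        written += out.write('<?xml version="1.0" encoding="UTF-8"?>\n<root>\n')
        i = 0
        # Write records one at a time so memory use stays small.
        while written < target_bytes:
            written += out.write(record.format(i))
            i += 1
        written += out.write("</root>\n")
    return written
```

Calling `write_synthetic_xml("synthetic.xml", 1_100_000_000)` would produce a file of roughly 1.1 GB, which can then be tried with `galax-run -context-item synthetic.xml noop.xq`.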