From: Christiaan F. <chr...@ad...> - 2008-10-06 12:25:41
|
Hi Stefan, It seems to me then that the FileAccessData is not trying to parse the file that you submitted. That file is fully well-formed and the chance that you've found a bug in Xerces is pretty low. Perhaps you can run your application in a debugger and verify that it is indeed parsing this file? Regards, Chris -- Stefan Dellmuth wrote: > Hi... > > so, nobody has any ideas? :( At the moment, the SAX parser spams random > errors, saying that there is no matching endtag for "scanresult" or that > there is a "<" in the id of a data object... which can't be the case and > in the case of the endtag, is plain wrong. Any suggestions? > > Stefan Dellmuth schrieb: >> Hi, Aperturians! :) >> >> I'm currently working on DynaQ, and I'm experiencing some problems with >> the underlying Aperture Framework. The problem has something to do with >> AccessData... ok, let me explain. >> >> I'm carwling a part one directory that contains 8,000 images. I had to >> interrupt the crawling process and when I resumed, I got this exception. >> >> 29.09.2008 14:41:26 SEVERE: IOException while accessing AccessData >> <<<< from CrawlerBase.crawl(..) [Thread 15] >> java.io.IOException: Premature end of file. >> at >> org.semanticdesktop.aperture.accessor.base.FileAccessData.read(FileAccessData.java:205) >> at >> org.semanticdesktop.aperture.accessor.base.FileAccessData.initialize(FileAccessData.java:169) >> at >> org.semanticdesktop.aperture.crawler.base.CrawlerBase.crawl(CrawlerBase.java:208) >> at >> de.dfki.catwiesel.synchronizer.importer.aperture.file.ApertureFileSystemImporter.startImport(ApertureFileSystemImporter.java:288) >> at >> de.dfki.catwiesel.synchronizer.importer.ImporterInputQueue.startImport(ImporterInputQueue.java:110) >> at >> de.dfki.catwiesel.CatwieselDocumentStore.importData(CatwieselDocumentStore.java:720) >> at org.dynaq.index.Indexer.performCatwieselIndexing(Indexer.java:1093) >> at org.dynaq.index.Indexer.createIndex(Indexer.java:495) >> at >> org.dynaq.index.CyclicIndexerRunnable.run(CyclicIndexerRunnable.java:82) >> at java.lang.Thread.run(Thread.java:619) >> Caused by: org.xml.sax.SAXParseException: Premature end of file. >> at >> org.apache.xerces.util.ErrorHandlerWrapper.createSAXParseException(Unknown >> Source) >> at org.apache.xerces.util.ErrorHandlerWrapper.fatalError(Unknown Source) >> at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source) >> at org.apache.xerces.impl.XMLErrorReporter.reportError(Unknown Source) >> at org.apache.xerces.impl.XMLScanner.reportFatalError(Unknown Source) >> at >> org.apache.xerces.impl.XMLDocumentScannerImpl$ContentDispatcher.endOfFileHook(Unknown >> Source) >> at >> org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown >> Source) >> at >> org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown >> Source) >> at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source) >> at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source) >> at org.apache.xerces.parsers.XMLParser.parse(Unknown Source) >> at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source) >> at javax.xml.parsers.SAXParser.parse(SAXParser.java:395) >> at javax.xml.parsers.SAXParser.parse(SAXParser.java:198) >> at >> org.semanticdesktop.aperture.util.SimpleSAXParser.parse(SimpleSAXParser.java:144) >> at >> org.semanticdesktop.aperture.accessor.base.FileAccessData.read(FileAccessData.java:197) >> ... 9 more >> >> The corresponding AccessData file is attached to this mail. The >> exception forces the whole crawling process to abort. >> >> When I start DynaQ for the next time, the crawler starts to crawl all >> the files again! The interesting thing is... the AccessData file is >> still there, referencing the files that were just crawled some minutes ago! >> >> Does anybody know what the problem might be? >> >> Sincerely yours, >> Stefan Dellmuth >> >> >> ------------------------------------------------------------------------ >> >> ------------------------------------------------------------------------- >> This SF.Net email is sponsored by the Moblin Your Move Developer's challenge >> Build the coolest Linux based applications with Moblin SDK & win great prizes >> Grand prize is a trip for two to an Open Source event anywhere in the world >> http://moblin-contest.org/redirect.php?banner_id=100&url=/ >> ------------------------------------------------------------------------ >> >> _______________________________________________ >> Aperture-devel mailing list >> Ape...@li... >> https://lists.sourceforge.net/lists/listinfo/aperture-devel > > > ------------------------------------------------------------------------- > This SF.Net email is sponsored by the Moblin Your Move Developer's challenge > Build the coolest Linux based applications with Moblin SDK & win great prizes > Grand prize is a trip for two to an Open Source event anywhere in the world > http://moblin-contest.org/redirect.php?banner_id=100&url=/ > _______________________________________________ > Aperture-devel mailing list > Ape...@li... > https://lists.sourceforge.net/lists/listinfo/aperture-devel |