Reindexing a directory of PDFs with tens of thousands of IBM mainframe manuals, along with several Powerpoints and a zip file full of HTML and Eclipse.
I let this run overnight, and in the morning I found this MergeException:
program.name=DocFetcher program.version=1.1.11 program.build=20140303-1457 program.portable=false java.runtime.name=Java(TM) SE Runtime Environment java.runtime.version=1.6.0_17-b04 java.version=1.6.0_17 sun.arch.data.model=64 os.arch=amd64 os.name=Windows 7 os.version=6.1 user.language=en org.apache.lucene.index.MergePolicy$MergeException: java.io.IOException: Cannot overwrite: C:\Users\mmullins\AppData\Roaming\DocFetcher\PDF_1382115096437\_33.fdt at org.apache.lucene.index.ConcurrentMergeScheduler.handleMergeException(ConcurrentMergeScheduler.java:517) at org.apache.lucene.index.ConcurrentMergeScheduler$MergeThread.run(ConcurrentMergeScheduler.java:482) Caused by: java.io.IOException: Cannot overwrite: C:\Users\mmullins\AppData\Roaming\DocFetcher\PDF_1382115096437\_33.fdt at org.apache.lucene.store.FSDirectory.ensureCanWrite(FSDirectory.java:316) at org.apache.lucene.store.FSDirectory.createOutput(FSDirectory.java:305) at org.apache.lucene.index.FieldsWriter.<init>(FieldsWriter.java:83) at org.apache.lucene.index.SegmentMerger.mergeFields(SegmentMerger.java:234) at org.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java:108) at org.apache.lucene.index.IndexWriter.mergeMiddle(IndexWriter.java:4295) at org.apache.lucene.index.IndexWriter.merge(IndexWriter.java:3940) at org.apache.lucene.index.ConcurrentMergeScheduler.doMerge(ConcurrentMergeScheduler.java:388) at org.apache.lucene.index.ConcurrentMergeScheduler$MergeThread.run(ConcurrentMergeScheduler.java:456)
Unfortunately, my Java knowledge isn't that good to figure out what "Cannot overwrite" means.
Anonymous
Hi,
This could be a problem with file permissions, i.e. DocFetcher not being able to write into the indexes directory. As a workaround, you could try the portable version of DocFetcher, which will write its indexes into the DocFetcher folder itself. Just make sure you unpack the DocFetcher archive into a writable folder (i.e. not into a system folder like C: or C:\Program Files).
Best regards
q:-) <= Quang
I re-ran the entire rebuild (8 hours!) and there were no issues.
I suspect that Piriform's CCleaner may have zapped a file during a scheduled cleaning.
We can close this ticket.
Fixed in DocFetcher 1.1.13.