From: Gary W. <je...@cs...> - 2011-04-05 15:35:30
|
I have the new 1.6.0 and want to use only CDX and no BDB for my indexing, since I have a lot of files. I put a small number of files in my wayback.basedir=/lfs/1/tmp/wayback and started Tomcat. I commented out the indexqueueupdater, to prevent BDB from indexing the files. I see the files in file-db/incoming and file-db/state/filesk. 1) How do I get them to appear where CDX can use them? You sent me a script: find /lfs/1/tmp/wayback/index-data/{incoming,merged} -type f -name "*.arc.gz" | xargs cat | /lfs/1/tmp/wayback/bin/url-client | sort -u -S 50% -T /lfs/1/tmp/wayback/sort-tmp > /lfs/1/tmp/wayback/cdx/Katrina.cdx but I don't see any files in those directories. (because it was for when I had already partially indexed with BDB, in my previous attempt?). 2) How do I update my CDX when I add files? Gary Wesley -- A witty saying proves nothing. -- Voltaire |