Option to store text version of the documents in the index (sort of "cache")
Desktop search application
Brought to you by:
qforce
I would like to store a text version of the documents in the index, just like the "Cache" of Google does.
For example I might have an index of a network drive indexed in my laptop, and I would like to preview the documents even if I am not in the network. This cache might be useful for indexes of DVD or USB drives as well.
This should be optional as the text version of the document might increase the size of the index. Anyway a text version of a Word or PDF document is much smaller than the original document and it can be compressed very well.
I don't know if Lucene provides this feature, but I have seen this feature in http://regain.sourceforge.net/ which is also based on Lucene.
Octavi
Well, I can see how this feature would be useful, but implementing it would definitely require a substantial amount of programming effort. Also, it doesn't look quite as important as some other features I have in mind, e.g. EPUB support. So, text caching is probably not going to get implemented in the near future.