The Lemur project software also contains several utilities for gathering in-links to trec-web and HTML data, adding document-prior values to an index, and for getting documents, terms, and other various index statistics from an Indri repository.
See the pages below for details of each item:
* [dumpdoc, dumpterm, and dumpindex]
Wiki: Toolkit Usage Overview
Wiki: dumpdoc, dumpterm, and dumpindex