Harvest is a distributed search engine framework. It collects data using various methods like HTTP, FTP, News, local files etc., extracts relevant information, creates indexes and make them searchable using a Web interface. All of the collecting, extracti