From: ranganaths <ran...@ex...> - 2012-01-27 14:14:38
Hello,

I work in the R&D department of my company, which is in the e-learning business. I am currently working on a semantic search engine and am looking for a robust open-source tool for it. I came across Infomap while googling for this, and I have a few questions about it:

1. While reading the tutorials, I found that Infomap needs to be supplied with a single file when indexing a large corpus. In my case I have a lot of documents to index. Does this mean the contents of all the documents have to be placed in a single text file?
2. I need the semantic engine to index different types of files, such as PDF, TXT, DOC, and HTML. Is this supported?
3. How robust is the algorithm used, given that the documentation describes it as similar to LSA?

Ranganath S
Technology Specialist