<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Recent changes to Overview</title><link>https://sourceforge.net/p/lemur/wiki/Overview/</link><description>Recent changes to Overview</description><atom:link href="https://sourceforge.net/p/lemur/wiki/Overview/feed" rel="self"/><language>en</language><lastBuildDate>Thu, 04 Oct 2012 13:12:33 -0000</lastBuildDate><atom:link href="https://sourceforge.net/p/lemur/wiki/Overview/feed" rel="self" type="application/rss+xml"/><item><title>WikiPage Overview modified by David Fisher</title><link>https://sourceforge.net/p/lemur/wiki/Overview/</link><description>The Lemur Project, a collaboration between the Computer Science Department at the University of Massachusetts and the School of Computer Science at Carnegie Mellon University, provides a collection of open-source software and data collections designed to facilitate research in language modeling and information retrieval. Lemur supports a wide range of industrial and research language applications such as ad-hoc retrieval, site-search, and text mining.

The Indri search engine supports indexing of large-scale text databases, the construction of simple language models for documents, queries, or subcollections, and the implementation of retrieval systems based on language models as well as a variety of other retrieval models. The system is written in the C and C++ languages, and is designed as a research system to run under Unix operating systems, although it can also run under Windows.

The sections below provide more insight into the toolkit and some of the notable features:

  * [Indri]
  * [Indexer File Formats]
  * [Fields and Metadata]
  * [ClueWeb 09]
</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">David Fisher</dc:creator><pubDate>Thu, 04 Oct 2012 13:12:33 -0000</pubDate><guid>https://sourceforge.netf494fcb8a35a99e5a55a832070f138730ec6484a</guid></item></channel></rss>