From: Steven Bird <sb@cs...> - 2008-01-29 00:36:32
NLTK-Lite version 0.9.1 has been released -- http://nltk.org/index.php
NLTK -- the Natural Language Toolkit -- is a suite of open source
Python modules, data and documentation for research and development in
natural language processing. NLTK contains Code supporting dozens of
NLP tasks, along with 40 popular Corpora and extensive Documentation
including a 375-page online Book.
This version contains new support for accessing text categorization
corpora, along with several corpora categorized for topic, genre,
question type, or sentiment. (See section 7 of the corpus guide for
It includes several new corpora: Question classification
data (Li & Roth), Reuters 21578 Corpus, Movie Reviews
corpus (Pang & Lee), Recognising Textual Entailment (RTE) Challenges.
NLTK-Contrib includes expanded support for semantics (Dan Garrette),
readability scoring (Thomas Jakobsen, Thomas Skardal), and SIL Toolbox
(Greg Aumann). The book contains many improvements in early chapters
in response to reader feedback.
Note that a video of a talk about NLTK is available: