NLTK 1.2 released

NLTK version 1.2 is now available on SourceForge:


NLTK, the Natural Language Toolkit, is a suite of Python libraries and programs for symbolic and statistical natural language processing. NLTK includes graphical demonstrations and sample data. It is accompanied by extensive documentation, including tutorials that explain the underlying concepts behind the language processing tasks supported by the toolkit.

NLTK is ideally suited to students who are learning NLP (natural language processing) or conducting research in NLP or closely related areas, including empirical linguistics, cognitive science, artificial intelligence, information retrieval, and machine learning. NLTK has been used successfully as a teaching tool, as an individual study tool, and as a platform for prototyping and building research systems.

NLTK version 1.1 adds:

* 4 new datasets that are useful for developing and testing NLP tools, along with tokenizers and parsers to provide a high-level interface to the datasets.

* Improvements to the graphical chart parser demo.

* Improvements to the sequential tagger.

* Several new third-party contributions, including a boosting classifier, a decision list, a decision tree, an implementation of Lesk's dictionary-based tagger, an interface to wordnet, and an interface to Babelfish.

For a complete list of improvements, see the change log:


Posted by Edward Loper 2003-11-05

Log in to post a comment.

Get latest updates about Open Source Projects, Conferences and News.

Sign up for the SourceForge newsletter:

JavaScript is required for this form.

No, thanks