Java API and tools for performing NLP and other AI tasks
...ISBN: 978-3-642-45113-3
The zip file contains the Gannu jar, its source code, the API documentation, and the resources needed to run research experiments.
Gannu uses the following projects: Weka, JExcel API, Stanford POS Tagger and WordNet. Please cite them when using Gannu.
NLPTools-ES is a Spanish plugin for GATE (General Architecture for Text Engineering). It includes a tokenizer, a sentence splitter, a gazetteer, and a POS tagger.
CRFTagger: a Conditional Random Fields part-of-speech (POS) tagger for English. The model was trained on sections 01..24 of the WSJ corpus, with section 00 used as the development test set (accuracy of 97.00%). Tagging speed: 500 sentences/s.
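
For readers who want to reproduce a figure like the one above, here is a minimal sketch of how a tagging accuracy could be computed. It assumes the reported accuracy is per-token, which is the usual convention for POS tagging but is not stated in the description. The class and method names (TaggingAccuracy, accuracy, isTrainingSection) are hypothetical and are not part of CRFTagger's API.

```java
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of CRFTagger: illustrates per-token accuracy
// and the WSJ section split described in the text.
public class TaggingAccuracy {

    /** Fraction of tokens whose predicted tag equals the gold tag. */
    static double accuracy(List<String> gold, List<String> predicted) {
        if (gold.size() != predicted.size()) {
            throw new IllegalArgumentException("tag sequences differ in length");
        }
        if (gold.isEmpty()) {
            return 0.0; // assumption: define accuracy on empty input as 0
        }
        int correct = 0;
        for (int i = 0; i < gold.size(); i++) {
            if (gold.get(i).equals(predicted.get(i))) {
                correct++;
            }
        }
        return (double) correct / gold.size();
    }

    /** Sections 01..24 are used for training; section 00 is the development set. */
    static boolean isTrainingSection(int section) {
        return section >= 1 && section <= 24;
    }

    public static void main(String[] args) {
        // Toy data: 3 of the 4 predicted tags match the gold tags.
        List<String> gold = List.of("DT", "NN", "VBZ", "JJ");
        List<String> predicted = List.of("DT", "NN", "VBZ", "NN");
        double acc = accuracy(gold, predicted);
        System.out.println(String.format(Locale.ROOT, "accuracy = %.2f%%", 100 * acc));
    }
}
```

In a real evaluation the gold and predicted lists would hold the tags for every token of section 00, concatenated across sentences.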