
Balie v1.02 is here

Balie is designed to support textual information extraction tasks.

Given a text, it will (1) identify the language, (2) tokenize the text, (3) detect sentence boundaries, and (4) guess the part of speech of each token.


This is a minor release with one change:

1- The tokenlist can now be expressed as an XML string. This is useful as an intermediate representation and for visualization using XSL.
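
To give a rough idea of what a token list expressed as an XML string could look like, here is a minimal sketch in Python. It is not Balie code and does not use Balie's API: the function `tokens_to_xml`, the element names `tokenlist` and `token`, and the attributes `id` and `pos` are all made up for illustration, and Balie's real XML format may differ.

```python
import xml.etree.ElementTree as ET


def tokens_to_xml(tokens):
    """Serialize (text, part_of_speech) pairs to an XML string.

    Element and attribute names are hypothetical, chosen only for
    this sketch; they are not Balie's actual schema.
    """
    root = ET.Element("tokenlist")
    for index, (text, pos) in enumerate(tokens):
        # One <token> element per token, carrying its position and POS tag.
        token = ET.SubElement(root, "token", {"id": str(index), "pos": pos})
        token.text = text
    return ET.tostring(root, encoding="unicode")


# Example input: tokens with hand-assigned part-of-speech tags.
xml_string = tokens_to_xml([("Balie", "NNP"), ("tokenizes", "VBZ"), ("text", "NN")])
print(xml_string)
```

A string of this kind can be stored or passed to a later processing step, and an XSL stylesheet could transform it into HTML for viewing.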

Posted by David N. 2004-12-21
