SentenceDetectorME finalization [TRUNK]

Status: Beta

Brought to you by: gann, jasonbaldridge, joernkottmann, tsmorton

This project can now be found here.

#4 SentenceDetectorME finalization [TRUNK]

Status: open

Owner: nobody

Labels: None

Priority: 5

Updated: 2009-08-27

Created: 2009-08-27

Creator: James Kosin

Private: No

Well, I've done it again.
In short, this patch adds:
a) the cutoff and iteration additions to the command line.
b) outputs more information on the setup and the commands to verify they took correctly.
c) breaks the encoding option to be optional, with a default of the system default encoding if not specified.

Note, however, I still belive this really needs to be strongly suggested that it is provided by developers or people who cross international boundries with the models.
Until we get a good map of language to encoding for the input files, we can't be sure the parsers will properly interpret the training files.

Lastly, this patch closes a few todos in the file which were for support for the cutoff and iteration limits.

Discussion

James Kosin - 2009-08-27

patch for completed SentenceDetectorME class

sentdetect.2.patch

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

SentenceDetectorME finalization [TRUNK]

Group

Searches

Help

#4 SentenceDetectorME finalization [TRUNK]

Discussion