I've added a patch to maybe help complete what may have been started.
The patch makes both -lang and -encoding options a requirment. Usually
enclosing any option in [] means the entire option is completely optional,
this really isn't the case anymore.
The next half of the patch checks both lang and encoding for non-null
values before continuing.
The last actually uses the lang value when calling the train function...
Simple enough.
Other usefull things may be to add a way to get the valid encoding names,
and supported lang values... ie: "en", "es", etc...
Nobody/Anonymous
None
None
Public
|
Date: 2009-08-22 04:45 If they are using their native encoding then I may agree with you. |
|
Date: 2009-08-21 10:27 Usually its a good idea to use the platform default encoding as default. |
|
Date: 2009-08-20 23:09 Thanks for taking. |
|
Date: 2009-08-20 13:53 Thanks, for the patch. Its applied now. |
|
Date: 2009-08-20 13:47 Thanks, for the patch. Its applied now. |
| Filename | Description | Download |
|---|---|---|
| sentdetect.patch | Patch for SentenceDetectorME.java on TRUNK | Download |
| Field | Old Value | Date | By |
|---|---|---|---|
| File Added | 339854: sentdetect.patch | 2009-08-20 03:44 | jameskosin |
Copyright © 2009 Geeknet, Inc. All rights reserved. Terms of Use