| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| parser.zip | 2014-07-02 | 4.4 kB | |
| Readme.txt | 2014-07-02 | 567 Bytes | |
| Totals: 2 Items | 5.0 kB | 0 |
This is a code for the sentence parsing that does its job properly and FAST.
The main problem is that you really need a database of abbreviations so that phrases such as "Dr. Smith" are not calculated as 2 sentences, which means that the good parser must be language dependent.
I am also providing a list of all English abbreviations with the code.
USAGE:
parser p = new parser();
p.parseDoc("in.txt", "out.txt");
in.txt is the String for the filename that you wish to parse
out.txt is the String for the filename that you want to have as an output