Name | Modified | Size | Downloads / Week |
---|---|---|---|
parser.zip | 2014-07-02 | 4.4 kB | |
Readme.txt | 2014-07-02 | 567 Bytes | |
Totals: 2 Items | 5.0 kB | 0 |
This is a code for the sentence parsing that does its job properly and FAST. The main problem is that you really need a database of abbreviations so that phrases such as "Dr. Smith" are not calculated as 2 sentences, which means that the good parser must be language dependent. I am also providing a list of all English abbreviations with the code. USAGE: parser p = new parser(); p.parseDoc("in.txt", "out.txt"); in.txt is the String for the filename that you wish to parse out.txt is the String for the filename that you want to have as an output