Looking for the latest version? Download parser.zip (4.4 kB)
Home
Name Modified Size Downloads / Week Status
Totals: 2 Items   5.0 kB 7
parser.zip 2014-07-02 4.4 kB 55 weekly downloads
Readme.txt 2014-07-02 567 Bytes 22 weekly downloads
This is a code for the sentence parsing that does its job properly and FAST. The main problem is that you really need a database of abbreviations so that phrases such as "Dr. Smith" are not calculated as 2 sentences, which means that the good parser must be language dependent. I am also providing a list of all English abbreviations with the code. USAGE: parser p = new parser(); p.parseDoc("in.txt", "out.txt"); in.txt is the String for the filename that you wish to parse out.txt is the String for the filename that you want to have as an output
Source: Readme.txt, updated 2014-07-02