A stream-oriented Java library and a set of command-line tools for high-quality sentence boundary detection (also known as sentence segmentation, splitting, or disambiguation). It currently ships one model, for German, trained on general text and Wikipedia lynx dumps.
- highly accurate
- handles a wide range of potential boundaries
- can cope with headlines, lists, tables
- preserves whitespace
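
To make the "preserves whitespace" property concrete, here is a minimal sketch. It does not use this library's API, which is not shown here. It substitutes the JDK's rule-based `java.text.BreakIterator` as a stand-in splitter, and the class name `WhitespacePreservingSplit` and its `split` helper are hypothetical. The point is only the invariant: concatenating the returned segments reproduces the input exactly, including runs of spaces and newlines.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

public class WhitespacePreservingSplit {

    // Hypothetical helper, NOT part of this library: splits text at the
    // boundaries reported by the JDK's sentence BreakIterator and keeps
    // every character, so trailing whitespace stays attached to a segment.
    public static List<String> split(String text) {
        BreakIterator it = BreakIterator.getSentenceInstance(Locale.GERMAN);
        it.setText(text);
        List<String> segments = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            segments.add(text.substring(start, end));
        }
        return segments;
    }

    public static void main(String[] args) {
        String text = "Das ist ein Satz.  Hier folgt ein zweiter!\nUnd ein dritter?";
        List<String> segments = split(text);
        for (String s : segments) {
            // Brackets make the preserved whitespace visible.
            System.out.println("[" + s + "]");
        }
        // Check that nothing was dropped or normalized.
        System.out.println(String.join("", segments).equals(text));
    }
}
```

The JDK iterator is a generic rule-based splitter and will likely mis-split cases such as abbreviations, headlines, or list items, which are the cases a trained model is meant to handle. It is used here only to illustrate the whitespace-preserving contract.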