Stream-oriented Java library and a set of command line tools for high quality sentence boundary detection. (Sentence segmentation / splitting / disambiguation). Currently has one model for German (trained on general text and Wikipedia lynx dumps).

Features

  • model for German (trained on general text and wikipedia lynx dumps)
  • highly accurate
  • handles a wide range of potential boundaries
  • can cope with headlines, lists, tables
  • preserves whitespace
  • stream-oriented

Project Samples

Project Activity

See All Activity >

License

GNU General Public License version 3.0 (GPLv3)

Follow Sentrick

Sentrick Web Site

Other Useful Business Software
Try Google Cloud Risk-Free With $300 in Credit Icon
Try Google Cloud Risk-Free With $300 in Credit

No hidden charges. No surprise bills. Cancel anytime.

Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
Start Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Sentrick!

Additional Project Details

Programming Language

Java, Prolog

Registered

2010-01-31