Sentrick is a stream-oriented Java library and a set of command-line tools for high-quality sentence boundary detection (also known as sentence segmentation, splitting, or disambiguation). It currently ships with one model, for German, trained on general text and Wikipedia lynx dumps.
Features
- model for German (trained on general text and Wikipedia lynx dumps)
- highly accurate
- handles a wide range of potential boundaries
- can cope with headlines, lists, tables
- preserves whitespace
- stream-oriented
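To make "preserves whitespace" concrete, here is a minimal sketch built on the JDK's standard `java.text.BreakIterator`. It does not use Sentrick's API or its German model, neither of which is described on this page. The class name `SentenceSketch` and the method `split` are hypothetical, chosen only for illustration. The idea shown is that each segment keeps its trailing whitespace, so concatenating the segments reproduces the input exactly.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only: this is NOT Sentrick's API.
final class SentenceSketch {

    // Splits text into sentence segments using the JDK's rule-based iterator.
    // Each segment runs from one boundary to the next, so whitespace between
    // sentences stays attached to the preceding segment and nothing is dropped.
    static List<String> split(String text) {
        BreakIterator it = BreakIterator.getSentenceInstance(Locale.GERMAN);
        it.setText(text);
        List<String> segments = new ArrayList<>();
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            segments.add(text.substring(start, end));
        }
        return segments;
    }

    public static void main(String[] args) {
        String input = "Das ist ein Satz.  Hier folgt noch einer.\n";
        List<String> segments = split(input);
        for (String s : segments) {
            System.out.println("[" + s + "]");
        }
        // Round-trip check: joining the segments gives back the original text.
        System.out.println(String.join("", segments).equals(input));
    }
}
```

The JDK iterator is a generic rule-based splitter, so its accuracy on abbreviations, headlines, lists, and tables should not be taken as representative of a trained model such as the one described here.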
License
GNU General Public License version 3.0 (GPLv3)