Perstem is a Persian (Farsi) stemmer, morphological analyzer, transliterator, and partial part-of-speech tagger. Inflexional morphemes are separated or removed from their stems. Perstem can also tokenize and transliterate between various character set encodings and romanizations.
Features
- Stems
- Analyzes Morphology
- Accepts & Transliterates between UTF-8, Windows-1256, ISIRI-3342, HTML-style Numeric Character References, ArabTeX romanization, and Dehdari transliteration
- Displays Part-of-Speech Tags for Many Words
- Tokenizes
- Handles Irregular Verbs, Semi-Regular Verbs, and Many Broken Plurals
- Very Fast
- Small Single File, Requiring no External Data
License
GNU General Public License version 3.0 (GPLv3)Follow Perstem
You Might Also Like
ConnectWise CPQ, formerly ConnectWise Sell, is a professional quote and proposal automation software for IT solution providers. ConnectWise CPQ offers a wide range of tools that enables IT solution providers to save time, quote more, and win big. Top features include professional quote or proposal templates, product catalog and sourcing, workflow automation, sales reporting, and integrations with best-in-breed solutions like Cisco, Dell, HP, and Salesforce.
Rate This Project
Login To Rate This Project
User Reviews
There are no 1 star reviews.