Perstem is a Persian (Farsi) stemmer, morphological analyzer, transliterator, and partial part-of-speech tagger. Inflexional morphemes are separated or removed from their stems. Perstem can also tokenize and transliterate between various character set encodings and romanizations.
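To make "separated or removed from their stems" concrete, here is a minimal Python sketch that handles just one inflectional morpheme, the regular plural suffix "ها". It is an illustration only: it is not Perstem's code or rule set, and the function name `split_plural` and the `+` separator are assumptions chosen for this example.

```python
# Toy illustration only; NOT Perstem's implementation or rule set.
ZWNJ = "\u200c"             # zero-width non-joiner, often written before the suffix
PLURAL_HA = "\u0647\u0627"  # "ها", the regular Persian plural suffix


def split_plural(word: str, sep: str = "+") -> str:
    """Separate a trailing plural suffix from its stem (hypothetical helper).

    Returns the word unchanged if it does not end in the suffix or if the
    remaining stem would be shorter than two characters.
    """
    if word.endswith(PLURAL_HA):
        stem = word[: -len(PLURAL_HA)].rstrip(ZWNJ)
        if len(stem) >= 2:
            return f"{stem}{sep}{PLURAL_HA}"
    return word


def strip_plural(word: str) -> str:
    """Remove the plural suffix entirely, keeping only the stem."""
    return split_plural(word, sep="\x00").split("\x00")[0]


if __name__ == "__main__":
    books = "\u06a9\u062a\u0627\u0628\u200c\u0647\u0627"  # "books", written with ZWNJ
    print(split_plural(books))
    print(strip_plural(books))
```

A rule this naive also fires on words that merely happen to end in the same letters, which is one reason a real stemmer needs exception handling for irregular forms.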
Features
- Stems
- Analyzes Morphology
- Accepts & Transliterates between UTF-8, Windows-1256, ISIRI-3342, HTML-style Numeric Character References, ArabTeX romanization, and Dehdari transliteration
- Displays Part-of-Speech Tags for Many Words
- Tokenizes
- Handles Irregular Verbs, Semi-Regular Verbs, and Many Broken Plurals
- Very Fast
- Small Single File, Requiring no External Data
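As a rough picture of what moving text between some of the listed representations involves, the following Python sketch round-trips a Persian word through Windows-1256 bytes and HTML-style numeric character references using only the standard library. It does not call Perstem, the helper names are made up for this example, and ISIRI-3342 and the romanizations are left out because Python ships no codec for them.

```python
# Standard-library sketch; does not use or reproduce Perstem itself.
import html


def to_windows_1256(text: str) -> bytes:
    """Encode text as Windows-1256 (Python codec name: cp1256)."""
    return text.encode("cp1256")


def from_windows_1256(data: bytes) -> str:
    """Decode Windows-1256 bytes back to a Unicode string."""
    return data.decode("cp1256")


def to_ncr(text: str) -> str:
    """Replace non-ASCII characters with decimal references such as &#1587;."""
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


def from_ncr(text: str) -> str:
    """Resolve numeric character references back to characters."""
    return html.unescape(text)


if __name__ == "__main__":
    word = "\u0633\u0644\u0627\u0645"  # "salam" (hello)
    print(len(word.encode("utf-8")))    # UTF-8 uses two bytes per letter here
    print(len(to_windows_1256(word)))   # Windows-1256 uses one byte per letter
    print(to_ncr(word))
    assert from_windows_1256(to_windows_1256(word)) == word
    assert from_ncr(to_ncr(word)) == word
```

Encoding to `cp1256` raises `UnicodeEncodeError` for characters that Windows-1256 does not contain, so a converter has to decide how to map or report them.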
License
GNU General Public License version 3.0 (GPLv3)