Perstem is a Persian (Farsi) stemmer, morphological analyzer, transliterator, and partial part-of-speech tagger. Inflexional morphemes are separated or removed from their stems. Perstem can also tokenize and transliterate between various character set encodings and romanizations.

Features

  • Stems
  • Analyzes Morphology
  • Accepts & Transliterates between UTF-8, Windows-1256, ISIRI-3342, HTML-style Numeric Character References, ArabTeX romanization, and Dehdari transliteration
  • Displays Part-of-Speech Tags for Many Words
  • Tokenizes
  • Handles Irregular Verbs, Semi-Regular Verbs, and Many Broken Plurals
  • Very Fast
  • Small Single File, Requiring no External Data

Project Activity

See All Activity >

License

GNU General Public License version 3.0 (GPLv3)

Follow Perstem

Perstem Web Site

Other Useful Business Software
$300 in Free Credit Towards Top Cloud Services Icon
$300 in Free Credit Towards Top Cloud Services

Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
Get Started
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
1
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 0 / 5

User Reviews

  • Nice, thank you
    1 user found this review helpful.
Read more reviews >

Additional Project Details

Operating Systems

BSD, Linux

Languages

English

Intended Audience

Advanced End Users, Information Technology, Science/Research

User Interface

Command-line, Web-based

Programming Language

Perl

Related Categories

Perl Search Engines, Perl Linguistics Software, Perl Languages Software

Registered

2006-08-23