Perstem is a Persian (Farsi) stemmer, morphological analyzer, transliterator, and partial part-of-speech tagger. Inflexional morphemes are separated or removed from their stems. Perstem can also tokenize and transliterate between various character set encodings and romanizations.

Features

  • Stems
  • Analyzes Morphology
  • Accepts & Transliterates between UTF-8, Windows-1256, ISIRI-3342, HTML-style Numeric Character References, ArabTeX romanization, and Dehdari transliteration
  • Displays Part-of-Speech Tags for Many Words
  • Tokenizes
  • Handles Irregular Verbs, Semi-Regular Verbs, and Many Broken Plurals
  • Very Fast
  • Small Single File, Requiring no External Data
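
As an illustration of what converting between some of the encodings listed above involves, here is a minimal sketch using only the Python standard library. It is not Perstem's own interface or implementation (Perstem is written in Perl); the function names `transcode`, `to_ncr`, and `from_ncr` are hypothetical, and the sketch covers only UTF-8, Windows-1256, and HTML-style numeric character references.

```python
import html


def transcode(data: bytes, src: str, dst: str) -> bytes:
    """Re-encode bytes from one character set to another (hypothetical helper)."""
    return data.decode(src).encode(dst)


def to_ncr(text: str) -> str:
    """Replace each non-ASCII character with a decimal numeric character reference."""
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


def from_ncr(text: str) -> str:
    """Turn numeric character references back into the characters they name."""
    return html.unescape(text)


if __name__ == "__main__":
    word = "\u0622\u0628"  # the Persian word for "water" (alef with madda, then beh)
    utf8_bytes = word.encode("utf-8")
    cp1256_bytes = transcode(utf8_bytes, "utf-8", "cp1256")
    print(utf8_bytes.hex(), cp1256_bytes.hex(), to_ncr(word))
```

Note that Windows-1256 cannot represent every character used in Persian text, so a real converter needs a policy for unmappable characters; this sketch simply lets Python raise `UnicodeEncodeError` in that case.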

License

GNU General Public License version 3.0 (GPLv3)


User Reviews

  • Nice, thank you

Additional Project Details

Operating Systems

BSD, Linux

Languages

English

Intended Audience

Advanced End Users, Information Technology, Science/Research

User Interface

Command-line, Web-based

Programming Language

Perl

Related Categories

Perl Search Engines, Perl Linguistics Software, Perl Languages Software

Registered

2006-08-23