Sarf is an integrated software system (engine) that generates Arabic verbs, derivative nouns, and gerunds from their triple and quadruple roots and inflects them, following the rules of Arabic grammar and morphology and drawing on the system database.
1. System Components
The system can be described from two points of view: linguistic and software.

1.1. Linguistic Point of View
The system has two main functions: derivation and conjugation.

1.1.1. Derivation
- Abstract (unaugmented) verbs: specifying their conjugation categories, whether they are transitive or intransitive, and their type with respect to strength and weakness.
- Augmented verbs: specifying their conjugation measures and whether they are transitive or intransitive.
- Derivative nouns.
- Vocal (non-standard) infinitives.
- Standard infinitives.

1.1.2. Conjugation
- Abstract and augmented verbs (triple and quadruple), conjugated for each pronoun in the past, present, and imperative tenses, in both active and passive voice.
- Derivative nouns and infinitives: singular, dual, and plural (masculine and feminine) in the nominative, accusative, and genitive cases.

1.2. Software Point of View
The software system consists of two parts: the database and the program.

1.2.1. Database
The database contains the linguistic data items the system needs. The data is stored in XML files, which are divided according to the triple and quadruple roots in such a way as to simplify editing and modification. Because they are plain text files, the XML files can be edited in any text editor, without using a Database Management System (DBMS) or consulting the programmer. They are designed in a hierarchical structure, which helps in specifying the target dictionary. Anyone can read and edit these files, in keeping with open source principles.

1.2.2. The Program
The system will be designed and implemented in the Java programming language.

2. System Features

2.1. Linguistic Features
- Covers all Arabic roots (over 7,000 triple and quadruple roots).
- Embraces all the verbs (over 24,000 triple and quadruple verbs).
- Able to generate all derivative nouns (about 100,000) and standard infinitives (over 16,000), depending on the derivation rules.
- Encompasses all non-standard infinitives (over 14,000).
- Considers all syntactical and morphological aspects related to verb conjugation and noun derivation, such as the common conjugation rules.
- Observes the rules of the Hamzah (an Arabic character) in both nouns and verbs.
- Outputs words with complete vowelization.

2.2. Software Features
- The database is independent of the program.
- The database can be modified by adding, updating, or deleting entries.
- The program can be extended to include new levels of Arabic language processing.
- User friendly, providing menus and options.
- The system can be used on any operating system, such as Windows, Linux, Solaris, or MacOS.

3. System Output
The system will be delivered as a software library that can be used as a subsystem in any other software system. In addition, it will be provided in executable versions for the corresponding operating systems, which interact with the end user through a graphical user interface to display the following:
- The derivative nouns for the entered root.
- Linguistic aspects related to a specific verb.
- Verb conjugation tables, which show the verb form for each pronoun.
- Noun declension tables, which include all morphological and syntactical cases of nouns.

4. System Possible Applications
- Arabic language e-dictionary.
- Computer-aided translation (CAT).
- Speech synthesis.
- Optical Character Recognition (OCR).
- Automated Arabic text vocalization (diacritical marks).
- Text error recognition and correction.
- Arabic automatic reading.
- Arabic automatic writing.
- Arabic language learning and teaching for native speakers and others.

5. Organizations Interested In This System
- Research centers specialized in Arabic Natural Language Processing (ANLP).
- Education ministries in the Arab countries.
- Software companies that work in the development of ANLP.
- Arabic academies in the Arab World.
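As an illustration of the database design described in section 1.2.1, here is a minimal Java sketch of how a program might read roots from an XML file using the standard JAXP DOM parser. The element name `root` and the attributes `c1`, `c2`, `c3`, and `transitive` are hypothetical, invented for this example only; the actual layout of the Sarf XML files is not described here and may differ.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

public class RootFileSketch {

    // Hypothetical file content: two triple roots (k-t-b and d-r-s),
    // one radical per attribute. This is NOT the real Sarf schema.
    static final String SAMPLE_XML =
        "<roots>"
        + "<root c1=\"\u0643\" c2=\"\u062A\" c3=\"\u0628\" transitive=\"true\"/>"
        + "<root c1=\"\u062F\" c2=\"\u0631\" c3=\"\u0633\" transitive=\"true\"/>"
        + "</roots>";

    // Parses the XML text and returns each root as a three-letter string.
    static List<String> readRoots(String xml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
            .newDocumentBuilder()
            .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        NodeList nodes = doc.getElementsByTagName("root");
        List<String> roots = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            Element e = (Element) nodes.item(i);
            // Join the three radicals into one root string.
            roots.add(e.getAttribute("c1") + e.getAttribute("c2") + e.getAttribute("c3"));
        }
        return roots;
    }

    public static void main(String[] args) throws Exception {
        for (String root : readRoots(SAMPLE_XML)) {
            System.out.println(root);
        }
    }
}
```

Because the data is plain XML, a new root could be added in this hypothetical layout by appending one more `root` element in a text editor, with no change to the program.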
Production version 1.0
Release 1.0
Copyright © 2009 Geeknet, Inc. All rights reserved.