This is a list of obsolete words, or words that are outdated or not in contemporary use, in the Buckwalter Morphological Analyser database. This list is developed according to a threshold of frequency on the web and the Arabic gigaword corpus. The list contain about 8,400 words that fell out of current use with a margin error of 1%. The threshold is defined like this. All the lemmas in Buckwalter queried in three news web sites (al-Jazeera, Arabic BBC and Arabic Wikipedia) and if the lemma is not found in any of the three search engines, it is considered as obsolete. Then all the lemmas are queried in the Arabic Gigaword corpus (fourth edition) and if a lemma has a frequency of 10 or less occurrences, then it is considered as obsolete.
Reference
Mohammed Attia, Pavel Pecina, Lamia Tounsi, Antonio Toral, Josef van Genabith. 2011. A Lexical Database for Modern Standard Arabic Interoperable with a Finite State Morphological Transducer.

Project Activity

See All Activity >

Follow Arabic Obsolete Words

Arabic Obsolete Words Web Site

You Might Also Like
Passwordless authentication enables a secure and frictionless experience for your users | Auth0 Icon
Over two-thirds of people reuse passwords across sites, resulting in an increasingly insecure e-commerce ecosystem. Learn how passwordless can not only mitigate these issues but make the authentication experience delightful. Implement Auth0 in any application in just five minutes
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Arabic Obsolete Words!

Additional Project Details

Registered

2012-05-31