Name | Modified | Size | Downloads / Week |
---|---|---|---|
README | 2013-05-11 | 1.7 kB | |
COPYING | 2013-05-11 | 35.1 kB | |
AUTHORS | 2013-05-11 | 24 Bytes | |
Arabic_list_of_new_words.zip | 2013-05-11 | 8.1 MB | |
Totals: 4 Items | | 8.1 MB | 1 |
#-------------------------------------------------------------------------------
# This file is part of Arabic New Words
#
# Copyright (c) 2013 Mohammed Attia
#
# This program is free software: you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation, either version 3 of the License, or
# (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
# GNU General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with this program. If not, see <http://www.gnu.org/licenses/>
#-------------------------------------------------------------------------------

This is a list of new words, that is, words that are not included in the Buckwalter Morphological Analyser version 2.0. It includes 476,349 new lemmatized words. They are weighted and ordered so that the words most relevant lexicographically are likely to surface at the top, while the least relevant words are pushed down the list. For example, if you take the first 10,000 words, there is a good chance that you will find a large number of words fit to include in a dictionary.

Please note that the word list is not filtered by a spell checker, so many entries will simply be misspellings. Proper names are not filtered out because high-frequency proper names are usually included in morphological analysers to improve coverage; people compiling dictionaries might want to exclude them.
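Because the list is ordered by relevance, taking the top of the list amounts to reading its first N entries. The following is a minimal sketch in Python, not part of the distributed package. It assumes the archive unpacks to a UTF-8 text file with one entry per line, which this README does not confirm. The function name `top_entries` and the file name in the usage note are hypothetical.

```python
from itertools import islice
from typing import Iterable, List


def top_entries(lines: Iterable[str], n: int = 10_000) -> List[str]:
    """Return the first n non-empty entries, preserving the list's order.

    Assumption: each line holds exactly one entry. Any extra columns on a
    line (for example a weight) are kept untouched as part of the entry.
    """
    stripped = (line.strip() for line in lines)
    non_empty = (line for line in stripped if line)
    return list(islice(non_empty, n))


# Small in-memory demonstration; blank lines are skipped.
sample = ["word_a\n", "\n", "word_b\n", "word_c\n"]
print(top_entries(sample, 2))  # prints ['word_a', 'word_b']
```

To apply this to the real data, open the unpacked file with `open("new_words.txt", encoding="utf-8")` (the file name is a placeholder) and pass the file object to `top_entries`. Since the list has not been spell-checked and still contains proper names, the result is best treated as a pool of candidates for manual review.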