From: <fri...@us...> - 2009-05-25 22:45:26
Revision: 9768
http://zaf.svn.sourceforge.net/zaf/?rev=9768&view=rev
Author: friedelwolff
Date: 2009-05-25 22:44:59 +0000 (Mon, 25 May 2009)
Log Message:
-----------
A first batch of useful classified words
Added Paths:
-----------
trunk/dict/zu/wordlists/classified.adjective.1.in
trunk/dict/zu/wordlists/classified.languages.1.in
trunk/dict/zu/wordlists/classified.monoverb.1.in
trunk/dict/zu/wordlists/classified.names.1.in
trunk/dict/zu/wordlists/classified.names9.1.in
trunk/dict/zu/wordlists/classified.noun1.1.in
trunk/dict/zu/wordlists/classified.noun1.other.in
trunk/dict/zu/wordlists/classified.noun11.1.in
trunk/dict/zu/wordlists/classified.noun11.other.in
trunk/dict/zu/wordlists/classified.noun13.1.in
trunk/dict/zu/wordlists/classified.noun15.1.in
trunk/dict/zu/wordlists/classified.noun1a.1.in
trunk/dict/zu/wordlists/classified.noun3.1.in
trunk/dict/zu/wordlists/classified.noun3.other.in
trunk/dict/zu/wordlists/classified.noun4.1.in
trunk/dict/zu/wordlists/classified.noun4.other.in
trunk/dict/zu/wordlists/classified.noun5.1.in
trunk/dict/zu/wordlists/classified.noun7.1.in
trunk/dict/zu/wordlists/classified.noun7.other.in
trunk/dict/zu/wordlists/classified.noun8.other.in
trunk/dict/zu/wordlists/classified.noun9.1.in
trunk/dict/zu/wordlists/classified.noun9.2.in
trunk/dict/zu/wordlists/classified.passive.1.in
trunk/dict/zu/wordlists/classified.relative.1.in
trunk/dict/zu/wordlists/classified.verb.1.in
trunk/dict/zu/wordlists/classified.verb.other.in
Added: trunk/dict/zu/wordlists/classified.adjective.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.adjective.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.adjective.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,25 @@
+# adjectives
+# also known as the qualificative noun with alternating noun class prefix
+# flags: WY
+
+fishane
+
+# counting
+bili
+thathu
+ne
+hlanu
+
+#other
+bi
+hle
+de
+fushane
+ncane
+#ngaka
+#ngakanani
+khulu
+sha
+dala
+ningi
+ngakhi
Added: trunk/dict/zu/wordlists/classified.languages.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.languages.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.languages.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,26 @@
+# Class 7 nouns
+# flags:NP
+
+isiZulu
+isiXhosa
+isiSwazi
+isiNdebele
+isiSuthu
+#isiPedi
+isiTshwana
+isiTsonga
+isiVenda
+isiBhunu
+isiNgisi
+#isiLungu
+#isiCamto
+
+isiJalimani
+isiJapani
+isiFulentshi
+isiGriki
+isiHebheru
+isiPutukezi
+isiRashiya
+isiSwidi
+isiTaliyana
Added: trunk/dict/zu/wordlists/classified.monoverb.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.monoverb.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.monoverb.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,17 @@
+# Monosyllabic verbs
+#
+# TODO: need rules for monosyllabic verbs, then uncomment them.
+# TODO: Ideally we should just be generating this file from the other verbs.
+# They can be programmatically detected.
+#
+# flags:BCY
+
+#fa
+#wa
+ona
+akha
+enza
+#dla
+#ya
+osa
+#lwa
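The TODO above says monosyllabic verbs can be detected programmatically. A minimal sketch of one way to do that, assuming a syllable count can be approximated by counting vowel letters in the stem (syllabic consonants are ignored). The function names are illustrative and not part of the zaf tooling. Because the list above also holds vowel-initial stems such as ona and akha, the sketch reports those as a separate group.

```python
VOWELS = set("aeiou")


def count_syllables(stem):
    """Approximate the syllable count as the number of vowel letters."""
    return sum(1 for ch in stem.lower() if ch in VOWELS)


def is_monosyllabic(stem):
    """True for stems such as 'fa' or 'dla' that contain a single vowel."""
    return count_syllables(stem) == 1


def is_vowel_initial(stem):
    """True for stems such as 'ona' or 'akha' that begin with a vowel."""
    return stem[:1].lower() in VOWELS


def classify(stems):
    """Split verb stems into monosyllabic, vowel-initial and other stems."""
    groups = {"monosyllabic": [], "vowel_initial": [], "other": []}
    for stem in stems:
        if is_monosyllabic(stem):
            groups["monosyllabic"].append(stem)
        elif is_vowel_initial(stem):
            groups["vowel_initial"].append(stem)
        else:
            groups["other"].append(stem)
    return groups
```

Running classify() over the stems in the main verb list would give a candidate monoverb file that could then be reviewed by hand.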
Added: trunk/dict/zu/wordlists/classified.names.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.names.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.names.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,82 @@
+# Person names
+# Classifying the priority is subjective. Let's limit names to really useful
+# and common ones as a start. Less popular names should be in files ending
+# in .2.in or .3.in or similar.
+#
+# This is necessarily all class 1a:
+# flags: NRS
+
+#well known people
+uShaka
+uZulu
+uGoodwill
+uZwelethini
+uMangosuthu
+uButhelezi
+uShenge
+uJacob
+uZuma
+uMsholozi
+uThabo
+uMbeki
+uNelson
+uMandela
+uMadiba
+uJesu
+
+#common names
+uBheki
+uBongani
+uBuhle
+uDumisani
+uJabu
+uMandla
+uNhlanhla
+uSipho
+uSibu
+uSibusiso
+uS'bu
+uSbu
+uVusi
+uXolani
+
+uBusisiwe
+uLindiwe
+uMbali
+uNelisiwe
+uNompi
+uNonhlanla
+uNokuthula
+uNonkululeko
+uNosipho
+uNozipho
+uNtombenhle
+uNtombifuthi
+uNtombizodwa
+uThandeka
+uThandiwe
+uZama
+
+uMkhize
+uGumede
+uHadebe
+uRadebe
+uDube
+uNdebele
+
+#Western names
+uSmith
+uJohn
+uWilliam
+uJames
+uRobert
+uDavid
+uRichard
+uCharles
+uJoseph
+uGeorge
+uBrian
+
+
+#Names of organisations
+uKhongolose
Added: trunk/dict/zu/wordlists/classified.names9.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.names9.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.names9.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,64 @@
+# Names handled as class 9 nouns
+# Isolezwe mostly uses the hyphenated form, but not consistently. We should be
+# able to easily script a conversion between the two forms.
+# flags:NOT
+
+# politics
+i-IEC
+i-ANC
+i-IFP
+i-KZN
+i-NPA
+i-COPE
+i-ANCWL
+i-ANCYL
+i-YCL
+i-DA
+i-SACP
+i-NPA
+i-UDM
+i-UDF
+i-ID
+i-PAC
+i-MK
+i-BEE
+i-EU
+i-SADC
+i-MDC
+
+# degrees
+i-BCom
+i-BSc
+i-BA
+
+# sport, entertainment
+i-PSL
+i-FIFA
+i-ICC
+i-TV
+i-SABC
+i-MNet
+i-BBC
+i-CNN
+i-YFM
+i-DVD
+i-CD
+i-SMS
+i-IBF
+i-SAMRO
+
+i-SRC
+i-UKZN
+i-DUT
+i-TUT
+i-UNISA
+i-SAPS
+i-SARS
+
+i-WHO
+i-SAB
+i-MXit
+i-GPS
+i-MTN
+i-BMW
+i-VW
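The header comment of this file mentions scripting a move between the hyphenated form and the other form. A minimal sketch of such a conversion, assuming the other form simply attaches the prefix directly to the capitalised name (i-ANC versus iANC). That reading, and all function names here, are assumptions rather than anything defined by the project.

```python
import re

# "i-" followed by a capital letter, e.g. "i-ANC"
_HYPHENATED = re.compile(r"^i-(?=[A-Z])")
# "i" followed directly by a capital letter, e.g. "iANC"
_JOINED = re.compile(r"^i(?=[A-Z])")


def to_joined(word):
    """Drop the hyphen after the class 9 prefix: 'i-ANC' -> 'iANC'."""
    return _HYPHENATED.sub("i", word, count=1)


def to_hyphenated(word):
    """Insert a hyphen after the class 9 prefix: 'iANC' -> 'i-ANC'."""
    return _JOINED.sub("i-", word, count=1)


def convert_wordlist(lines, convert):
    """Apply convert() to each entry, leaving comments and blank lines alone."""
    converted = []
    for line in lines:
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            converted.append(line)
        else:
            converted.append(convert(stripped))
    return converted
```

Ordinary nouns such as imali pass through both functions unchanged, because only a capital letter after the prefix triggers the rewrite.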
Added: trunk/dict/zu/wordlists/classified.noun1.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun1.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun1.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,39 @@
+# Class 1 nouns
+# flags: NRS
+
+umuntu
+umfana
+umfazi
+umzali
+umntwana
+umfundi
+umfundisi
+umshayeli
+ummeli
+umngane
+#common, but wrong:
+#umngani
+umfowethu
+umfowabo
+umfowenu
+umyeni
+umzala
+umkhwenyana
+umkhwenyanawethu
+umkhwenyawabo
+umkhwenyawenu
+
+umdlali
+umsizi
+umphathi
+umholi
+umculi
+#not in dictionaries
+umqeqeshi
+
+#TODO: review capitalisation
+umLungu
+umSuthu
+umTswhana
+umXhosa
+umZulu
Added: trunk/dict/zu/wordlists/classified.noun1.other.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun1.other.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun1.other.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,6 @@
+# Class 1 nouns
+# flags: NRS
+
+umthandi
+umhleli
+abahleli
Added: trunk/dict/zu/wordlists/classified.noun11.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun11.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun11.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,21 @@
+# Class 11 nouns
+# flags: NP
+
+ubisi
+ucingo
+udaba
+udonga
+ufudu
+uhambo
+uhlelo
+uhlobo
+ulimi
+ulwandle
+unwele
+uphahla
+uphondo
+usizo
+usuku
+uthando
+uthi
+uxolo
Added: trunk/dict/zu/wordlists/classified.noun11.other.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun11.other.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun11.other.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,27 @@
+# Class 11 nouns
+# flags: NP
+
+uphiko
+usizi
+uhla
+uhlu
+uhlangothi
+unya
+uxhaxha
+uchungechunge
+utwayi
+uthuli
+udumo
+unyanyavu
+uphaphe
+uphawu
+ukhetho
+ulwazi
+ucwaningo
+uyaba
+ujenga
+uqwembe
+
+
+
+
Added: trunk/dict/zu/wordlists/classified.noun13.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun13.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun13.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,30 @@
+# Class 13 nouns
+# flags:NP
+
+ubuso
+ubuntu
+utshwala
+utshani
+ubuhlalu
+ubuncane
+ubukhulu
+ubuhle
+ububi
+ubumnandi
+ubungcono
+ubuciko
+ubungozi
+ubusika
+ubuhlungu
+ubusuku
+ubukholwa
+ubuKrestu
+ubudala
+ubunzima
+ubufakazi
+ubugebengu
+ubuholi
+ubuhlakani
+ubuthaputhaphu
+ubudlelwane
+ubuthongo
Added: trunk/dict/zu/wordlists/classified.noun15.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun15.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun15.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,22 @@
+# Class 15 nouns
+# In reality we will probably not add much here, since they should all be
+# generated from verbs. However, if we have any trouble applying more than one
+# level of morphemes to infinitives (which already counts as one level), we
+# can consider adding common ones here.
+# flags:NP
+
+ukudla
+ukudlala
+ukusebenza
+ukuthola
+ukuthula
+ukufunda
+ukufundisa
+ukuphela
+ukuqala
+ukuhamba
+ukugcina
+
+#confirm:
+ukuthuthukisa
+ukulwa
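The header comment of this file expects class 15 nouns (infinitives) to be generated from verbs. A minimal sketch of that generation at the plain string level, assuming the usual spelling of the prefix: uku- before a consonant, ukw- before a, e or i, and uk- before o. The function name is illustrative, and stems beginning with u are not treated specially here.

```python
def infinitive(stem):
    """Build the class 15 infinitive noun for a verb stem."""
    first = stem[:1].lower()
    if first in ("a", "e", "i"):
        return "ukw" + stem  # akha -> ukwakha
    if first == "o":
        return "uk" + stem   # osa -> ukosa
    return "uku" + stem      # dlala -> ukudlala


def infinitives(stems):
    """Generate infinitives for a list of verb stems, keeping their order."""
    return [infinitive(stem) for stem in stems]
```

Feeding the stems from the verb lists through infinitives() reproduces entries such as ukudlala and ukusebenza without listing them by hand.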
Added: trunk/dict/zu/wordlists/classified.noun1a.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun1a.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun1a.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,32 @@
+# Class 1a nouns
+# flags:NRS
+
+
+ubaba
+umama
+umalume
+ugogo
+ubabamkhulu
+udadewethu
+udadewenu
+udadewabo
+umakoti
+umakhelwane
+udokotela
+unesi
+uthisha
+uKhisimusi
+umese
+ushizi
+ubhontshisi
+utamatisi
+unogwaja
+ubhejane
+# these are very commonly capitalised for titles, so let's put in both
+uhulumeni
+uHulumeni
+#uhulumende
+umengameli
+uMengameli
+ungqongqoshe
+uNgqongqoshe
Added: trunk/dict/zu/wordlists/classified.noun3.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun3.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun3.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,44 @@
+# Class 3 nouns
+# flags:NP
+
+umbodi
+umbane
+umkhumbi
+umsebenzi
+umshado
+ummbila
+umbhede
+umfula
+umzimba
+umlenze
+umlilo
+umlomo
+umhlane
+umgodi
+umnyaka
+umshini
+umkhuhlane
+umunwe
+umuthi
+umuzi
+umnyango
+umhlaba
+umoya
+umgwaqo
+umthetho
+umsindo
+umdlalo
+umphumela
+umsakazo
+umhlangano
+umgudu
+umbala
+umgomo
+umkhankaso
+umzamo
+umndeni
+umcimbi
+umzuzu
+umbiko
+umlayezo
+umbuzo
Added: trunk/dict/zu/wordlists/classified.noun3.other.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun3.other.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun3.other.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,15 @@
+# Class 3 nouns
+# flags:NP
+
+umusi
+umqhudelwano
+umgede
+umthunzi
+umqwayiba
+umhlabathi
+umphimbo
+uMfolozi
+umthanda
+umthandabuzo
+umthandazo
+umthando
Added: trunk/dict/zu/wordlists/classified.noun4.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun4.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun4.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,48 @@
+# Class 4 nouns
+# flags:NP
+
+
+imininingwane
+
+
+umbodi
+imibane
+imikhumbi
+imisebenzi
+imishado
+ummbila
+imibhede
+imifula
+imizimba
+imilenze
+imililo
+imilomo
+imihlane
+imigodi
+iminyaka
+imishini
+imikhuhlane
+iminwe
+imithi
+imizi
+iminyango
+imihlaba
+imimoya
+imigwaqo
+imithetho
+imisindo
+imidlalo
+imiphumela
+imisakazo
+imihlangano
+imigudu
+imibala
+imigomo
+imikhankaso
+imizamo
+imindeni
+imicimbi
+imizuzu
+imibiko
+imilayezo
+imibuzo
Added: trunk/dict/zu/wordlists/classified.noun4.other.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun4.other.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun4.other.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,7 @@
+# Class 4 nouns
+# flags:NP
+
+imithanda
+imithandabuzo
+imithandazo
+imithando
Added: trunk/dict/zu/wordlists/classified.noun5.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun5.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun5.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,32 @@
+# Class 5 nouns
+# flags:NP
+
+iso
+ilanga
+ibhuku
+ibhola
+ikati
+itiye
+ikhofi
+iphalishi
+ifasitele
+igama
+igazi
+idolobha
+izinyo
+izulu
+izwe
+izwi
+itshe
+iculo
+iphepha
+iphephandaba
+ilanga
+ikhanda
+ikhaya
+ithambo
+ikhala
+ihlombe
+isango
+iqanda
+itafula
Added: trunk/dict/zu/wordlists/classified.noun7.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun7.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun7.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,36 @@
+# Class 7 nouns
+# flags:NP
+
+isicoco
+#isizini
+
+
+isikole
+isinkwa
+isilwane
+isitofu
+isigqoko
+isithelo
+isikhathi
+isifo
+isibane
+isiboshwa
+isitini
+isifundo
+isihlalo
+isitulo
+isibhamu
+isicathulo
+isikhwama
+isikwelethi
+isikwelethu
+isibaya
+isihambi
+isisebenzi
+isiteshi
+isitimela
+isitolo
+isitsha
+isikhumba
+isangoma
+isithandwa
Added: trunk/dict/zu/wordlists/classified.noun7.other.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun7.other.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun7.other.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,8 @@
+# Class 7 nouns
+# flags:NP
+
+isithandamanzi
+isithandathu
+isithando
+isicoco
+#isizini
Added: trunk/dict/zu/wordlists/classified.noun8.other.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun8.other.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun8.other.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,3 @@
+# Class 8 nouns
+# flags: NP
+izicoco
Added: trunk/dict/zu/wordlists/classified.noun9.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun9.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun9.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,48 @@
+# Class 9 nouns
+# flags:NP
+
+# Remember that not all class 9 nouns have plurals in class 10:
+indoda
+intombazana
+intombazane
+
+# some start with im-
+imali
+imbali
+imbuzi
+imoto
+imvu
+imvula
+
+into
+indawo
+inkabi
+indlovu
+induna
+inyoka
+inyoni
+inyosi
+intaba
+indlu
+incwadi
+inja
+ingadi
+inhlanzi
+inhlanhla
+inhloko
+intamo
+indlebe
+ingalo
+inhliziyo
+intombi
+ingubo
+inyama
+ingane
+indlela
+inkomo
+insizwa
+inkosi
+intambo
+indwangu
+intuthu
+induku
Added: trunk/dict/zu/wordlists/classified.noun9.2.in
===================================================================
--- trunk/dict/zu/wordlists/classified.noun9.2.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.noun9.2.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,5 @@
+# Class 9 nouns
+# flags:NP
+
+intatheli
+imvume
Added: trunk/dict/zu/wordlists/classified.passive.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.passive.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.passive.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,85 @@
+# Passive verbs
+# We might want to generate passives from the active verbs, but haven't done so
+# yet, so we add a few common ones here. Handling palatalisation correctly
+# might be unrealistic, so we might do a lexical approach anyway. Negatives are
+# the main thing that should be handled differently from active verbs.
+# The 'Y' flag ensures they don't occur on their own.
+# flags:BCY
+
+setshenziswa
+kishwa
+boshwa
+qashwa
+shintshwa
+
+# others
+balwa
+bhalwa
+bonwa
+bongwa
+buzwa
+cabangwa
+celwa
+culwa
+dingwa
+dlalwa
+dumwa
+fanwa
+funwa
+fundwa
+gezwa
+hanjwa
+hlalwa
+hlushwa
+khalwa
+khulwa
+khulunya
+lalwa
+lethwa
+nikwa
+ngenwa
+phekwa
+phendulwa
+philwa
+phunywa
+phumulwa
+qalwa
+qedwa
+salwa
+sebenzwa
+shaywa
+sizwa
+sukwa
+thandwa
+thenjwa
+thengwa
+tholwa
+valwa
+vulwa
+xoxwa
+zalwa
+zanywa
+bekwa
+bhekwa
+bingelelwa
+buswa
+dlulwa
+dubulwa
+gqokwa
+hlanganywa
+holwa
+khathalwa
+khathazwa
+kholwa
+khumbulwa
+linywa
+nqotshwa
+pheshwa
+phuzwa
+shadwa
+sheshwa
+shishwa
+sikwa
+thandazwa
+thayishwa
+tshelwa
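The header comment of this file weighs generating passives from active verbs against a lexical approach for palatalisation. A minimal sketch that combines the two, assuming the regular passive replaces the final -a with -wa and that palatalised forms are looked up in a small table. The table entries are pairs that appear in the verb and passive lists of this commit; the function names are illustrative, and monosyllabic stems are not covered.

```python
# Lexical overrides for stems whose passive shows palatalisation.
PALATALISED = {
    "hamba": "hanjwa",
    "themba": "thenjwa",
    "phuma": "phunywa",
    "zama": "zanywa",
    "lima": "linywa",
    "nqoba": "nqotshwa",
}


def passive(stem, overrides=PALATALISED):
    """Return the passive of an active verb stem ending in -a."""
    if stem in overrides:
        return overrides[stem]
    if not stem.endswith("a"):
        raise ValueError("expected a verb stem ending in -a: %r" % stem)
    return stem[:-1] + "wa"  # bala -> balwa


def missing_passives(stems, known_passives):
    """List (stem, generated) pairs whose generated form is not in the known list."""
    known = set(known_passives)
    return [(stem, passive(stem)) for stem in stems if passive(stem) not in known]
```

Running missing_passives() with the active verb list against this file would point out stems that still need an override or a closer look.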
Added: trunk/dict/zu/wordlists/classified.relative.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.relative.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.relative.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,38 @@
+# relatives
+# also known as qualificative noun with a fossilised noun class prefix
+# flags: VY
+
+manzi
+lukhuni
+buhlungu
+lula
+mnandi
+munyu
+luhlaza
+liphuzi
+mhlophe
+ngcwele
+qotho
+#ngaka
+#ngakanana
+#ngakanani
+ngaki
+#ngako
+banzi
+makhaza
+mtoti
+ngcono
+nzima
+bomvu
+nsundu
+mnyama
+
+bukhali
+muncu
+nqunu
+mpofu
+buthuntu
+duma
+mpisholo
+mpunga
+ngwevu
Added: trunk/dict/zu/wordlists/classified.verb.1.in
===================================================================
--- trunk/dict/zu/wordlists/classified.verb.1.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.verb.1.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,85 @@
+# Verbs.
+# This list should probably not include statives, auxiliaries or any deficient
+# verbs. Currently monosyllabic verbs are in a separate file.
+# We should probably separate transitive, etc. for the sake of object morphemes.
+# flags:BC
+
+# others
+bala
+bamba
+bhala
+bona
+bonga
+buya
+buza
+cabanga
+cela
+cula
+dinga
+dlala
+duma
+faka
+fana
+funa
+funda
+geza
+hamba
+hlala
+hleka
+hlupha
+khala
+khetha
+khula
+khuluma
+lala
+letha
+nika
+ngena
+pheka
+phendula
+phila
+phuma
+phumula
+qala
+qeda
+sala
+sebenza
+shaya
+siza
+suka
+thanda
+themba
+thenga
+thola
+vala
+vula
+xoxa
+zala
+zama
+beka
+bheka
+bingelela
+busa
+dlula
+dubula
+gqoka
+hlangana
+hola
+khathala
+khathaza
+khola
+khumbula
+lima
+nqoba
+phepha
+phuza
+shada
+shesha
+shisha
+sika
+thandaza
+thayipha
+tshela
+thuma
+wina
+luma
Added: trunk/dict/zu/wordlists/classified.verb.other.in
===================================================================
--- trunk/dict/zu/wordlists/classified.verb.other.in (rev 0)
+++ trunk/dict/zu/wordlists/classified.verb.other.in 2009-05-25 22:44:59 UTC (rev 9768)
@@ -0,0 +1,20 @@
+# Verbs.
+# This list contains new verb stems or ones that should still be reviewed.
+# Check the other files for more information.
+# flags:BC
+
+# intransitives that we probably should not handle the same way
+thandabuka
+thandabula
+thandabuza
+
+
+############
+
+thandana
+thandazela
+thandela
+thandeka
+qapha
+vika
+vikela