-
Ok, thanks,
I resolved it!
2009-11-10 16:54:08 UTC by Massimeddu
-
Why is this solution not good for you:
acapia.aff:
SFX A Y 1
SFX A o a·mi·ddu
acapia.dic:
1
acapio/A
Result:
en@anonymous:~/dtest/perl/nevszo$ /home/en/program/hunspell-1.2.8/src/tools/unmunch /home/en/tmp/download/spell/qu_EC/acapia.dic /home/en/tmp/download/spell/qu_EC/acapia.aff
parsing line: SFX A Y 1
parsing A entries 1
stable 0 num is 1 flag A
parsed in 0 prefixes and 1...
2009-11-10 13:52:35 UTC by tyuk
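A minimal Python sketch of what a suffix rule such as `SFX A o a·mi·ddu` does to the stem `acapio`, assuming the usual reading of an SFX rule line as flag, characters to strip, characters to add. The function name `apply_suffix` is hypothetical, and this is not how hunspell or unmunch works internally; it only illustrates the string operation.

```python
from typing import Optional


def apply_suffix(stem: str, strip: str, add: str) -> Optional[str]:
    """Strip `strip` from the end of `stem`, then append `add`.

    Returns None when the stem does not end with the characters to strip.
    A strip value of "0" or "" means nothing is removed.
    """
    if strip in ("", "0"):
        return stem + add
    if not stem.endswith(strip):
        return None
    return stem[: -len(strip)] + add


# Rule from the post: SFX A o a·mi·ddu  (strip "o", add "a·mi·ddu")
print(apply_suffix("acapio", "o", "a·mi·ddu"))  # prints acapia·mi·ddu
```

With this rule the single dictionary entry `acapio/A` yields the target form directly, without any compounding.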
-
Hi all,
I'm trying to write a dictionary that can recognize words like:
acapia·mi·ddu
where "acapia" is a verb with a suffix and mi and ddu are two clitics.
The .dic file looks like this:
4
·/·
acapio/A
mi/B
ddu/C
The .aff file looks like this:
SET UTF-8
FLAG UTF-8
COMPOUNDMIN 1
WORDCHARS ·...
2009-11-09 16:25:04 UTC by Massimeddu
-
Hi all,
I'm trying to write a dictionary that can recognize words like:
acapia·mi·ddu
where "acapia" is a verb with a suffix and mi and ddu are two clitics.
The .dic file looks like this:
4
·/·
acapio/A
mi/B
ddu/C
The .aff file looks like this:
SET UTF-8
FLAG UTF-8
COMPOUNDMIN 1
WORDCHARS ·
COMPOUNDPERMITFLAG D...
2009-11-09 16:18:37 UTC by Massimeddu
-
Thanks for the reply.
Yes, the dic file contains the nominal forms of the words, but I thought a test file would contain some agglutinated forms (the more depth, the better).
I tried to use unmunch to create some tests; however, I got this:
adorachikuna//+
adorachikuni//+
adorachikunki//+
adorachikun//+
adorachikunchik//+
adorachikunkichik//+
...
adorachiwashkakuna//+...
2009-11-08 22:22:54 UTC by tyuk
-
hi tyuk
Thanks a lot for your comments. I have to say that I've not gotten very far with this project yet, but in a few months I'm moving to Ecuador and that might just speed up the progress and my own comprehension of the language.
Well, as for the scripting part, I actually use a bash script, generateQUdicfile.sh, to generate the .dic file.
For the use of compound: It's because we found...
2009-11-08 19:46:15 UTC by arnotixe
-
To arnotixe:
I checked your aff/dic list. It is very interesting. I wonder whether your use of compounding:
1. increases processing time unnecessarily
2. makes the whole thing inflexible.
It would be good if you added some tests to it: words that should be considered bad, and words that should be considered correct. If I had that, I could make some tests that prove or...
2009-11-07 20:24:01 UTC by tyuk
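A rough Python sketch of the kind of test suggested above: one list of words that must be accepted and one list of words that must be rejected, run against some spell checker. Everything here is an assumption for illustration: `run_word_tests` is a hypothetical helper, `check` stands in for whatever spell-check call is actually available, and the sample words are synthetic placeholders, not real dictionary entries.

```python
from typing import Callable, Iterable, List, Tuple


def run_word_tests(
    check: Callable[[str], bool],
    good: Iterable[str],
    bad: Iterable[str],
) -> Tuple[List[str], List[str]]:
    """Return (good words wrongly rejected, bad words wrongly accepted)."""
    false_rejects = [word for word in good if not check(word)]
    false_accepts = [word for word in bad if check(word)]
    return false_rejects, false_accepts


# Stand-in checker: a plain set lookup (placeholder for a real spell checker).
accepted = {"stem", "stems", "stemmed"}
rejects, accepts = run_word_tests(
    accepted.__contains__,
    good=["stem", "stems", "stemming"],
    bad=["stemz", "stemmed"],
)
print("wrongly rejected:", rejects)
print("wrongly accepted:", accepts)
```

Running the same two lists against a compounding-based and an affix-only dictionary would show which of the two accepts more incorrect words.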
-
I wonder why you do not:
1. choose one dialect with a relatively fixed orthography (do not handle multiple orthographies at first, or at least not many of them).
2. use strictly only the prefix and suffix features. Do not use compound words and reps at first.
3. use scripting (perl, awk, java) to create your affix file. At this complexity it is hopeless to do manually.
This is the way, Hungarian spell...
2009-11-06 15:16:38 UTC by tyuk
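A small Python sketch of the scripting idea in point 3: generating one block of SFX rules from a list of endings instead of typing them by hand. It assumes the standard affix-file layout (a header line `SFX flag cross_product count`, followed by one `SFX flag strip add condition` line per rule). The helper name `make_suffix_block` and the endings are placeholders, not real morphology.

```python
from typing import Sequence


def make_suffix_block(
    flag: str,
    strip: str,
    endings: Sequence[str],
    cross_product: bool = True,
) -> str:
    """Build the text of one SFX class for a hunspell-style .aff file."""
    # Header: flag, cross-product marker (Y/N), number of rules that follow.
    header = f"SFX {flag} {'Y' if cross_product else 'N'} {len(endings)}"
    # "0" means "strip nothing"; "." is the match-anything condition.
    rules = [f"SFX {flag} {strip or '0'} {ending} ." for ending in endings]
    return "\n".join([header, *rules])


# Placeholder endings, for illustration only.
print(make_suffix_block("A", "", ["ni", "nki", "n"]))
```

The count in the header is derived from the list, so it cannot drift out of sync with the rules, which is one of the easier mistakes to make when editing a large affix file by hand.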
-
Hi
I'm developing a Kichwa dictionary for the Ecuadorian shukllachiska Kichwa, a nationwide unified orthography. You can get it here:
http://extensions.services.openoffice.org/files/2121/3/qu_EC.oxt
It's actually a zip file, so you can open it with file-roller (Ubuntu) or perhaps http://www.7-zip.org/ if you use Windows.
There you can see some of my reasoning. Perhaps there are better...
2009-11-06 10:43:24 UTC by arnotixe
-
Thanks, László. That is where I got hyphen_en_US.dic. Are you saying that this library must be built for hunspell? Is it called and used independently of hunspell? Documentation on usage is a little thin, especially for a beginner. Any direction you can give is appreciated. Thanks so much.
2009-10-29 21:27:09 UTC by ptrace1