-
Why does this solution not work for you?
acapia.aff:
SFX A Y 1
SFX A o a·mi·ddu
acapia.dic:
1
acapio/A
Result:
en@anonymous:~/dtest/perl/nevszo$ /home/en/program/hunspell-1.2.8/src/tools/unmunch /home/en/tmp/download/spell/qu_EC/acapia.dic /home/en/tmp/download/spell/qu_EC/acapia.aff
parsing line: SFX A Y 1
parsing A entries 1
stable 0 num is 1 flag A
parsed in 0 prefixes and 1...
2009-11-10 13:52:35 UTC in Hunspell
-
Thanks for the reply.
Yes, the dic file contains the nominal forms of the words, but I thought a test file would contain some agglutinated forms (the more depth, the better).
I tried to use unmunch to create some tests; however, I got this:
adorachikuna//+
adorachikuni//+
adorachikunki//+
adorachikun//+
adorachikunchik//+
adorachikunkichik//+
...
adorachiwashkakuna//+...
2009-11-08 22:22:54 UTC in Hunspell
-
To arnotixe:
I checked your aff/dic list. It is very interesting. I am not sure whether your use of compounding does not:
1. unnecessarily increase the time needed
2. make the whole thing inflexible.
It would be good if you added some tests to it: words that should be rejected as incorrect, and words that should be accepted as correct. If I had that, I could run some tests that prove or...
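To illustrate the kind of test I mean, here is a minimal sketch in Python. It assumes you have some function that says whether a word is accepted; `toy_check` below is only a stand-in for a real spell checker, and the word lists are made-up samples, not taken from your dictionary. As far as I know, Hunspell's own test suite follows the same idea with paired `.good` and `.wrong` word lists.

```python
def run_wordlist_tests(check, good, wrong):
    """Return a list of (word, problem) pairs; an empty list means all passed.

    check: callable taking a word and returning True if it is accepted.
    good:  words that should be accepted.
    wrong: words that should be rejected.
    """
    failures = []
    for word in good:
        if not check(word):
            failures.append((word, "rejected, but should be accepted"))
    for word in wrong:
        if check(word):
            failures.append((word, "accepted, but should be rejected"))
    return failures


# Toy stand-in for a real spell checker (assumption): accepts a fixed set only.
accepted = {"wasi", "wasikuna"}
toy_check = accepted.__contains__

# Made-up sample lists, for illustration only.
failures = run_wordlist_tests(
    toy_check,
    good=["wasi", "wasikuna"],
    wrong=["wasix", "kunawasi"],
)
for word, problem in failures:
    print(word, problem)
```

With such lists in place, two variants of the aff/dic files (with and without compounding) can be compared on the same words.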
2009-11-07 20:24:01 UTC in Hunspell
-
I wonder why you do not:
1. choose one dialect with a relatively fixed orthography (do not handle multiple orthographies at first, or at least not many of them).
2. strictly use only the prefix and suffix features. Do not use compound words and REP entries at first.
3. use scripting (Perl, awk, Java) to create your affix file. At this complexity it is hopeless to do it manually.
This is the way, Hungarian spell...
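As a rough sketch of point 3, here it is in Python (any scripting language would do). The helper name `sfx_block` and the sample rules are hypothetical and only for illustration. The script renders one suffix class in the usual aff layout: a header line with the flag, the cross-product switch and the rule count, then one line per rule with the stripped characters, the added suffix and a condition.

```python
def sfx_block(flag, rules, cross_product=True):
    """Render one SFX class as aff-file text.

    rules: list of (strip, add, condition) tuples.
           Use "0" for "strip nothing" and "." for "no condition".
    """
    header = f"SFX {flag} {'Y' if cross_product else 'N'} {len(rules)}"
    body = [f"SFX {flag} {strip} {add} {cond}" for strip, add, cond in rules]
    return "\n".join([header] + body)


# Hypothetical sample rules, not taken from any real affix file.
rules = [
    ("0", "kuna", "."),
    ("0", "ta", "."),
    ("0", "pi", "."),
]
aff_text = sfx_block("A", rules)
print(aff_text)
```

The point is that the rule count in the header and the rule lines always stay in sync, which is easy to get wrong when hundreds of rules are edited by hand.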
2009-11-06 15:16:38 UTC in Hunspell
-
en@anonymous:~/program/humorph$ echo cica | ./chmorph hu_HU_gen.aff hu_HU_gen.dic /dev/stdin NOM ELA
< st:cica po:noun ts:ELA>
cicából
en@anonymous:~/program/humorph$ echo Afrika | ./chmorph hu_HU_gen.aff hu_HU_gen.dic /dev/stdin NOM ELA
Segmentation fault.
2009-10-26 15:29:08 UTC in Hunspell
-
en@anonymous:~/program/humorph$ echo apa | ./chmorph hu_HU_gen.aff hu_HU_gen.dic /dev/stdin NOM "ts:NOM is:POSS_SG_1 is:NOM"
< st:apa po:noun ts:ts:NOM is:POSS_SG_1 is:NOM al:apjáék al:ap>
apám
en@anonymous:~/program/humorph$ echo báty | ./chmorph hu_HU_gen.aff hu_HU_gen.dic /dev/stdin NOM "ts:NOM is:POSS_SG_1 is:NOM"
báty.
2009-10-21 11:25:46 UTC in Hunspell
-
echo agyrém |./chmorph hu_HU_gen.aff hu_HU_gen.dic /dev/stdin NOM DAT
n=<agyrém> pln=1
i=0 pli:< pa:agy st:agy po:noun ts:DAT al:agyak pa:rém st:rém po:noun ts:NOM>
agyrém
en@anonymous:~/program/humorph$ echo rém |./chmorph hu_HU_gen.aff hu_HU_gen.dic /dev/stdin NOM DAT
rémnek.
2009-10-21 10:46:08 UTC in Hunspell
-
echo e_mail | ./chmorph hu_HU_gen.aff hu_HU_gen.dic /dev/stdin NOM ACC
This command loops forever.
If this is fixed, please give the exact version number of the working hunspell. Thanks.
2009-10-19 13:29:05 UTC in Hunspell
-
I use magyarispell 1.5 and hunspell 1.2.8 on Debian Linux. Here is what I get:
$ echo books | ./analyze en_US.aff en_US.dic /dev/stdin
> books
analyze(books) = st:book ts:0 is:Ns
analyze(books) = st:book ts:0 is:Vs
stem(books) = book
$echo boys | ./analyze en_US.aff en_US.dic /dev/stdin
> boys
analyze(boys) = st:boy ts:0 is:Ns
stem(boys) = boy
$ echo book boys |./analyze...
2009-10-15 21:36:30 UTC in Hunspell
-
It also crashes with "Indokína".
2009-09-30 08:49:46 UTC in Hunspell