From: Gilles D. <gr...@sc...> - 2004-06-08 21:23:46
According to Dominique Arpin:
> I will install htdig 3.2 beta6 and I will try this patch.

Your french.aff file only defines altstringchar entries for tex, not for
latin1, so you shouldn't need the patch I mentioned. As far as I can
tell, the patch won't make any difference for your affix file.

> here my config:
>
> endings_affix_file: ${lang_dir}/french.aff
> endings_dictionary: ${lang_dir}/french.0
> endings_root2word_db: ${common_dir}/root2wordfr.db
> endings_word2root_db: ${common_dir}/word2rootfr.db
>
> You can see a copy of the files on:
>
> http://darwin.espacecourbe.com/~dominique/

The problem is in the french.0 file. You'll need to add an entry:

acouphène/X

in the appropriate spot, and then run "htfuzzy endings". Do the same for
any other word that doesn't pluralize properly, if the word doesn't appear
in french.0.

> thanks
>
>
> > According to Lachlan Andrew:
> >> Greetings Dominique,
> >>
> >> I have tried to reproduce your problem (as I understood it), but
> >> can't. Several possibilities come to mind:
> >> 1. You are (as Gilles suggested) relying on the fuzzy rule "accents"
> >> rather than explicitly entering the accent into the query. In
> >> this case, you are out of luck.
> >> 2. Your endings_dictionary file doesn't contain the words with
> >> actual accents.
> >> 3. Your endings_dictionary has the accents, but encoded as
> >> multi-byte unicode sequences. Currently, ht://Dig doesn't
> >> support unicode.
> >> In either case 2 or case 3, the solution is to replace the entries in
> >> your endings_dictionary file with the single-byte latin1 (not
> >> unicode) accents.
> >>
> >> Do any of these cases apply?
> >
> > Your 3rd possibility brings to mind a 4th one I heard about a few years
> > ago. Some ispell affix files make use of "altstringchar" to define a
> > sequence of ASCII characters that can be used in the dictionary file to
> > represent an accented character. If Dominique's francais.0 dictionary
> > uses these, that could be the problem.
> >
> > There was a patch posted to the mailing list back in June of 2000, which
> > added a hack to the endings algorithm to support these, for latin1 only.
> > The patch was for 3.1.5, so I don't know how well it'll work for 3.1.6 or
> > the 3.2 betas. For some reason, it never made it into the patch archive,
> > but it's available here:
> >
> > http://www.mail-archive.com/ht...@ht.../msg05248.html
> >
> > The only way to know for sure which of the 4 possibilities is the correct
> > one would be to look at the dictionary and affix file Dominique used to
> > generate the endings database.
> >
> > --
> > Gilles R. Detillieux E-mail: <gr...@sc...>
> > Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/
> > Dept. Physiology, U. of Manitoba Winnipeg, MB R3E 3J7 (Canada)
> >
>
>
> --
> Dominique Arpin_______________________[ Espace
> administrateur réseau Courbe ]
>
> http://www.espacecourbe.com/
> téléphone 514.933.9861
> télécopieur 514.933.9546
>

--
Gilles R. Detillieux E-mail: <gr...@sc...>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/
Dept. Physiology, U. of Manitoba Winnipeg, MB R3E 3J7 (Canada)
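
As a rough illustration of the dictionary fix described in the reply (add a missing root such as acouphène/X to french.0, then rebuild with "htfuzzy endings"), here is a minimal Python sketch. The helper names (root_of, ensure_entry, update_dictionary) are hypothetical and not part of ht://Dig. The sketch assumes that "the appropriate spot" means plain sorted order and that the file is latin1 with one word/FLAGS entry per line; neither assumption is confirmed in the thread.

```python
import bisect


def root_of(line):
    """Return the root word of an ispell-style dictionary line ("word/FLAGS")."""
    return line.strip().split("/", 1)[0]


def ensure_entry(lines, word, flags="X"):
    """Return (new_lines, added); add "word/flags" only if the root is absent."""
    roots = [root_of(line) for line in lines]
    if word in roots:
        # The root already appears in the dictionary, so leave it untouched.
        return list(lines), False
    # Assumption: entries are kept in plain code-point order.
    pos = bisect.bisect_left(roots, word)
    return lines[:pos] + [f"{word}/{flags}"] + lines[pos:], True


def update_dictionary(path, word, flags="X"):
    """Rewrite the dictionary at `path` in place; return True if a line was added."""
    # latin-1 stores each accented letter as a single byte, matching the
    # advice in the thread to avoid multi-byte unicode sequences.
    with open(path, encoding="latin-1") as f:
        lines = f.read().splitlines()
    new_lines, added = ensure_entry(lines, word, flags)
    if added:
        with open(path, "w", encoding="latin-1") as f:
            f.write("\n".join(new_lines) + "\n")
    return added
```

A call such as update_dictionary("french.0", "acouphène") would be followed by running "htfuzzy endings" to regenerate the endings databases; work on a backup copy of the dictionary first.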