Thread: [Fulltextsearch-devel] Spanish StopList
Status: Beta
Brought to you by:
tjmather
Menu
▾
▴
fulltextsearch-devel
From: Josep R. B. <jr...@ca...> - 2002-12-19 23:29:12
|
Hi all, I've found that the default spanish stop list fails when creating the table giving a duplicate key error. It's due to the fact that mySql does not distinguish between words with tildes and words without it, eg. cómo and como (not sure if you will see the example words, it's in Latin-1 character set). So we should index stoplist allowing duplicates, which will be a pitty, or modify the table creation statement to create the table with the "binary" modifier; this last options means that we must be sure that all lookups on the stoplist table are done using always the same case (lower or upper) to avoid not recognizing stop words. What do you think that will be the best solution? I can do the required modifications and submit a patch if you want, when the best option is decided. TIA. Josep Ruano Bou IT Manager - CAPSiDE, IT consultants jr...@ca... Cell Phone +34 653 665 290 Phone +34 934 266 731 *************************************************************************** DISCLAIMER: Este mensaje contiene información propietaria de la cual parte o toda puede contener información confidencial o protegida legalmente. Esta exclusivamente destinado al usuario de destino. Si, por un error de envio o transmisión, ha recibido este mensaje y usted no es el destinatario del mismo, por favor, notifique de este hecho al remitente y borre el mensaje. Si no es el destinatario final de este mensaje no debe usar, informar, distribuir, imprimir, copiar o difundir este mensaje bajo ningún medio. --------- DISCLAIMER: This e-mail contains propietary information some or all of which may be legally privileged. It is for the intended recipient only. If an addressing or transmission error has misdirected this e-mail, please notify the author by replying to this e-mail and then delete the email. If you are not the intended recipient you must not use, disclose, distribute, copy, print or rely on this e-mail. *************************************************************************** |