Hi,
I have little success utilizing the tool lm3g2dmp on french words which have characters such as , , etc. Even if the dump file can be created, the recognizer (sphinx 4) fail to read it. I suspect the tool lm3g2dmp doesn't create the file correctly. Anyone has any success using this tool with french words? Thank you.
Bonjour,
J'ai quelques difficults en utilisant l'outil lm3g2dmp avec les mots franais qui contient un des caractres spciales comme , , , ... Mme si je construis le fichier 'dump', le recognizeur (sphinx 4) ne peut pas reconnatre ces mots. Est-ce qu'il y as quelq'un qui a une solution? Merci beaucoup.
TP
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I ended up converting the non-ASCII characters to smth ASCII, e.g. numbers. Or you can use you some other encoding mechanism that gets rid of the non-ASCII characters. Of course, you have to convert them back to the french characters after decoding.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi,
I have little success utilizing the tool lm3g2dmp on french words which have characters such as , , etc. Even if the dump file can be created, the recognizer (sphinx 4) fail to read it. I suspect the tool lm3g2dmp doesn't create the file correctly. Anyone has any success using this tool with french words? Thank you.
Bonjour,
J'ai quelques difficults en utilisant l'outil lm3g2dmp avec les mots franais qui contient un des caractres spciales comme , , , ... Mme si je construis le fichier 'dump', le recognizeur (sphinx 4) ne peut pas reconnatre ces mots. Est-ce qu'il y as quelq'un qui a une solution? Merci beaucoup.
TP
See https://sourceforge.net/tracker/index.php?func=detail&aid=998796&group_id=1904&atid=351904
I ended up converting the non-ASCII characters to smth ASCII, e.g. numbers. Or you can use you some other encoding mechanism that gets rid of the non-ASCII characters. Of course, you have to convert them back to the french characters after decoding.