Hi my name is Gaby Im mexican and I need help, I read the manuals and tutorials for the sphinx but I need more help,
I read the messegaes write Omar, and I need a similar system, but my system is only for two words: ADELANTE ATRAS. I will train a continous acustic model for the Sphinx 4, but need more help for the acustic model, for example:
I dont know what is the sintax for the transcrip file, if only I want to recognize 2 words, and I dont have idea for the Sound files.This means that: What need talk and how many files sounds I need?.
I think that in the Soun file many persons need record many ocurrences of the two words, because the system need be speaker independent, for example:
ADELANTE ATRAS ADELANTE ATRAS ADELANTE ATRAS
How long need be each file sound?
I only need this system, and think that you make before a similar system, can you help me?
Hoe is the structure of the phone list for this type of system, because is not dependent of context, I think that the only two phones in the phone list must be ADELANTE ATRAS (plus SIL), Am I right?
Can you show me an example for this type of system, I think that the TDIGITs is more complex and I only need to recognize two words
I read the Tiny doc for the scripts_pl and I am not secure if need run all the script, I read in this forum and found that only need run 00 and 02, Is that rigth? and The Sphinx 4 work fine only with this type of acustic model?
Im using linux, and I recorrd 50 utterances (tree people record this), the structure of each utterance (transcription file) is:
Hi my name is Gaby Im mexican and I need help, I read the manuals and tutorials for the sphinx but I need more help,
I read the messegaes write Omar, and I need a similar system, but my system is only for two words: ADELANTE ATRAS. I will train a continous acustic model for the Sphinx 4, but need more help for the acustic model, for example:
I dont know what is the sintax for the transcrip file, if only I want to recognize 2 words, and I dont have idea for the Sound files.This means that: What need talk and how many files sounds I need?.
I think that in the Soun file many persons need record many ocurrences of the two words, because the system need be speaker independent, for example:
ADELANTE ATRAS ADELANTE ATRAS ADELANTE ATRAS
How long need be each file sound?
I only need this system, and think that you make before a similar system, can you help me?
Hoe is the structure of the phone list for this type of system, because is not dependent of context, I think that the only two phones in the phone list must be ADELANTE ATRAS (plus SIL), Am I right?
Can you show me an example for this type of system, I think that the TDIGITs is more complex and I only need to recognize two words
I read the Tiny doc for the scripts_pl and I am not secure if need run all the script, I read in this forum and found that only need run 00 and 02, Is that rigth? and The Sphinx 4 work fine only with this type of acustic model?
Im using linux, and I recorrd 50 utterances (tree people record this), the structure of each utterance (transcription file) is:
<s> ADELANTE ATRAS ADELANTE ATRAS ADELANTE ATRAS ADELANTE ATRAS ADELANTE ATRAS <s> (archivo01)
...
...
<s> ADELANTE ATRAS ADELANTE ATRAS ADELANTE ATRAS ADELANTE ATRAS ADELANTE ATRAS <s> (archivo50)
The phone list is:
ADELANTE
ATRAR
SIL
The main dictionary is
ADELANTE ADELANTE
ATRAS ATRAS
The fill dictionario is
<s> SIL
</sil> SIL
SIL SIL
The control file is (The scripy 00 shows a error in the sintay of this file, I think need more things Is rigth?):
archivo01
archivo02
....
archivo50
Thanks a lot, Gaby