i've a question about the silence length during the training...how long is the SIL phoneme? for example, if i'm training the sentence <s> TRE <sil> TRE <sil> TRE </s>, and the first <sil> lasts for 10ms while the second <sil> lasts for 30ms, is the training ok?
how can i specify different sil length during the training?
truly, I don't know if this question makes sense....
thanks to all!
p.s. I'm use Sphinx4 environment
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
i've a question about the silence length during the training...how long is the SIL phoneme? for example, if i'm training the sentence <s> TRE <sil> TRE <sil> TRE </s>, and the first <sil> lasts for 10ms while the second <sil> lasts for 30ms, is the training ok?
how can i specify different sil length during the training?
truly, I don't know if this question makes sense....
thanks to all!
p.s. I'm use Sphinx4 environment