I tried to train a new model for the German language. After I got errors which
I suppose were UTF8-Decoding problems (ä, ö, ü weren't recognized as such in
the dictionary) I minimized everything to one single transcription-sentence
and the corresponding words in the dictionary. What I got after running :
Okay, I found the answer. The search engine is your friend ;)
Can somebody give me hints on how to train an existing language model with new
audio data (acoustic model). In my instance I want to train an existing 16khz
model with 8khz audio data, but I don't see an advantage over creating a
totally new model despite I can use the existing language model.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
how to train an existing language model with new audio data (acoustic
model). In my instance I want to train an existing 16khz model with 8khz audio
data
It's easier to train 8khz model from scratch.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I am trainging the TIdigits data and I am getting the same error. I have
sufficient data for training if I compare it with the an4 corpus.
How did you solve this problem?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hey there,
I tried to train a new model for the German language. After I got errors which
I suppose were UTF8-Decoding problems (ä, ö, ü weren't recognized as such in
the dictionary) I minimized everything to one single transcription-sentence
and the corresponding words in the dictionary. What I got after running :
was the error on the top. I suppose it is because I gave not enough
observation material ?
Okay, I found the answer. The search engine is your friend ;)
Can somebody give me hints on how to train an existing language model with new
audio data (acoustic model). In my instance I want to train an existing 16khz
model with 8khz audio data, but I don't see an advantage over creating a
totally new model despite I can use the existing language model.
It's easier to train 8khz model from scratch.
Hi mikadomoses!
I am trainging the TIdigits data and I am getting the same error. I have
sufficient data for training if I compare it with the an4 corpus.
How did you solve this problem?
Sorry i forgot to add, how did you solve the first error that you have posted.