I'm trying to train a acoustic model with force-aligned transcripts but, I'm
confused about the steps should I take to do it. In the sphinx_train.cfg there
are some properties that refers to this task but, the descriptions provided in
this file were not sufficient to make me understand what should i do. For
example, i can not realize the difference between the properties
falign_ci_mgau and ci_mgau. In short, could you suggest some tutorial,
documentation or information to take me off from the darkness?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
falign_ci_mgau enables multiple gaussian models on forced alignment stage 10
and 11. ci_mgau enables multiple gaussian models on stage 20. That is needed
if you want to tran CI multiple gaussian models for small vocabulary task.
To understand what is gaussian and what are multiple gaussians you can check
any book on HMM-based speech recognition.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Thanks, I read about what you suggested me and now I am trying to train a
model with forced aligned transcripts but I've got a error on the
03.force_align step.
The codDef-art001a is the first transcription in my train.fileids file, and
that is not the first time I'm using it in a train, so to the others programs
that use it there is not any problem with it. Well, the fact is: I do not know
what is happening.
The aligner fails to build HMM sequence for your transcription.
Aligner might be not that robust to newlines, spaces and other characters. For
example to UTF-8 BOM symbols and so on. Maybe you want to check the
transcription file and the dictionary to find out if there are any issues like
that.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I'm trying to train a acoustic model with force-aligned transcripts but, I'm
confused about the steps should I take to do it. In the sphinx_train.cfg there
are some properties that refers to this task but, the descriptions provided in
this file were not sufficient to make me understand what should i do. For
example, i can not realize the difference between the properties
falign_ci_mgau and ci_mgau. In short, could you suggest some tutorial,
documentation or information to take me off from the darkness?
falign_ci_mgau enables multiple gaussian models on forced alignment stage 10
and 11. ci_mgau enables multiple gaussian models on stage 20. That is needed
if you want to tran CI multiple gaussian models for small vocabulary task.
To understand what is gaussian and what are multiple gaussians you can check
any book on HMM-based speech recognition.
Thanks, I read about what you suggested me and now I am trying to train a
model with forced aligned transcripts but I've got a error on the
03.force_align step.
This is the last INFO given in the log:
but, it shows a error before:
The codDef-art001a is the first transcription in my train.fileids file, and
that is not the first time I'm using it in a train, so to the others programs
that use it there is not any problem with it. Well, the fact is: I do not know
what is happening.
Here is entire log:
Thanks
The aligner fails to build HMM sequence for your transcription.
Aligner might be not that robust to newlines, spaces and other characters. For
example to UTF-8 BOM symbols and so on. Maybe you want to check the
transcription file and the dictionary to find out if there are any issues like
that.
Thanks for the replay, I will check my transcriptions.