I am performing "Adapting the acoustic model" as explained in the tutorial.
I am using continuous model while executing bw command and map_adapt.exe.
The matrices created are of the following dimensions:
Means, Variances, Mixture weights, Gaussian densities: are all 5138 X 1 X 32
As I understand, 5138 is number of senones. What is this 32? The feature vector's dimension is 13 + 13 +13 = 39 right?
Also, the MLLR Matrix is 36 X 36. How?
Can somebody explain please? Thank you.
Balaji.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hello,
I am performing "Adapting the acoustic model" as explained in the tutorial.
I am using continuous model while executing bw command and map_adapt.exe.
The matrices created are of the following dimensions:
Means, Variances, Mixture weights, Gaussian densities: are all 5138 X 1 X 32
As I understand, 5138 is number of senones. What is this 32? The feature vector's dimension is 13 + 13 +13 = 39 right?
Also, the MLLR Matrix is 36 X 36. How?
Can somebody explain please? Thank you.
Balaji.
32 is number of gaussians per senone. Each GMM model is a mixture of gaussians.
Yes
Before processing features are multiplied by LDA matrix feature_transform (39x36), so the dimension becomes 36.
Thank you.
Hello Nickolay,
In this thread, earlier you wrote:
Any justification for the number of gaussians being 32? Can it be more or less?
Similarly, in feature transorm - the number of features (feature vector's size) is 39. But, how is this 36 chosen.
Thank you.
Last edit: Balaji 2019-09-24