I have an .mfc file encoded by sphinx_fe. Now next step, I want to segment the audio with LIUM SpkDiarization, using this .mfc file as input.
Trying LIUM options -fInputDesc sphinx doesn't quite help, because I receive the following complaint:
after a few quantities have evauated to NaN. Full output here: http://pastebin.com/Tp1EK9EZ
I speculate that I have provided the wrong parameters. The parameter list is long (default seems to be audio2sphinx,1:1:0:0:0:0,13,0:0:0:0) and I have no clue what all these should be for a feature file from sphinx_fe. Does anybody know?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Sorry for the confusion, indeed I have set --fInputDesc sphinx. (See link above for the exact command line I used.) But there are 11 other parameters to set there. I have no clue here. It says they corresponds to energy, delta, umber of features, ... Any idea which ones are important? Because if I use default for them I obtain the exception cited above.
Last edit: Emil Lundh 2016-04-26
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
hm, I think the right option should be not simply sphinx but something like
sphinx,1:1:0:0:0:0,13,0:0:0:0
Overall, I'm quite unhappy with LIUM, you'd better start writing code from scratch instead of looking on it. It is extremely hard to configure LIUM properly.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Thanks Nickolay, I guess I'll just experiment along. After all, I'm not really up to writing my own speech segmentation tool! (Were there plans for writing a segmentation library within CMU Sphinx?)
I could also redesign my workflow to give LIUM a wav file, but that feels suboptimal for a number of (internal) reasons...
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I have an .mfc file encoded by sphinx_fe. Now next step, I want to segment the audio with LIUM SpkDiarization, using this .mfc file as input.
Trying LIUM options -fInputDesc sphinx doesn't quite help, because I receive the following complaint:
after a few quantities have evauated to NaN. Full output here: http://pastebin.com/Tp1EK9EZ
I speculate that I have provided the wrong parameters. The parameter list is long (default seems to be audio2sphinx,1:1:0:0:0:0,13,0:0:0:0) and I have no clue what all these should be for a feature file from sphinx_fe. Does anybody know?
I believe audio2sphinx is for PCM input. As described here:
http://www-lium.univ-lemans.fr/diarization/doku.php/commun_parameter
it probably should be just "sphinx".
Sorry for the confusion, indeed I have set --fInputDesc sphinx. (See link above for the exact command line I used.) But there are 11 other parameters to set there. I have no clue here. It says they corresponds to energy, delta, umber of features, ... Any idea which ones are important? Because if I use default for them I obtain the exception cited above.
Last edit: Emil Lundh 2016-04-26
To clarify: Setting --fInputDesc sphinx does NOT help; not alone.
Probably one has to change some more settings. The question is which.
hm, I think the right option should be not simply sphinx but something like
Overall, I'm quite unhappy with LIUM, you'd better start writing code from scratch instead of looking on it. It is extremely hard to configure LIUM properly.
Thanks Nickolay, I guess I'll just experiment along. After all, I'm not really up to writing my own speech segmentation tool! (Were there plans for writing a segmentation library within CMU Sphinx?)
I could also redesign my workflow to give LIUM a wav file, but that feels suboptimal for a number of (internal) reasons...