I have some question about MFC file format in Sphinx.
I want to modified the feature extraction algorithm (MFCC) using modified MFCC in order to reduce the noise. Therefore i want to know about the format of mfc file that generated using MFCC feature extraction.
How to generate the .mfc file? What was the information that saved in .mfc file? why using binary format file?
Thank you for your help in advance.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Thank you for your response, but i have some question more.
Can u explain to me more detail about the header and the data?
Was the header save in the first byte then followed by the data?
The element of data saved in a matrix or in a sequence data?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I have some question about MFC file format in Sphinx3.
sir I am currently working dysarthric speech to text alignment.
I want to modify the feature extraction algorithm (MFCC). Therefore i want to know about the format of mfc file that generated using MFCC feature extraction.
How to generate the .mfc file? What was the information that saved in .mfc file? why using binary format file?
Also, I want to know how to write the triphones for words in dictionary and whether I want to write a triphones for all the words in dictionary.
Is there any standard dictionary available with their phonetic description.
I need your help.
Thank you
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Thank you for your quick response.
From the command line I am try to execute sphinx_fe but it says try to install sphinx_fe for that I have installed Sphinx_fe and try to run. For that it will shows following error saying that no arguments should be given.
Sir, I want to implement feature extraction process in matlab instead of sphinx_fe for that it is necessary to know in which format the extracted features are before they get written into binary file.
If I extract features using matlab script how can I convert into .mfc binary file so that it can used with sphinx3.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Sir I want to know how write a triphone for the words with an example please tell me sir. Is it compulsory to write triphone for all the words in dictionary.Is it possible to do phone level alignment if yes how can be done. Sir I have studied one paper in that they are using htk toolkit in that by changing language model weight they are getting alignment rates very high.Sir I want to know what is language model weight is it possible to change in cmu sphinx 3.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Sir, I am really sorry if I am disturbing you. For phoneme level alignment how to start. What are the changes I need to do. Can I get any timing information from aligment results. Sir, I am m.tech student no one is here to assist. I don't know how to use htk toolkit I know some basics of hidden markov model theoretically our guide tell me directly to use cmu sphinx3. As I am new to this i need your help sir please help me. Thank you.
Last edit: Diwakar.G 2016-11-14
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Sir I am currently working on dysarthric speech to text alignment. Usually peoples who are suffered from dysarthric contain long pauses between words and even between words. So first initially I need locate their timing and remove those pauses and then it is applied to cmu sphinx3. For this i have written dictionary for those words they uttered. Now they have told first do phone level alignment. Here I have a confusion is it mandatory to write triphone for all words in dictionary. Should I need to modify the code for phone level alignment. Sir please help me.
Thank you.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I have some question about MFC file format in Sphinx.
I want to modified the feature extraction algorithm (MFCC) using modified MFCC in order to reduce the noise. Therefore i want to know about the format of mfc file that generated using MFCC feature extraction.
How to generate the .mfc file? What was the information that saved in .mfc file? why using binary format file?
Thank you for your help in advance.
Thank you for your response, but i have some question more.
Can u explain to me more detail about the header and the data?
Was the header save in the first byte then followed by the data?
The element of data saved in a matrix or in a sequence data?
> Was the header save in the first byte then followed by the data?
Do you understand what you are asking? "header" is called "header" because it goes first.
It's just a length in frames followed by binary data.
/***Header**/
/ compute number of frames and write cepfile header /
numframes = fe_count_frames(FE,len,COUNT_PARTIAL);
if (P->logspec != ON)
outlen = numframesFE->NUM_CEPSTRA;
else
outlen = numframes*FE->MEL_FB->num_filters;
if (P->output_endian != P->machine_endian)
SWAPL(&outlen);
if (write(fp, &outlen, 4) != 4) {
E_ERROR("Data write error on %s\n",outfile);
close(fp);
return(FE_OUTPUT_FILE_WRITE_ERROR);
}
if (P->output_endian != P->machine_endian)
SWAPL(&outlen);
/**Data**/
int32 fe_writeblock_feat(param_t P, fe_t FE, int32 fp, int32 nframes, float32 feat)
{
}
I have some question about MFC file format in Sphinx3.
sir I am currently working dysarthric speech to text alignment.
I want to modify the feature extraction algorithm (MFCC). Therefore i want to know about the format of mfc file that generated using MFCC feature extraction.
How to generate the .mfc file? What was the information that saved in .mfc file? why using binary format file?
Also, I want to know how to write the triphones for words in dictionary and whether I want to write a triphones for all the words in dictionary.
Is there any standard dictionary available with their phonetic description.
I need your help.
Thank you
from command line with
sphinx_fe
binary, from API withsphinxbase/fe.h
headerFeature matrix
Binary representation is more efficient
You can write triphones by hand in editor or with a script.
Sure, google for cmudict.
Thank you for your quick response.
From the command line I am try to execute sphinx_fe but it says try to install sphinx_fe for that I have installed Sphinx_fe and try to run. For that it will shows following error saying that no arguments should be given.
Sir, please help from .sph file how to convert .mfc file what are arguments for that function in detail.
Thank you for your help in advance.
sphinx_fe -nist yes -i file.sph -o file.mfc
Sir, I want to implement feature extraction process in matlab instead of sphinx_fe for that it is necessary to know in which format the extracted features are before they get written into binary file.
If I extract features using matlab script how can I convert into .mfc binary file so that it can used with sphinx3.
You have to write a simple code for that, something like
MFCC file format is described here http://cmusphinx.sourceforge.net/wiki/mfcformat
Thank you sir
Thank yo u sir for your kind response.
Sir I want to know how write a triphone for the words with an example please tell me sir. Is it compulsory to write triphone for all the words in dictionary.Is it possible to do phone level alignment if yes how can be done. Sir I have studied one paper in that they are using htk toolkit in that by changing language model weight they are getting alignment rates very high.Sir I want to know what is language model weight is it possible to change in cmu sphinx 3.
cmusphinx is quite different from htk, you can't easily transfer htk work without deep understanding of all internals.
Sir,I am working lyrics to song alignment. I also have same doubt is it compulsory to write triphone for all words in dictionary.
No, unlike htk cmusphinx deals with triphones internally, they are not used as input.
Sir, I am really sorry if I am disturbing you. For phoneme level alignment how to start. What are the changes I need to do. Can I get any timing information from aligment results. Sir, I am m.tech student no one is here to assist. I don't know how to use htk toolkit I know some basics of hidden markov model theoretically our guide tell me directly to use cmu sphinx3. As I am new to this i need your help sir please help me. Thank you.
Last edit: Diwakar.G 2016-11-14
You can ask him for furhter details then.
Sir I am currently working on dysarthric speech to text alignment. Usually peoples who are suffered from dysarthric contain long pauses between words and even between words. So first initially I need locate their timing and remove those pauses and then it is applied to cmu sphinx3. For this i have written dictionary for those words they uttered. Now they have told first do phone level alignment. Here I have a confusion is it mandatory to write triphone for all words in dictionary. Should I need to modify the code for phone level alignment. Sir please help me.
Thank you.
Sir I don't understand what is ihe problem can you please tell me I stuck with this error. I am using my own dictionary,data for training the model.
I have checked all the phones are used once in dictionary
You are using outdated sphinxtrain.
You have bad empty lines in transcript and ctl files.
Sir, is there any newer version sphinxtrain available for cmu sphinx 3. If yes, how to install it.
I have attached the transcription file below sir please tell me how to remove bad empty lines.
Thank you.
In downloads.
With a text editor.
Sir, I have removed that empty line now i have while running this
perl scripts_pl/20.ci_hmm/slave_convg.pl
I am getting following error.
can you please help me.
Thank you.
Use latest sphinxtrain and follow the tutorial
http://cmusphinx.sourceforge.net/wiki/tutorialam
It gives correct and up-to-date information about acoustic model training.