Hi,
Do the values of the mfcc file extracted with sphinx_fe represent the feature vector ? What I mean here is, are there any other calculations before constructing the mfcc matrix ?
Last edit: Karim BEN ALAYA 2017-08-04
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
What do those values (mfcc file read with matlab) refer to exactely ? Are they only ceptrums ?
I noticed that in each file, the numbers are a multiplication of 39, So I guess it's the whole set in there, right ? (ceptrum + delta + delta-delta) And are they in the right order of the feat 1s_c_d_dd ?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Yes it does, the sphinx_fe script takes the wav input file ( speech signal) and convert it to acoustic features or feature vector, take a look to man sphinx_fe which produce as a result :
NAME
sphinx_fe - Convert audio files to acoustic feature files
SYNOPSIS
sphinx_fe [ options ]...
DESCRIPTION
This program converts audio files (in either Microsoft WAV, NIST Sphere, or raw format) to acoustic feature files for input
to batch-mode speech recognition. The resulting files are also useful for various other things. A list of options follows:
Hi,
Do the values of the mfcc file extracted with sphinx_fe represent the feature vector ? What I mean here is, are there any other calculations before constructing the mfcc matrix ?
Last edit: Karim BEN ALAYA 2017-08-04
What do those values (mfcc file read with matlab) refer to exactely ? Are they only ceptrums ?
I noticed that in each file, the numbers are a multiplication of 39, So I guess it's the whole set in there, right ? (ceptrum + delta + delta-delta) And are they in the right order of the feat 1s_c_d_dd ?
Yes it does, the sphinx_fe script takes the wav input file ( speech signal) and convert it to acoustic features or feature vector, take a look to man sphinx_fe which produce as a result :
NAME
sphinx_fe - Convert audio files to acoustic feature files
SYNOPSIS
sphinx_fe [ options ]...
DESCRIPTION
This program converts audio files (in either Microsoft WAV, NIST Sphere, or raw format) to acoustic feature files for input
to batch-mode speech recognition. The resulting files are also useful for various other things. A list of options follows: