Location of code for extracting MFCC from audio file?

Forum: Help

Creator: Colin Gilbert

Created: 2014-09-01

Updated: 2014-09-01

Colin Gilbert - 2014-09-01

Hi,

I want to implement my own classifier, and I want to use the CMUSphinx MFCC-extraction implementation. I've seen many implementations, but for purposes of standardization/comparison it seems a better idea to use an existing open, tested, quality toolkit.

How would one use the code in Sphinxbase/Pocketsphinx to extract MFCCs from an audio file (or live input) without passing them on to an HMM? Which functions, in which files? And, of course, I intend to open-source the result. :)

Thanks!
Colin

Nickolay V. Shmyrev - 2014-09-01

"How would one use the code in Sphinxbase/Pocketsphinx to extract MFCC from an audio file (or live input) without passing it onto an HMM?"

The header is sphinxbase/include/sphinxbase/fe.h

The implementation is in sphinxbase/src/libsphinxbase/fe/*.[ch]

An example is in sphinxbase/src/sphinx_fe

Colin Gilbert - 2014-09-01

Cool! Thanks for the fast response. I really appreciate it. I am currently experimenting with the convenient sphinx_fe tool that was compiled along with the rest of the program.

Now my big question is: Is there a way to run this tool on live microphone input? If so, how?

- Nickolay V. Shmyrev - 2014-09-01
  
  "Now my big question is: Is there a way to run this tool on live microphone input? If so, how?"
  
  No, you can only process files.

Colin Gilbert - 2014-09-01

Would a Unix pipe potentially work? Also, does the frame rate refer to the number of samples per second or the number of bytes per frame?

Colin Gilbert - 2014-09-01

Never mind, I read the source code to answer the frame rate question, and pipes are generally way too small to be reliable :P
