I have a large collection of audio files with their transcripts in a foreign language.
I want to be able to recognize whether the user recites the right words from the text.
How do I start approaching this using CMU Sphinx? Do I need a language model, acoustic model?
I would like some guidance, please, on where to start.
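(In case it helps frame the problem: whatever recognizer you end up using, verification typically reduces to comparing the recognized word sequence against the reference transcript. Below is a minimal, self-contained sketch of that comparison step as a word error rate, computed with Levenshtein edit distance over words. It assumes you already have a hypothesis string from the recognizer; the function name and threshold are illustrative, not part of any Sphinx API.)

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance (substitutions + insertions + deletions),
    normalized by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    # Dynamic-programming table: d[i][j] = edit distance between
    # the first i reference words and the first j hypothesis words.
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i  # delete all i reference words
    for j in range(len(hyp) + 1):
        d[0][j] = j  # insert all j hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,        # deletion
                          d[i][j - 1] + 1,        # insertion
                          d[i - 1][j - 1] + cost)  # match / substitution
    return d[len(ref)][len(hyp)] / max(len(ref), 1)

# Example: accept the recitation if the error rate is below some threshold.
wer = word_error_rate("the quick brown fox", "the quick brown box")
recited_correctly = wer < 0.2  # 1 substitution out of 4 words -> 0.25, rejected
```

You could then feed the recognizer's output (e.g., from pocketsphinx) into this check, tuning the threshold on your own recordings.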
You already asked this question at http://stackoverflow.com/questions/43967550/detecting-speech-based-on-a-collection-of-audio#43967550