CMU Sphinx / Forums / Speech Recognition Theory: What's my next step while developing a Speech Recognition system?

Mehmet Mert Yildiran - 2017-07-12

Hi, I'm trying to develop (and at the same time learn how a) Speech Recognition system (works). I'm currently able to draw the spectrum and find the peaks.

The current state of my program is in here (there is not much, just ~200 lines). Also I recommend you to read the README.md (it's not long, a few lines).

My question is what's my next step? I think my next step is trying to recognize the phonemes. But I'm not sure how can I do that? How can I distinguish the vowels 'a' from 'e' for example. Also how can I detect the boundaries (or transitions) between the phonemes. From this point, do I need to jump into Hidden Markov model or Neural Networks? Also what data should I feed to HMM or ANN, the peaks? or the spectrum graph's itself?

Last edit: Mehmet Mert Yildiran 2017-07-12

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2017-07-12
  
  My question is what's my next step?
  
  Read speech recognition textbook.
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
  - Mehmet Mert Yildiran - 2017-07-12
    
    I already did. I read a lot of academic material. But most of the time they have created more questions than answers, in my mind...
    
    When it comes to practice, people like me (alien to this research because of certain reasons) can stuck and need guidance. So I'm asking for a short guide, a To-Do List.
    
    If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

What's my next step while developing a Speech Recognition system?

Speech Recognition Toolkit

Forums

Help

What's my next step while developing a Speech Recognition system?

What's my next step while developing a Speech Recognition system?

Speech Recognition Toolkit

Forums

Help

What's my next step while developing a Speech Recognition system? document.SUBSCRIPTION_OPTIONS = { "thing": "topic", "subscribed": false, "url": "subscribe", "icon": { "css": "fa fa-envelope-o" } };

What's my next step while developing a Speech Recognition system?