Hi all,
Greetings.
I selected 10 speakers and asked each of them to record the same 30 words. I then used the wave files from 9 of the speakers to train the acoustic model and held out the remaining speaker's wave files for testing.
I got 100% WER and SER during decoding!
I am surprised: the engine was trained on the same words from all the other speakers, yet it cannot recognize those words when spoken by a different speaker.
How is this possible? Is Sphinx purely speaker-dependent?
Please share your training folder so we can help with this issue.
Of course. Here is a shareable link to my data:
https://drive.google.com/file/d/0B_74UylilDfCYmdyZmhhYlJSaEE/view?usp=sharing
Thank you for the response.
Your mistake is that the ARPA language model is not properly prepared: it is built from phones instead of words.
If you build the ARPA LM from words, decoding will be much more accurate; it will actually recognize most of the words.
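For a small, fixed vocabulary like your 30 words, even a flat unigram ARPA model over the words is enough to show the difference. Below is a minimal sketch of what such a word-level model looks like (the word list and the `words.lm` filename are placeholders, not taken from your data); in practice you would normally build the LM with the cmuclmtk tools or the online lmtool from a transcript file instead of writing it by hand:

```python
import math

# Placeholder word list -- replace with the 30 words your speakers recorded.
words = ["ONE", "TWO", "THREE"]  # ... and so on, up to 30 words

# Uniform unigram probability over all tokens, in log10 as ARPA requires.
tokens = ["<s>", "</s>"] + words
logprob = math.log10(1.0 / len(tokens))

with open("words.lm", "w") as f:
    f.write("\\data\\\n")
    f.write(f"ngram 1={len(tokens)}\n\n")
    f.write("\\1-grams:\n")
    for t in tokens:
        # Highest-order n-grams carry no backoff weight, so each line
        # is just "log10_probability word".
        f.write(f"{logprob:.4f} {t}\n")
    f.write("\n\\end\\\n")
```

Pass the resulting file to the decoder with -lm words.lm, together with a dictionary that maps each of the 30 words to its phone sequence. The phones belong in the dictionary and the acoustic model, not in the language model.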
Okay, that was my mistake. Thank you!
:)