Utterance separated into phonetic level?

Speech Recognition Toolkit

Brought to you by: air, arthchan2003, awb, bhiksha, and 5 others

This project can now be found here.

Utterance separated into phonetic level?

Forum: Help

Creator: Lose Looser

Created: 2017-03-21

Updated: 2017-03-21

Lose Looser - 2017-03-21

I am currently using the an4 dataset available from here
I am trying to do phone recognition, but the utterance has only the words and not the phones. Is there someway i can extract utterance which is phonetically seperated?

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2017-03-21
  
  Acoustic model is the same for phone and words recognition. You do not need a database separated on phonemes for phonetic recogniton, you can use conventional database. If you need higher accuracy you can use larger dataset like tedlium for training.
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Lose Looser - 2017-03-21

But it only seems possible to do it on the test set.. rather than the train set..

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2017-03-21
  
  You need to be more clear what do you mean by "it".
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Lose Looser - 2017-03-21

I tried converting my utterances using the acoustic model to phonetic level.
What i ended up getting was only the test set seperated into phonemes.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2017-03-21
  
  And what is the problem? Until you explain in details nobody will help you.
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Lose Looser - 2017-03-21

There is two problems.. I am not sure why but I was only able to do it using the test set..
Second problem being that the number of phone classes. Most phoneme recognition describe the issue as a 61 class problem, but i seem to have more than 61 classes. above 100?..

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2017-03-21
  
  In Kaldi group you provided much better description, it is sad you are trying to fool us here. No much to discuss then.
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Log in to post a comment.