Phoneme recogniton in pocketsphinx

Speech Recognition Toolkit

Brought to you by: air, arthchan2003, awb, bhiksha, and 5 others

This project can now be found here.

Phoneme recogniton in pocketsphinx

Forum: Help

Creator: lynucs

Created: 2009-11-30

Updated: 2012-09-22

lynucs - 2009-11-30

Hi,

I'm looking for a tool which prints the phonemes it recognized (maybe
pocketsphinx can do that with API, sphinx3 is too heavy. but i think sphinx3
is able to do that with the option -mode allphone)

The goal is to store and recognize proper nouns with their phonemes. DTW works
very well in this case... but it is completely speaker dependent.

Regards,
Laurent

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Nickolay V. Shmyrev - 2009-12-03

Pocketsphinx doesn't support this as a part of the public API. Basically it's
because such recognition is not really reasonable/practical.

The proper usage of pocketsphinx depends on the type of applicatoin you are
going to implement. What exactly do you develop?

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

lynucs - 2009-12-03

Voice command with a specific and simple vocabulary. Pocketsphinx works very
well in this case.
But there is a another tricky part i try to have, the user stores his name in
a database with his voice and later an administrator can remove his name from
this database with his voice.

I see 3 possibilties:

DTW: but it s speaker dependent.

Create a dicitonnary only with the alphabet and then spell the name. Once i have the written name i can produce the phoneme transcription with espeak or lia_phon, then i can create a dictionnary with names and phonemes for recogniton. Problem; the recognition of alphabet is not very efficient (pronounication of letters are maybe too much similar),
Advantage: speaker independent and the system can pronounce the name by using
a tts like mbrola.

Maybe a last possibility but i don't know if it is possible easily. The system stores the phonemes of a familly name (for example) from the user's speech. When the administrator speaks for a recognition of a username, the system tries to compare the administrator's phonemes between all previously stored phonemes by the users. This solution could the best one.

Don't know if it s clear???

Regards,
Laurent
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Nickolay V. Shmyrev - 2009-12-04

In such situations names are usually transribed with g2p code and added both
to the grammar and to the dictionary. It can be done with lia_phon for
example.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

lynucs - 2009-12-05

But when the user tells his name how i can store it in phoneme format?

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Nickolay V. Shmyrev - 2009-12-07

Usually all possible names are already stored in the dictionary. At least most
common ones.

As an alternative you can build a grammar consisting of all possible
syllables/subword parts and use it for recognition. Also you can use a grammar
with phones like this one:

\<phones> = aa | b | c | ae | d .... ;

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

lynucs - 2009-12-08

yes, i will try...

something like:
<name> = series of phones
<phones> = aa | b | ae ... </phones></name>

I don't know the syntax for the grammar for the moment.
keep you inform

thx
laurent

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Log in to post a comment.