I am using pocketsphinx, ps_alignment to align a text and to get phoneme level timings . For some word, i have multiple alternative pronounciation in the dictionary.
AND AE N D
AND(2) AH N D
THEMSELVES DH EH M S EH L V Z
THEMSELVES(2) DH AH M S EH L V Z
Hi All,
I am using pocketsphinx, ps_alignment to align a text and to get phoneme level timings . For some word, i have multiple alternative pronounciation in the dictionary.
AND AE N D
AND(2) AH N D
THEMSELVES DH EH M S EH L V Z
THEMSELVES(2) DH AH M S EH L V Z
I am using following code for text alignment ,
It is always aligned to the first pronunciation of the word for all words. How I can solve this problem?
First decode to figure out exact pronunciation then align to get time boundaries.
Hi Nickolay,
Thanks for your suggestion.