[Kaldi-users] HMM Topology skipping question

Hi all,

The standard Kaldi HMM topology doesn't seem to allow state skipping, e.g.,
HMM states 0 and 1 don't transition directly to state 3.  Does this impose
a limitation that a phone must last at least 3 frames (30 ms)?
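
To make the arithmetic concrete, here is a minimal sketch of how I am
counting frames.  It is my own illustration, not Kaldi code.  It assumes a
10 ms frame shift, that states 0 to 2 are emitting and state 3 is the
non-emitting final state, and that every emitting state on a path consumes
at least one frame.  The names min_frames, STANDARD and WITH_SKIPS are
hypothetical, and the skip variant is only an example of what we might try.

```python
from collections import deque

FRAME_SHIFT_MS = 10  # assumed frame shift (3 frames = 30 ms)

# Allowed transitions per emitting state; state 3 is the final state.
STANDARD = {0: [0, 1], 1: [1, 2], 2: [2, 3]}
# Hypothetical variant with skip arcs 0 -> 2 and 1 -> 3.
WITH_SKIPS = {0: [0, 1, 2], 1: [1, 2, 3], 2: [2, 3]}


def min_frames(transitions, start, final):
    """Return the fewest frames any path from start to final can consume.

    Each emitting state visited on the path emits one frame, so the answer
    is the number of arcs on the shortest path (breadth-first search).
    """
    dist = {start: 0}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        if state == final:
            return dist[state]
        for nxt in transitions.get(state, ()):
            # Self-loops only lengthen a path, so they are ignored here.
            if nxt != state and nxt not in dist:
                dist[nxt] = dist[state] + 1
                queue.append(nxt)
    return None  # final state is unreachable


if __name__ == "__main__":
    for name, topo in (("standard", STANDARD), ("with skips", WITH_SKIPS)):
        frames = min_frames(topo, 0, 3)
        print(f"{name}: {frames} frames = {frames * FRAME_SHIFT_MS} ms minimum")
```

Under these assumptions the standard topology gives a 3-frame minimum, while
the variant with skip arcs gives 2 frames.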

The reason for asking is that we have seen poor decoding accuracy on very
fast speech.  Our analysis shows a rather high phone error rate.  Some
phones in the fast speech segments were definitely shorter than 30 ms.
gmm-align seems to point to this as well: the shortest phone alignment
it produces is 30 ms.

We will probably experiment with introducing skip transitions in the HMM
topology.  Before we start, are there any pitfalls we should know about, or
any pointers or ideas?  Or am I missing something entirely?

--
Thanks
Ben Jiang