PNCC Feature file - CMU Sphinx Support

Speech Recognition Toolkit

Forum: Speech Recognition Theory

Created: 2011-05-19

Updated: 2012-09-22

Remus - 2011-05-19

Hello,

I was wondering whether CMU plans to release support for PNC files, as it
does for standard MFC files.

I have produced a PNCC acoustic model for French, and the results showed an
improvement of about 12% WER against its MFCC counterpart. For that, I used
the MATLAB script produced by CMU, embedded in a Perl application that
automates the creation of the pnc files.

But the process is slow and only experimental.

Thank you

Nickolay V. Shmyrev - 2011-05-19

Hi Remus

The C code that existed for PNCC was rather dirty so I wouldn't rely on it
being released.

Remus - 2011-05-19

I understand. But still, after reading some papers and conducting this study
(which involved more than 5000 audio files), it seems PNCC is promising.

I hope a PNCC "wave2feat" will eventually be released :)

Best regards,

Vassil Panayotov - 2011-05-19

Hi,

A 12% absolute WER reduction sounds quite impressive ...
My understanding from looking at some papers is that they made the comparisons
using 16 kHz speech.
Does anyone know whether the relative advantages and robustness of PNCC are
preserved with 8 kHz telephone-grade speech?
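
A "12%" improvement can be read as either an absolute or a relative reduction, and the two are easy to tell apart with a small calculation. Below is a minimal Python sketch. The function name `wer_reduction` and the example numbers are hypothetical illustrations, not figures reported in this thread.

```python
def wer_reduction(baseline_wer, new_wer):
    """Return (absolute, relative) WER reduction.

    Both inputs are word error rates in percent. The absolute reduction
    is in percentage points; the relative reduction is a percentage of
    the baseline WER.
    """
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    absolute = baseline_wer - new_wer
    relative = 100.0 * absolute / baseline_wer
    return absolute, relative


# Hypothetical numbers, NOT taken from the experiment discussed above.
absolute, relative = wer_reduction(40.0, 28.0)
print(f"absolute: {absolute:.1f} points, relative: {relative:.1f}%")
```

With these made-up numbers, a 12-point absolute drop corresponds to a 30% relative drop, so the same experiment can be quoted with two quite different figures.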
