sphinx4 recognition on recorded file

2010-07-16
2012-09-22
  • Chaofeng Chen

    Chaofeng Chen - 2010-07-16

    I ran Sphinx4 on the 8 kHz wave files that come with the demo, as well as on
    wave files downloaded from VoxForge; the WER is 10% with an n-gram language
    model.

    But when I recorded a wave file from an Asterisk server (it goes through VoIP,
    and the sampling frequency is 8 kHz) and let Sphinx4 transcribe it, the WER
    was over 80%. Is that because the 8 kHz acoustic model is only good for
    microphone-recorded files and not for files recorded from a VoIP phone system?

    Or did I do something wrong, or do I need to convert the recorded file somehow?

    Please advise.

    Thanks.

     
  • Nickolay V. Shmyrev

    Most likely your audio files have the wrong format. They might be stereo
    instead of mono, big endian instead of little endian, or mu-law instead of
    PCM. Check everything again carefully.
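
    One way to run the checks listed above is to read the file header with the
    standard javax.sound.sampled API and compare it with what the recognizer is
    expected to receive. This is only a sketch: the class name WavFormatCheck and
    the helper findProblems are made up for illustration, and the target format
    (signed PCM, mono, 16-bit, little endian, 8000 Hz) is an assumption based on
    the reply and the 8 kHz model, so adjust it to whatever your Sphinx4 front-end
    configuration actually specifies.

```java
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioSystem;
import java.io.File;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Sphinx4.
final class WavFormatCheck {

    // Returns a list of mismatches against the ASSUMED target format:
    // signed PCM, mono, 16-bit, little endian, 8000 Hz.
    static List<String> findProblems(AudioFormat fmt) {
        List<String> problems = new ArrayList<>();
        if (!AudioFormat.Encoding.PCM_SIGNED.equals(fmt.getEncoding())) {
            problems.add("encoding is " + fmt.getEncoding() + ", expected PCM_SIGNED");
        }
        if (fmt.getChannels() != 1) {
            problems.add("channels = " + fmt.getChannels() + ", expected 1 (mono)");
        }
        if (fmt.getSampleSizeInBits() != 16) {
            problems.add("sample size = " + fmt.getSampleSizeInBits() + " bits, expected 16");
        }
        // Byte order only matters when a sample spans more than one byte.
        if (fmt.getSampleSizeInBits() > 8 && fmt.isBigEndian()) {
            problems.add("byte order is big endian, expected little endian");
        }
        if (Math.abs(fmt.getSampleRate() - 8000f) > 0.5f) {
            problems.add("sample rate = " + fmt.getSampleRate() + " Hz, expected 8000");
        }
        return problems;
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java WavFormatCheck <file.wav>");
            return;
        }
        // Reads only the header; throws UnsupportedAudioFileException
        // if the file type is not recognized.
        AudioFormat fmt = AudioSystem.getAudioFileFormat(new File(args[0])).getFormat();
        System.out.println("Detected format: " + fmt);
        List<String> problems = findProblems(fmt);
        if (problems.isEmpty()) {
            System.out.println("No mismatch with the assumed target format.");
        } else {
            for (String p : problems) {
                System.out.println("Mismatch: " + p);
            }
        }
    }
}
```

    Run it on the Asterisk recording and on one of the demo files that decodes
    well; any difference between the two reports is the first thing to fix.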

     
