Menu

#11 Make Halef's sampling rate variable

1.0
open
None
2015-02-02
2014-11-24
No

Due to Halef originating from a conventional telephony platform, it is currently limited to 8kHz sampling rate for audio in- and output. This is suboptimal for ASR as well as perceived TTS quality. Explore how to make the sampling rate generic. This is particularly interesting when using softphones, web clients, or smart phone apps as SIP clients. Potentially affected modules include

  • the SIP client config
  • Asterisk
  • Cairo/MRCP
  • VAD
  • Sphinx/ASR config
  • acoustic models
  • Mary
  • Festival

Discussion

  • David Suendermann-Oeft

     
  • David Suendermann-Oeft

    Dear David,

    The DNN-AM training procedure has been finished successfully.

    Please see the attached file for the results.

    Best!

    AI

     
  • David Suendermann-Oeft

     
  • David Suendermann-Oeft

    Dear Alex,

    This is great! A mere 0.93% WER difference between 8kHz and 16kHz speech is gorgeous and shows that adding enhanced sampling rate support in Halef is of rather low priority compared to other activities.

    Thanks,
    Yours,

    DSO

     

Log in to post a comment.

MongoDB Logo MongoDB