User Activity

  • Posted a comment on discussion Help on CMU Sphinx

    The error message in the log file says ERROR: "sphinx_fe.c", line 119: Failed to open /data/SpeechData/train_si.bd4/TH001_1.wav: No such file or directory You need to firstly follow the error messgae and check if the file is available under the directory.

  • Posted a comment on discussion Sphinx4 Help on CMU Sphinx

    As Nickolay suggested, Kaldi should help you build a good engine. If you need help, please email me direclty.

  • Posted a comment on discussion Sphinx4 Help on CMU Sphinx

    Q1 & Q2: No, you don't have to use Sphinx. For the large training data you have collected, you may try other toolkits with deep learning methods. Q3: You are talking about a few different speech tasks other than STT. It is possible, but you may have to collect and annotate speech data differently for these tasks, and hire experienced speech scientists/engineers to work on these projects. Q4: Since Google or IBM's speech models are built on open domain, so it is possible to build/optimize your domain-specific...

  • Posted a comment on discussion Speech Recognition on CMU Sphinx

    The SLP textbook by Huang should cover all your listed topics

  • Posted a comment on discussion Sphinx4 Help on CMU Sphinx

    Kaldi has recipes for speaker identification, e.g., https://github.com/kaldi-asr/kaldi/tree/master/egs/sre08

  • Posted a comment on discussion Open Discussion on Kaldi

    You need to use 'int2sym.pl', for example, cat decode_tg_test/scoring/14.tra | utils/int2sym.pl...

  • Posted a comment on discussion Open Discussion on Kaldi

    Hi Dan, I have a similar question. I had a 1 GPU machine (K20) in the cluster before,...

  • Posted a comment on discussion Help on Kaldi

    Thanks. What's the performance difference in terms of speed and accuracy between...

View All

Personal Data

Username:
vjdtao
Joined:
2010-07-22 00:45:34

Projects

  • No projects to display.

Personal Tools

MongoDB Logo MongoDB