lupomuc / Profile

User Activity

Posted a comment on discussion Help on CMU Sphinx
It's been several years since I last used the tools, so I can't give you much feedback. I used them for two things: I seem to remember that the LM generation scripts were somehow hosted on the site, allowing me to download and study them. It's surprisingly hard to find good (and simple!) code that generates language models, and those scripts served as a great starting point for writing my own, application-specific implementation. I then used the output of the web interface as a kind of ground truth...
4 years ago
Posted a comment on discussion Help on CMU Sphinx
This article lists two other options for building language models. I haven't used them myself, but it sounds like they will give you the same result, albeit with a bit more effort.
4 years ago
Posted a comment on discussion Help on CMU Sphinx
Oh, that sucks. Good luck restoring the contents! Is there a public repo with the source code? I did some searching, but all links pointed to the website only.
4 years ago
Posted a comment on discussion Help on CMU Sphinx
The LMTool site (http://www.speech.cs.cmu.edu/tools/lmtool-new.html) appears to be down. It would be great if it could be got up again!
4 years ago
Modified a comment on discussion Help on CMU Sphinx
I'm thinking of training a custom acoustic model. The dictionary contains words with multiple pronunciations, like this: either a ɪ ð ə either(2) i ð ə Not let's suppose one of the training samples is the phrase "You say either [i ð ə] and I say either [a ɪ ð ə]". What should the transcript file contain? Is the trainer smart enough to determine the correct pronunciation from context, so that the transcript can be "<s> you say either and i say either </s>"? Or do I need to give it the exact word alternatives,...
5 years ago
Posted a comment on discussion Help on CMU Sphinx
I'm thinking of training a custom acoustic model. The dictionary contains words with multiple pronunciations, like this: either a ɪ ð ə either(2) i ð ə Not let's suppose one of the training samples is the phrase "You say either [i ð ə] and I say either [a ɪ ð ə]". What should the transcript file contain? Is the trainer smart enough to determine the correct pronunciation from context, so that the transcript file can be "<s> you say either and i say either </s>"? Or do I need to give it the exact word...
5 years ago
Posted a comment on discussion Help on CMU Sphinx
Thank you very much!
8 years ago
Modified a comment on discussion Help on CMU Sphinx
I also looked at other C/C++ implementations of MFCC extraction. But those that looked good either came with a non-permissive license (GPL) or were part of huge libraries. Any help on how to extract MFCCs in code with PocketSphinx would be appreciated!
8 years ago

View All

Personal Data

Username:: lupomuc
Joined:: 2010-09-18 19:40:16

Projects

No projects to display.

Daniel Wolf

User Activity

Personal Data

Projects

Personal Tools