bhiksha / Profile

User Activity

Posted a comment on discussion Help on CMU Sphinx
What do you plan to use the speech rec for? The basic recognizer is just an engine. You can recognize prerecorded files with it. If you want to use it within an application, the APIs must be called. There's documentation on how to do this. -Bhiksha On Wed, Aug 8, 2018 at 11:39 AM, lucifer lucifer258@users.sourceforge.net wrote: I want to use acoustic model availabe on site and develop a speech recognition, I reffered the documentation but it can't help me out . I have completed the starting process...
6 years ago
Posted a comment on discussion Speech Recognition on CMU Sphinx
Its iffy. Spelling to pronunciation rules can be very arcane in most languages, and you end up using much of the capacity of your network to capture these oddities. So in that sense, phoneme recognition is the more natural task. By the same token, though, the spelling oddities, if well captured, can end up providing you with a stronger grammar than just phonetic structure. I expect someone has run the comparison, although I haven't seen any myself. Perhaps I can have Vishal run this test; he's currently...
7 years ago
Posted a comment on discussion Help on CMU Sphinx
We have four types of triphones -- a) word-internal (marked "i" in the mdef files) b) begin-word (e.g. the triphone B (AH, AE) in the word-pair "A BAD ..". Marked "b" in the mdef file) c) end-word (e.g. the triphone B (AE, AH) in the word-pair "GRAB A ..". Marked "e" in the mdef file) d) single-phone-word (e.g. from the words "A" and "I", where the entire word is only one phoneme, so all triphones are both-begin word and end-word triphones. e.g. the triphone "AY (M, AE)" in "SAM I AM". These are...
7 years ago
Posted a comment on discussion Sphinx4 Help on CMU Sphinx
This should be easy. You only have to replace the GMM class with your DNN. You'd...
9 years ago
Posted a comment on discussion Help on CMU Sphinx
You dont need "sudo" in cygwin. -Bhiksha On Sun, May 3, 2015 at 6:41 AM, safia hammad...
9 years ago
Posted a comment on discussion Speech Recognition on CMU Sphinx
The overall duration of the phoneme is reasonably well modeled by the HMM. For the...
9 years ago
Posted a comment on discussion Speech Recognition on CMU Sphinx
Sure. Happens all the time. The point is, we're actually performing a search over...
1 decade ago
Posted a comment on a wiki page on bhiksha
Yes, that is it. It also includes a "language" score. You could try the T, the A...
1 decade ago

View All

Personal Data

Username:: bhiksha
Joined:: 2002-01-04 19:20:17

Projects

CMU Sphinx Speech Recognition Toolkit Last Updated: 2024-08-15
Distant Speech Recognition Beamforming and Speech Recognition Toolkit Last Updated: 2023-04-20

Bhiksha Raj

User Activity

Personal Data

Projects

This is a list of open source software projects that Bhiksha Raj is associated with:

Personal Tools