User Activity

  • Posted a comment on discussion Help on CMU Sphinx

    What do you plan to use the speech rec for? The basic recognizer is just an engine. You can recognize prerecorded files with it. If you want to use it within an application, the APIs must be called. There's documentation on how to do this. -Bhiksha On Wed, Aug 8, 2018 at 11:39 AM, lucifer lucifer258@users.sourceforge.net wrote: I want to use acoustic model availabe on site and develop a speech recognition, I reffered the documentation but it can't help me out . I have completed the starting process...

  • Posted a comment on discussion Speech Recognition on CMU Sphinx

    Its iffy. Spelling to pronunciation rules can be very arcane in most languages, and you end up using much of the capacity of your network to capture these oddities. So in that sense, phoneme recognition is the more natural task. By the same token, though, the spelling oddities, if well captured, can end up providing you with a stronger grammar than just phonetic structure. I expect someone has run the comparison, although I haven't seen any myself. Perhaps I can have Vishal run this test; he's currently...

  • Posted a comment on discussion Help on CMU Sphinx

    We have four types of triphones -- a) word-internal (marked "i" in the mdef files) b) begin-word (e.g. the triphone B (AH, AE) in the word-pair "A BAD ..". Marked "b" in the mdef file) c) end-word (e.g. the triphone B (AE, AH) in the word-pair "GRAB A ..". Marked "e" in the mdef file) d) single-phone-word (e.g. from the words "A" and "I", where the entire word is only one phoneme, so all triphones are both-begin word and end-word triphones. e.g. the triphone "AY (M, AE)" in "SAM I AM". These are...

  • Posted a comment on discussion Sphinx4 Help on CMU Sphinx

    This should be easy. You only have to replace the GMM class with your DNN. You'd...

  • Posted a comment on discussion Help on CMU Sphinx

    You dont need "sudo" in cygwin. -Bhiksha On Sun, May 3, 2015 at 6:41 AM, safia hammad...

  • Posted a comment on discussion Speech Recognition on CMU Sphinx

    The overall duration of the phoneme is reasonably well modeled by the HMM. For the...

  • Posted a comment on discussion Speech Recognition on CMU Sphinx

    Sure. Happens all the time. The point is, we're actually performing a search over...

  • Posted a comment on a wiki page on bhiksha

    Yes, that is it. It also includes a "language" score. You could try the T, the A...

View All

Personal Data

Username:
bhiksha
Joined:
2002-01-04 19:20:17

Projects

This is a list of open source software projects that Bhiksha Raj is associated with:

Personal Tools