Activity for Bhiksha Raj

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Help

    What do you plan to use the speech rec for? The basic recognizer is just an engine. You can recognize prerecorded files with it. If you want to use it within an application, the APIs must be called. There's documentation on how to do this. -Bhiksha On Wed, Aug 8, 2018 at 11:39 AM, lucifer lucifer258@users.sourceforge.net wrote: I want to use acoustic model availabe on site and develop a speech recognition, I reffered the documentation but it can't help me out . I have completed the starting process...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Speech Recognition

    Its iffy. Spelling to pronunciation rules can be very arcane in most languages, and you end up using much of the capacity of your network to capture these oddities. So in that sense, phoneme recognition is the more natural task. By the same token, though, the spelling oddities, if well captured, can end up providing you with a stronger grammar than just phonetic structure. I expect someone has run the comparison, although I haven't seen any myself. Perhaps I can have Vishal run this test; he's currently...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Help

    We have four types of triphones -- a) word-internal (marked "i" in the mdef files) b) begin-word (e.g. the triphone B (AH, AE) in the word-pair "A BAD ..". Marked "b" in the mdef file) c) end-word (e.g. the triphone B (AE, AH) in the word-pair "GRAB A ..". Marked "e" in the mdef file) d) single-phone-word (e.g. from the words "A" and "I", where the entire word is only one phoneme, so all triphones are both-begin word and end-word triphones. e.g. the triphone "AY (M, AE)" in "SAM I AM". These are...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Sphinx4 Help

    This should be easy. You only have to replace the GMM class with your DNN. You'd...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Help

    You dont need "sudo" in cygwin. -Bhiksha On Sun, May 3, 2015 at 6:41 AM, safia hammad...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Speech Recognition

    The overall duration of the phoneme is reasonably well modeled by the HMM. For the...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Speech Recognition

    Sure. Happens all the time. The point is, we're actually performing a search over...

  • Bhiksha Raj Bhiksha Raj posted a comment on a wiki page

    Yes, that is it. It also includes a "language" score. You could try the T, the A...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Sphinx4 Help

    Train models for all languages you want to identify. Run an allphone decode with...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Help

    That one will be on me.. :-) Ive heard this gripe before, and I couldnt locate the...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Help

    in principle, yes. But if you have enough training data the diffeence is monr. -Bhiksha...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Help

    Not at all. MFCs work just fine for chinese. It is just that you can get slightly...

  • Bhiksha Raj Bhiksha Raj posted a comment on discussion Sphinx4 Help

    Hi Minh, The slides at "asr.cs.cmu.edu/spring2012" might explain. We also have a...

1