CMU Sphinx / Forums / Help: FLV => FFMPEG => SPHINX4 problem

Hello

I am just getting to grips with Sphinx 4 and it is looking quite promising for my needs.

This works:
I record from my machine's webcam an audio clip using windows sound recorder at 22Khz. I upload and pass it through FFMPEG to reduce it down to 16Khz. I then run it through Sphinx 4 and it gives me back some words, great!

Doesn't work:
I record my sound using the same webcam but it is now streamed to the server and saved as an FLV file at 22Khz. I do the same, pass it through FFMPEG and reduce it to 16Khz. I now pass it through Sphinx 4 at there are no matches.

The clips look the same when I view the audio details:

Works:
WAVE (.wav) file, byte length: 3260104, data format: PCM_SIGNED 16000.0 Hz, 16 bit, mono, 2 bytes/frame, little-endian, frame length: 1630030

Doesn't Work:
WAVE (.wav) file, byte length: 723744, data format: PCM_SIGNED 16000.0 Hz, 16 bit, mono, 2 bytes/frame, little-endian, frame length: 361850

BTW when I say it doesn't work, the file is accepted but nothing is matched.

I am running 1.0Beta3 on Linux.

Any help would be really appreciated.

Thank you
Ben

If it helps, I can post it all if needed...

&lt;!-- ******************************************************** --&gt;
&lt;!-- The Dictionary configuration                            --&gt;
&lt;!-- ******************************************************** --&gt;
&lt;component name=&quot;dictionary&quot; 
    type=&quot;edu.cmu.sphinx.linguist.dictionary.FastDictionary&quot;&gt;
    &lt;property name=&quot;dictionaryPath&quot;
              value=&quot;resource:/edu.cmu.sphinx.model.acoustic.WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz.Model!/edu/cmu/sphinx/model/acoustic/WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz/dict/cmudict.0.6d&quot;/&gt;
    &lt;property name=&quot;fillerPath&quot; 
          value=&quot;resource:/edu.cmu.sphinx.model.acoustic.WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz.Model!/edu/cmu/sphinx/model/acoustic/WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz/dict/fillerdict&quot;/&gt;
    &lt;property name=&quot;addSilEndingPronunciation&quot; value=&quot;false&quot;/&gt;
    &lt;property name=&quot;wordReplacement&quot; value=&quot;&amp;lt;sil&amp;gt;&quot;/&gt;
    &lt;property name=&quot;unitManager&quot; value=&quot;unitManager&quot;/&gt;
&lt;/component&gt;


&lt;!-- ******************************************************** --&gt;
&lt;!-- The Language Model configuration                         --&gt;
&lt;!-- ******************************************************** --&gt;
&lt;component name=&quot;trigramModel&quot; 
    type=&quot;edu.cmu.sphinx.linguist.language.ngram.SimpleNGramModel&quot;&gt;
    &lt;property name=&quot;location&quot; 
        value=&quot;resource:/edu.cmu.sphinx.demo.transcriber.Transcriber!/edu/cmu/sphinx/demo/transcriber/transcriber.trigram.lm&quot;/&gt;
    &lt;property name=&quot;logMath&quot; value=&quot;logMath&quot;/&gt;
    &lt;property name=&quot;dictionary&quot; value=&quot;dictionary&quot;/&gt;
    &lt;property name=&quot;maxDepth&quot; value=&quot;3&quot;/&gt;
    &lt;property name=&quot;unigramWeight&quot; value=&quot;.7&quot;/&gt;
&lt;/component&gt;


&lt;!-- ******************************************************** --&gt;
&lt;!-- The acoustic model configuration                         --&gt;
&lt;!-- ******************************************************** --&gt;
&lt;component name=&quot;wsj&quot;
           type=&quot;edu.cmu.sphinx.model.acoustic.WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz.Model&quot;&gt;
    &lt;property name=&quot;loader&quot; value=&quot;wsjLoader&quot;/&gt;
    &lt;property name=&quot;unitManager&quot; value=&quot;unitManager&quot;/&gt;
&lt;/component&gt;

&lt;component name=&quot;wsjLoader&quot; type=&quot;edu.cmu.sphinx.model.acoustic.WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz.ModelLoader&quot;&gt;
    &lt;property name=&quot;logMath&quot; value=&quot;logMath&quot;/&gt;
    &lt;property name=&quot;unitManager&quot; value=&quot;unitManager&quot;/&gt;
&lt;/component&gt;

FLV =&gt; FFMPEG =&gt; SPHINX4 problem

Speech Recognition Toolkit

Forums

Help

FLV =&gt; FFMPEG =&gt; SPHINX4 problem document.SUBSCRIPTION_OPTIONS = { "thing": "topic", "subscribed": false, "url": "subscribe", "icon": { "css": "fa fa-envelope-o" } };

FLV => FFMPEG => SPHINX4 problem

FLV => FFMPEG => SPHINX4 problem