Menu

I need a dictionary and a model the language

Help
pini
2006-11-01
2012-09-22
  • pini

    pini - 2006-11-01

    Please I need, a dictionary, the model of language and the file .mdef to make an application of voice recognition.

    The application sirvira to move the mouse with voice commandos, but no matter how hard attempt I to create these archives is impossible to me to obtain that it works.

    the dicionario and the model of the language or which could construct thanks to the application that is offered in this connection http://www.speech.cs.cmu.edu/tools/lmtool.html. But even so it does not work to me.

    Please to have if somebody can help me outside whereupon a small dictionary it is worth to me well. because on the other hand when I prove with the dictionary unloaded the package (cmudict.0.6d) it says perhaps to me that the memory fills because this dictionary contains too many words.

    A greeting and thank you very much

     
    • pini

      pini - 2006-11-01

      as I must create a dicionario and a model of language in Spanish?

       
      • The Grand Janitor

        If you build the system from scratch, you should. but probably you could try to start from this page first.

        http://speech.mty.itesm.mx/~jnolazco/proyectos.htm

        -a

         
        • Vero

          Vero - 2006-11-02

          hi,
          I have a proyect in spanish to, but the rate of recognize of sphinx is very low, do you something else?
          i prove with htk and is very good but the license is a problem.
          bye

          Hola,
          Yo estoy realizando un proyecto en español (Español Colombiano), tengo el diccionario, el modelo acustico y el modelo de lenguaje, pero la tasa de reconocimiento es muy baja, como del 8% (tasa de error 91.88%), alguno de ustedes conoce un reconocedor mejor?
          Ya probe con htk, el reconocimiento es muy bueno casi del 100%, pero hay problemas con la licencia (software solo de experimentacion).
          Chao.

           
    • pini

      pini - 2006-11-01

      Thanks for this connection in dictionary in Spanish me vendra very well, as well as the Model Definition cases out for Spanish, nevertheless the language model that this page provides does not work to me as I can create a file .lm

       
      • The Grand Janitor

        It always take a while to prepare resource. :-) So have fun! -a

         
    • pini

      pini - 2006-11-01

      Hello thank you very much by the aid already and been able to make a reconocedor of voice that accepts a dictionary to me created in Spanish, nevertheless the results are not absolutely good and tendre that for taking other options to make my project.

      to have if I have luck and I can analyze the information stored in an object dataline, so that it recognizes although single it is a pair of words.

      Thank you very much again by the aid

       
      • The Grand Janitor

        You are always welcomed.
        Do note that the dictionary I pointed to you is for Mexican Spanish. You might want to fine-tune for the specific purpose.

        Have fun with Sphinx!
        -tgj

         
    • pini

      pini - 2006-11-02

      If desire a fine degree of frequency that I must change to the file config.xml?.

      This file I have it formed of the following way, and it does not pass anything because he is Spanish Mexican since there is no great difference with the words that I want to use for my application.

      Please hechame a look to the file config.xml to see if you can orient to me that I must change
      <?xml version="1.0" encoding="UTF-8"?>

      <!--
      Sphinx-4 Configuration file
      -->

      <!-- ******** -->
      <!-- biship configuration file -->
      <!-- ******** -->

      <config>
      <!-- ******** -->
      <!-- frequently tuned properties -->
      <!-- ******** -->
      <property name="absoluteBeamWidth" value="500"/>
      <property name="relativeBeamWidth" value="1E-80"/>
      <property name="absoluteWordBeamWidth" value="20"/>
      <property name="relativeWordBeamWidth" value="1E-60"/>
      <property name="wordInsertionProbability" value="1E-16"/>
      <property name="languageWeight" value="7.0"/>
      <property name="silenceInsertionProbability" value=".1"/>
      <property name="frontend" value="epFrontEnd"/>
      <property name="recognizer" value="recognizer"/>
      <property name="showCreations" value="false"/>

      &lt;!-- ******************************************************** --&gt;
      &lt;!-- word recognizer configuration                            --&gt;
      &lt;!-- ******************************************************** --&gt;
      
      &lt;component name=&quot;recognizer&quot; 
                            type=&quot;edu.cmu.sphinx.recognizer.Recognizer&quot;&gt;
          &lt;property name=&quot;decoder&quot; value=&quot;decoder&quot;/&gt;
          &lt;propertylist name=&quot;monitors&quot;&gt;
              &lt;item&gt;accuracyTracker &lt;/item&gt;
              &lt;item&gt;speedTracker &lt;/item&gt;
              &lt;item&gt;memoryTracker &lt;/item&gt;
              &lt;item&gt;recognizerMonitor &lt;/item&gt;
          &lt;/propertylist&gt;
      &lt;/component&gt;
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The Decoder   configuration                              --&gt;
      &lt;!-- ******************************************************** --&gt;
      
      &lt;component name=&quot;decoder&quot; type=&quot;edu.cmu.sphinx.decoder.Decoder&quot;&gt;
          &lt;property name=&quot;searchManager&quot; value=&quot;wordPruningSearchManager&quot;/&gt;
          &lt;property name=&quot;featureBlockSize&quot; value=&quot;50&quot;/&gt;
      &lt;/component&gt;
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The Search Manager                                       --&gt;
      &lt;!-- ******************************************************** --&gt;
      
      &lt;component name=&quot;wordPruningSearchManager&quot; 
      type=&quot;edu.cmu.sphinx.decoder.search.WordPruningBreadthFirstSearchManager&quot;&gt;
          &lt;property name=&quot;logMath&quot; value=&quot;logMath&quot;/&gt;
          &lt;property name=&quot;linguist&quot; value=&quot;lexTreeLinguist&quot;/&gt;
          &lt;property name=&quot;pruner&quot; value=&quot;trivialPruner&quot;/&gt;
          &lt;property name=&quot;scorer&quot; value=&quot;threadedScorer&quot;/&gt;
          &lt;property name=&quot;activeListManager&quot; value=&quot;activeListManager&quot;/&gt;
          &lt;property name=&quot;growSkipInterval&quot; value=&quot;0&quot;/&gt;
          &lt;property name=&quot;checkStateOrder&quot; value=&quot;false&quot;/&gt;
          &lt;property name=&quot;buildWordLattice&quot; value=&quot;false&quot;/&gt;
          &lt;property name=&quot;acousticLookaheadFrames&quot; value=&quot;1.7&quot;/&gt;
          &lt;property name=&quot;relativeBeamWidth&quot; value=&quot;${relativeBeamWidth}&quot;/&gt;
      &lt;/component&gt;
      
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The Active Lists                                         --&gt;
      &lt;!-- ******************************************************** --&gt;
      
      &lt;component name=&quot;activeListManager&quot; 
               type=&quot;edu.cmu.sphinx.decoder.search.SimpleActiveListManager&quot;&gt;
          &lt;propertylist name=&quot;activeListFactories&quot;&gt;
          &lt;item&gt;standardActiveListFactory&lt;/item&gt;
          &lt;item&gt;wordActiveListFactory&lt;/item&gt;
          &lt;item&gt;wordActiveListFactory&lt;/item&gt;
          &lt;item&gt;standardActiveListFactory&lt;/item&gt;
          &lt;item&gt;standardActiveListFactory&lt;/item&gt;
          &lt;item&gt;standardActiveListFactory&lt;/item&gt;
      &lt;/propertylist&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;standardActiveListFactory&quot; 
               type=&quot;edu.cmu.sphinx.decoder.search.PartitionActiveListFactory&quot;&gt;
          &lt;property name=&quot;logMath&quot; value=&quot;logMath&quot;/&gt;
          &lt;property name=&quot;absoluteBeamWidth&quot; value=&quot;${absoluteBeamWidth}&quot;/&gt;
          &lt;property name=&quot;relativeBeamWidth&quot; value=&quot;${relativeBeamWidth}&quot;/&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;wordActiveListFactory&quot; 
               type=&quot;edu.cmu.sphinx.decoder.search.PartitionActiveListFactory&quot;&gt;
          &lt;property name=&quot;logMath&quot; value=&quot;logMath&quot;/&gt;
          &lt;property name=&quot;absoluteBeamWidth&quot; value=&quot;${absoluteWordBeamWidth}&quot;/&gt;
          &lt;property name=&quot;relativeBeamWidth&quot; value=&quot;${relativeWordBeamWidth}&quot;/&gt;
      &lt;/component&gt;
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The Pruner                                               --&gt;
      &lt;!-- ******************************************************** --&gt; 
      &lt;component name=&quot;trivialPruner&quot; 
                  type=&quot;edu.cmu.sphinx.decoder.pruner.SimplePruner&quot;/&gt;
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- TheScorer                                                --&gt;
      &lt;!-- ******************************************************** --&gt; 
      &lt;component name=&quot;threadedScorer&quot; 
                  type=&quot;edu.cmu.sphinx.decoder.scorer.ThreadedAcousticScorer&quot;&gt;
          &lt;property name=&quot;frontend&quot; value=&quot;${frontend}&quot;/&gt;
          &lt;property name=&quot;isCpuRelative&quot; value=&quot;true&quot;/&gt;
          &lt;property name=&quot;numThreads&quot; value=&quot;0&quot;/&gt;
          &lt;property name=&quot;minScoreablesPerThread&quot; value=&quot;10&quot;/&gt;
          &lt;property name=&quot;scoreablesKeepFeature&quot; value=&quot;true&quot;/&gt;
      &lt;/component&gt;
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The linguist  configuration                              --&gt;
      &lt;!-- ******************************************************** --&gt;
      
      &lt;component name=&quot;lexTreeLinguist&quot; 
                  type=&quot;edu.cmu.sphinx.linguist.lextree.LexTreeLinguist&quot;&gt;
          &lt;property name=&quot;logMath&quot; value=&quot;logMath&quot;/&gt;
          &lt;property name=&quot;acousticModel&quot; value=&quot;wsj&quot;/&gt;
          &lt;property name=&quot;languageModel&quot; value=&quot;trigramModel&quot;/&gt;
          &lt;property name=&quot;dictionary&quot; value=&quot;dictionary&quot;/&gt;
          &lt;property name=&quot;addFillerWords&quot; value=&quot;false&quot;/&gt;
          &lt;property name=&quot;fillerInsertionProbability&quot; value=&quot;1E-10&quot;/&gt;
          &lt;property name=&quot;generateUnitStates&quot; value=&quot;false&quot;/&gt;
          &lt;property name=&quot;wantUnigramSmear&quot; value=&quot;true&quot;/&gt;
          &lt;property name=&quot;unigramSmearWeight&quot; value=&quot;1&quot;/&gt;
          &lt;property name=&quot;wordInsertionProbability&quot; 
                  value=&quot;${wordInsertionProbability}&quot;/&gt;
          &lt;property name=&quot;silenceInsertionProbability&quot; 
                  value=&quot;${silenceInsertionProbability}&quot;/&gt;
          &lt;property name=&quot;languageWeight&quot; value=&quot;${languageWeight}&quot;/&gt;
          &lt;property name=&quot;unitManager&quot; value=&quot;unitManager&quot;/&gt;
      &lt;/component&gt;
      
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The Dictionary configuration                            --&gt;
      &lt;!-- ******************************************************** --&gt;
      &lt;component name=&quot;dictionary&quot; 
          type=&quot;edu.cmu.sphinx.linguist.dictionary.FastDictionary&quot;&gt;
          &lt;property name=&quot;dictionaryPath&quot;
                    value=&quot;resource:/edu.cmu.sphinx.model.acoustic.WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz.Model!/edu/cmu/sphinx/model/acoustic/WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz/dict/dicEs.dict&quot;/&gt;
          &lt;property name=&quot;fillerPath&quot; 
                value=&quot;resource:/edu.cmu.sphinx.model.acoustic.WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz.Model!/edu/cmu/sphinx/model/acoustic/WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz/dict/fillerdict&quot;/&gt;
          &lt;property name=&quot;addSilEndingPronunciation&quot; value=&quot;false&quot;/&gt;
          &lt;property name=&quot;wordReplacement&quot; value=&quot;&amp;lt;sil&amp;gt;&quot;/&gt;
          &lt;property name=&quot;unitManager&quot; value=&quot;unitManager&quot;/&gt;
      &lt;/component&gt;
      
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The Language Model configuration                         --&gt;
      &lt;!-- ******************************************************** --&gt;
      &lt;component name=&quot;trigramModel&quot; 
          type=&quot;edu.cmu.sphinx.linguist.language.ngram.SimpleNGramModel&quot;&gt;
          &lt;property name=&quot;location&quot; 
              value=&quot;resource:/AudioReconocedor!/trigramEs.lm&quot;/&gt;
          &lt;property name=&quot;logMath&quot; value=&quot;logMath&quot;/&gt;
          &lt;property name=&quot;dictionary&quot; value=&quot;dictionary&quot;/&gt;
          &lt;property name=&quot;maxDepth&quot; value=&quot;3&quot;/&gt;
          &lt;property name=&quot;unigramWeight&quot; value=&quot;.7&quot;/&gt;
      &lt;/component&gt;
      
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The acoustic model configuration                         --&gt;
      &lt;!-- ******************************************************** --&gt;
      &lt;component name=&quot;wsj&quot;
                 type=&quot;edu.cmu.sphinx.model.acoustic.WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz.Model&quot;&gt;
          &lt;property name=&quot;loader&quot; value=&quot;wsjLoader&quot;/&gt;
          &lt;property name=&quot;unitManager&quot; value=&quot;unitManager&quot;/&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;wsjLoader&quot; type=&quot;edu.cmu.sphinx.model.acoustic.WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz.ModelLoader&quot;&gt;
          &lt;property name=&quot;logMath&quot; value=&quot;logMath&quot;/&gt;
          &lt;property name=&quot;unitManager&quot; value=&quot;unitManager&quot;/&gt;
      &lt;/component&gt;
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The unit manager configuration                           --&gt;
      &lt;!-- ******************************************************** --&gt;
      
      &lt;component name=&quot;unitManager&quot; 
          type=&quot;edu.cmu.sphinx.linguist.acoustic.UnitManager&quot;/&gt;
      
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The frontend configuration                               --&gt;
      &lt;!-- ******************************************************** --&gt;
      
      &lt;component name=&quot;mfcFrontEnd&quot; type=&quot;edu.cmu.sphinx.frontend.FrontEnd&quot;&gt;
          &lt;propertylist name=&quot;pipeline&quot;&gt;
              &lt;item&gt;microphone &lt;/item&gt;
              &lt;item&gt;premphasizer &lt;/item&gt;
              &lt;item&gt;windower &lt;/item&gt;
              &lt;item&gt;fft &lt;/item&gt;
              &lt;item&gt;melFilterBank &lt;/item&gt;
              &lt;item&gt;dct &lt;/item&gt;
              &lt;item&gt;liveCMN &lt;/item&gt;
              &lt;item&gt;featureExtraction &lt;/item&gt;
          &lt;/propertylist&gt;
      &lt;/component&gt;
      
      &lt;!-- ******************************************************** --&gt;
      &lt;!-- The live frontend configuration                          --&gt;
      &lt;!-- ******************************************************** --&gt;
      &lt;component name=&quot;epFrontEnd&quot; type=&quot;edu.cmu.sphinx.frontend.FrontEnd&quot;&gt;
          &lt;propertylist name=&quot;pipeline&quot;&gt;
              &lt;item&gt;microphone &lt;/item&gt;
              &lt;item&gt;speechClassifier &lt;/item&gt;
              &lt;item&gt;speechMarker &lt;/item&gt;
              &lt;item&gt;nonSpeechDataFilter &lt;/item&gt;
              &lt;item&gt;premphasizer &lt;/item&gt;
              &lt;item&gt;windower &lt;/item&gt;
              &lt;item&gt;fft &lt;/item&gt;
              &lt;item&gt;melFilterBank &lt;/item&gt;
              &lt;item&gt;dct &lt;/item&gt;
              &lt;item&gt;liveCMN &lt;/item&gt;
              &lt;item&gt;featureExtraction &lt;/item&gt;
          &lt;/propertylist&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;microphone&quot; 
                  type=&quot;edu.cmu.sphinx.frontend.util.Microphone&quot;&gt;
          &lt;property name=&quot;closeBetweenUtterances&quot; value=&quot;false&quot;/&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;speechClassifier&quot; 
                  type=&quot;edu.cmu.sphinx.frontend.endpoint.SpeechClassifier&quot;&gt;
          &lt;property name=&quot;threshold&quot; value=&quot;13&quot;/&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;nonSpeechDataFilter&quot; 
                  type=&quot;edu.cmu.sphinx.frontend.endpoint.NonSpeechDataFilter&quot;/&gt;
      
      &lt;component name=&quot;speechMarker&quot; 
                  type=&quot;edu.cmu.sphinx.frontend.endpoint.SpeechMarker&quot;&gt;
          &lt;property name=&quot;speechTrailer&quot; value=&quot;50&quot;/&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;premphasizer&quot; 
          type=&quot;edu.cmu.sphinx.frontend.filter.Preemphasizer&quot;/&gt;
      
      &lt;component name=&quot;windower&quot; 
      type=&quot;edu.cmu.sphinx.frontend.window.RaisedCosineWindower&quot;/&gt;
      
      &lt;component name=&quot;fft&quot; 
          type=&quot;edu.cmu.sphinx.frontend.transform.DiscreteFourierTransform&quot;/&gt;
      
      &lt;component name=&quot;melFilterBank&quot; 
          type=&quot;edu.cmu.sphinx.frontend.frequencywarp.MelFrequencyFilterBank&quot;/&gt;
      
      &lt;component name=&quot;dct&quot; 
              type=&quot;edu.cmu.sphinx.frontend.transform.DiscreteCosineTransform&quot;/&gt;
      
      &lt;component name=&quot;liveCMN&quot; 
                  type=&quot;edu.cmu.sphinx.frontend.feature.LiveCMN&quot;/&gt;
      
      &lt;component name=&quot;featureExtraction&quot; 
          type=&quot;edu.cmu.sphinx.frontend.feature.DeltasFeatureExtractor&quot;/&gt;
      
      &lt;!-- ******************************************************* --&gt;
      &lt;!--  monitors                                               --&gt;
      &lt;!-- ******************************************************* --&gt;
      
      &lt;component name=&quot;accuracyTracker&quot; 
                  type=&quot;edu.cmu.sphinx.instrumentation.AccuracyTracker&quot;&gt;
          &lt;property name=&quot;recognizer&quot; value=&quot;${recognizer}&quot;/&gt;
          &lt;property name=&quot;showRawResults&quot; value=&quot;false&quot;/&gt;
          &lt;property name=&quot;showAlignedResults&quot; value=&quot;false&quot;/&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;memoryTracker&quot; 
                  type=&quot;edu.cmu.sphinx.instrumentation.MemoryTracker&quot;&gt;
          &lt;property name=&quot;recognizer&quot; value=&quot;${recognizer}&quot;/&gt;
      &lt;property name=&quot;showDetails&quot; value=&quot;false&quot;/&gt;
      &lt;property name=&quot;showSummary&quot; value=&quot;false&quot;/&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;speedTracker&quot; 
                  type=&quot;edu.cmu.sphinx.instrumentation.SpeedTracker&quot;&gt;
          &lt;property name=&quot;recognizer&quot; value=&quot;${recognizer}&quot;/&gt;
          &lt;property name=&quot;frontend&quot; value=&quot;${frontend}&quot;/&gt;
      &lt;property name=&quot;showDetails&quot; value=&quot;false&quot;/&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;recognizerMonitor&quot; 
                  type=&quot;edu.cmu.sphinx.instrumentation.RecognizerMonitor&quot;&gt;
          &lt;property name=&quot;recognizer&quot; value=&quot;${recognizer}&quot;/&gt;
          &lt;propertylist name=&quot;allocatedMonitors&quot;&gt;
              &lt;item&gt;configMonitor &lt;/item&gt;
          &lt;/propertylist&gt;
      &lt;/component&gt;
      
      &lt;component name=&quot;configMonitor&quot; 
                  type=&quot;edu.cmu.sphinx.instrumentation.ConfigMonitor&quot;&gt;
          &lt;property name=&quot;showConfig&quot; value=&quot;false&quot;/&gt;
      &lt;/component&gt;
      
      
      &lt;!-- ******************************************************* --&gt;
      &lt;!--  Miscellaneous components                               --&gt;
      &lt;!-- ******************************************************* --&gt;
      
      &lt;component name=&quot;logMath&quot; type=&quot;edu.cmu.sphinx.util.LogMath&quot;&gt;
          &lt;property name=&quot;logBase&quot; value=&quot;1.0001&quot;/&gt;
          &lt;property name=&quot;useAddTable&quot; value=&quot;true&quot;/&gt;
      &lt;/component&gt;
      

      </config>

      the archive trigram.ml is:

      Language model created by QuickLM on Wed Nov 1 11:38:15 EST 2006
      Copyright (c) 1996-2000
      Carnegie Mellon University and Alexander I. Rudnicky

      This model based on a corpus of 2 sentences and 20 words
      The (fixed) discount mass is 0.5

      \data\
      ngram 1=20
      ngram 2=20
      ngram 3=18

      \1-grams:
      -1.3424 </s> -0.3010
      -1.3424 <s> -0.2808
      -1.6435 ABAJO -0.2910
      -1.6435 ARRIBA -0.2910
      -1.6435 CINCO -0.2910
      -1.6435 COGER -0.2910
      -1.6435 CUATRO -0.2910
      -1.6435 DEJAR -0.2910
      -1.6435 DERECHA -0.2910
      -1.6435 DOS -0.2910
      -1.6435 FIN -0.2910
      -1.6435 IZQUIERDA -0.2910
      -1.6435 MAS -0.2910
      -1.6435 MENOS -0.2910
      -1.6435 OCHO -0.2808
      -1.6435 PULSAR -0.2910
      -1.6435 SEIS -0.2910
      -1.6435 SIETE -0.2808
      -1.6435 TRES -0.2910
      -1.6435 UNO -0.2910

      \2-grams:
      -0.6021 <s> ARRIBA 0.0000
      -0.6021 <s> OCHO 0.0000
      -0.3010 ABAJO DERECHA 0.0000
      -0.3010 ARRIBA ABAJO 0.0000
      -0.3010 CINCO SEIS 0.0000
      -0.3010 COGER DEJAR 0.0000
      -0.3010 CUATRO CINCO 0.0000
      -0.3010 DEJAR FIN 0.0000
      -0.3010 DERECHA IZQUIERDA 0.0000
      -0.3010 DOS TRES 0.0000
      -0.3010 FIN MAS 0.0000
      -0.3010 IZQUIERDA PULSAR 0.0000
      -0.3010 MAS MENOS 0.0000
      -0.3010 MENOS UNO 0.0000
      -0.3010 OCHO </s> -0.3010
      -0.3010 PULSAR COGER 0.0000
      -0.3010 SEIS SIETE 0.0000
      -0.3010 SIETE </s> -0.3010
      -0.3010 TRES CUATRO 0.0000
      -0.3010 UNO DOS 0.0000

      \3-grams:
      -0.3010 <s> ARRIBA ABAJO
      -0.3010 <s> OCHO </s>
      -0.3010 ABAJO DERECHA IZQUIERDA
      -0.3010 ARRIBA ABAJO DERECHA
      -0.3010 CINCO SEIS SIETE
      -0.3010 COGER DEJAR FIN
      -0.3010 CUATRO CINCO SEIS
      -0.3010 DEJAR FIN MAS
      -0.3010 DERECHA IZQUIERDA PULSAR
      -0.3010 DOS TRES CUATRO
      -0.3010 FIN MAS MENOS
      -0.3010 IZQUIERDA PULSAR COGER
      -0.3010 MAS MENOS UNO
      -0.3010 MENOS UNO DOS
      -0.3010 PULSAR COGER DEJAR
      -0.3010 SEIS SIETE </s>
      -0.3010 TRES CUATRO CINCO
      -0.3010 UNO DOS TRES

      \end\

      And the diccionary is:

      ARRIBA A RR I B A
      ABAJO A B A J O
      DERECHA D E R E CH A
      IZQUIERDA I S K I E R D A

      PULSAR P U L S A R
      COGER K O J E R
      DEJAR D E J A R
      FIN F I N

      MAS M A S
      MENOS M E N O S

      UNO U N O
      DOS D O S
      TRES T R E S
      CUATRO K U A T R O
      CINCO S I N K O
      SEIS S E I S
      SIETE S I E T E
      OCHO O CH O

      thank you

       
    • pini

      pini - 2006-11-03

      Hola vero, yo realice un proyecto con los paquetes de cloudGarden y el reconocimiento no era del todo malo, aunque fallaba a veces, problema... es solo para windows porque necesita un motor de reconocimiento y ademas solo funciona durante treinta dias a menos que consigas una licencia.

      Por favor me podrias mandar lo que as conseguido con htk para hecharle un vistazo???, me interesa que la tasa de reconocimiento sea tan alta, seria a a la direccion ivan_pini@hotmail.com, si quieres cuando me envies lo que has conseguido te reenvio yo la aplicacion que consegui hacer con los paquetes de cloudGarden, se trata de reconocer palabras incluidas en una pequeña gramatica para despues dependendiendo de la palabra dicha mover el raton.

      Un saludo y gracias

       

Log in to post a comment.