CMU Sphinx / Forums / Help: Training speech to text

Jose Sanchez - 2018-09-26

Good morning:
I have a question, how many people and hours of audio are needed to train a good acoustic model?

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2018-09-26
  
  Read http://cmusphinx.github.io/wiki/tutorialam
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Jose Sanchez - 2018-09-27

Thanks, another question, can I adapt a 16k acoustic model with 8k audio?

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2018-09-28
  
  No, it does not make much sense.
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Jose Sanchez - 2018-10-25

Hello Nickolay, is there any standard text to start with audio recording for sphinx training?

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2018-10-25
  
  These days you should never record speech specifically for training, you'd better take existing recordings.
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Jose Sanchez - 2018-10-25

Thank you, and what hardware requirements are needed for the training?

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2018-10-27
  
  A good server for a beginner ASR developer is
  
  Intel® Core™ i7-6700 Quad-Core
  64 GB DDR4 RAM
  2 x 500 GB SATA 6 Gb/s SSD
  GeForce® GTX 1080
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
  - Jose Sanchez - 2018-10-30
    
    Thanks, do not know if an acoustic model is available in 8000 Hz for the Spanish language?
    
    If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
    - Nickolay V. Shmyrev - 2018-10-30
      
      Not yet
      
      If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
  - Jose Sanchez - 2018-11-06
    
    Hi Nickolay, I would like to know what is the calculation made by the software at the time of transcription and training, because if a mathematical calculation is made, maybe a Geforce GTX is not the ideal model, since this model is focused on graphics for video game. I remain attentive to your response, thank you in advance.
    
    If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
    - Nickolay V. Shmyrev - 2018-11-06
      
      made by the software at the time of transcription and training
      
      Matrix multiplication.
      
      Geforce GTX is not the ideal model, since this model is focused on graphics for video game
      
      No worries about that.
      
      If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
      - Jose Sanchez - 2018-12-18
        
        Thanks Nickolay, We just acquired a graphics card, how do I get CMU-Sphinx to occupy the gpu? Is there a configuration?
        
        If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
        
        Nickolay V. Shmyrev - 2018-12-18
        
        cmusphinx does not support gpu, other modern toolkit do.
        
        If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
        
        Jose Sanchez - 2018-12-18
        
        ok, thanks and what do you recommend to recognize voice using gpu?
        
        If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Training speech to text

Speech Recognition Toolkit

Forums

Help

Training speech to text document.SUBSCRIPTION_OPTIONS = { "thing": "topic", "subscribed": false, "url": "subscribe", "icon": { "css": "fa fa-envelope-o" } };

Training speech to text