Good morning,
I have a question: how many speakers and hours of audio are needed to train a good acoustic model?
Read http://cmusphinx.github.io/wiki/tutorialam
Thanks. Another question: can I adapt a 16 kHz acoustic model with 8 kHz audio?
No, it does not make much sense.
Hello Nickolay, is there a standard text to read when recording audio for Sphinx training?
These days you should never record speech specifically for training; it is better to use existing recordings.
Thank you. What hardware is needed for training?
A good server for a beginner ASR developer is:
Intel® Core™ i7-6700 Quad-Core
64 GB DDR4 RAM
2 x 500 GB SATA 6 Gb/s SSD
GeForce® GTX 1080
Thanks. Do you know whether an 8000 Hz acoustic model is available for Spanish?
Not yet
Hi Nickolay, I would like to know what computation the software performs during transcription and training. If it is a mathematical computation, maybe a GeForce GTX is not the ideal card, since it is aimed at video game graphics. I look forward to your response. Thank you in advance.
Matrix multiplication.
No worries about that.
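To make the "matrix multiplication" point concrete, here is a minimal sketch, assuming a neural-network acoustic model whose layers multiply a weight matrix by a feature vector. The function name `matvec` and the numbers are hypothetical and chosen only for illustration; this is not CMUSphinx code. A GPU speeds up exactly this kind of arithmetic, whether the card is sold for games or not.

```python
# Hypothetical illustration: one layer of a model reduced to a plain
# matrix-vector product (pure Python, no GPU, no toolkit API).

def matvec(weights, features):
    """Multiply a rows x cols weight matrix by a feature vector."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]

# Toy values, assumed only for this example.
weights = [[1.0, 0.0, 2.0],
           [0.5, 1.0, 0.0]]
features = [2.0, 4.0, 1.0]

scores = matvec(weights, features)
print(scores)
```

Real models repeat this operation with much larger matrices for every frame of audio, which is why the workload parallelizes well.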
Thanks Nickolay. We just acquired a graphics card. How do I get CMUSphinx to use the GPU? Is there a configuration option?
CMUSphinx does not support GPUs; other modern toolkits do.
OK, thanks. What do you recommend for speech recognition on a GPU?