We (Me and a colleague) are developing speech recognition on our native language. We followed all the tutorial, and finished a few days ago the building of our accoustic model. We have 15 words, and collected 4 recordings from different people. The results are not accurate, the model only points out 2 correct words in 15 words total.
I'm posting this in order to understand, if possible, what can we do to improve the model acccuracy.
Grateful for your atention,
Hugo Alexandre
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Our tutorial says you need much more data to train a good model, you can collect lectures, maybe audiobooks, not necessary transcribed, you can also check radio shows or podcasts. You need 300+ hours of data from 50-100 speakers.
Once you collect the data you can contact me (nshmyrev@gmail.com), I'll help you to train a good model.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
We (Me and a colleague) are developing speech recognition on our native language. We followed all the tutorial, and finished a few days ago the building of our accoustic model. We have 15 words, and collected 4 recordings from different people. The results are not accurate, the model only points out 2 correct words in 15 words total.
I'm posting this in order to understand, if possible, what can we do to improve the model acccuracy.
Grateful for your atention,
Hugo Alexandre
Hello Hugo
This is a good start.
Our tutorial says you need much more data to train a good model, you can collect lectures, maybe audiobooks, not necessary transcribed, you can also check radio shows or podcasts. You need 300+ hours of data from 50-100 speakers.
Once you collect the data you can contact me (nshmyrev@gmail.com), I'll help you to train a good model.