What is the minimum number of sentences, or the minimum amount of recorded audio, required to adapt the default acoustic model?
Another question, not related to default acoustic model adaptation:
Consider the following:
"5 hours of recordings of 200 speakers for command and control for many speakers"
Does this mean the same sentence is recorded by multiple users (that is, by all 200 users)?
Or do we have 200 speakers, each recording different sentences?
Waiting for your help.
Sir, I am still waiting for your help.
It's not quite clear what you want to "adapt" to what.
Will the real users of your system speak the same sentence or different sentences? You can easily get the answer with some thinking.