I have a recording of a person speaking slow and the accuracy is fine,
when he turn to speak fast (i.e. 2-3 wps) the accuracy decreases dramatically,
Is there any configuration for this case?
any pre-processing?
anything else that could be done?
Thanks
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi,
I have a recording of a person speaking slow and the accuracy is fine,
when he turn to speak fast (i.e. 2-3 wps) the accuracy decreases dramatically,
Is there any configuration for this case?
any pre-processing?
anything else that could be done?
Thanks
There is no configuration or preprocessing, you have to adapt an acoustic model or train a new one.
thanks Nickolay,
if that so, and I have just 10 minutes of this fast speaking person,
would you recommand adapting (map_adapt) or use the mllr transfrom ?
Map should be fine.