They come in .flac.
If I convert them to 16 KHz Mono and .wav using ffmpeg, will they give a good result..
Also, I guess it is free corpus.
Also is WSJ a free corpus? If not, can you suggest me some free corpus besides Librespeech.
Please advice..
Last edit: Senjam Shantirani 2016-05-13
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Following the Acouctic Model creation example using an4 I got the accuracy of 28.1% , which is almost same as given in the website.
Now, I got accuracy of 61% or WER of 39% using TIMIT.
Is this accuracy proper with TIMIT, based on any past study?
TIMIT is weird but for my current study I am using it, as I am struggling to get better data.
Regards,
Senjam Shantirani
It depends on parameters and langauge model you used.
You can download librispeech database instead of timit.
They come in .flac.
If I convert them to 16 KHz Mono and .wav using ffmpeg, will they give a good result..
Also, I guess it is free corpus.
Also is WSJ a free corpus? If not, can you suggest me some free corpus besides Librespeech.
Please advice..
Last edit: Senjam Shantirani 2016-05-13
Yes
No, wsj is not free
tedlium
Last edit: Nickolay V. Shmyrev 2016-05-13