In your github repository you have the "en-us-phone.lm.bin" file. I want to reduce it further, hence I need the training data. Is it available somewhere?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
In your github repository you have the "en-us-phone.lm.bin" file. I want to reduce it further, hence I need the training data. Is it available somewhere?
You can take any english text and convert it to phone sequences with python script.