Hello everyone.
I want to add some Arabic words to the en-us.lm.bin language model. I downloaded the en-70k-0.2.lm.gz text language model and decompress the tar gz file. I obtained a the en-70k-0.2.lm with 594.2 MB as a text file. My question is how can I add some of Arabic words ( about 100 words )to that model ?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Problem resolved with the use of ngram of SRILM tools. I created my LM, after that combining between he two models using ngram tool and next convert resulted model to .bin using sphinx_lm_convert .
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hello everyone.
I want to add some Arabic words to the en-us.lm.bin language model. I downloaded the en-70k-0.2.lm.gz text language model and decompress the tar gz file. I obtained a the en-70k-0.2.lm with 594.2 MB as a text file. My question is how can I add some of Arabic words ( about 100 words )to that model ?
Problem resolved with the use of ngram of SRILM tools. I created my LM, after that combining between he two models using ngram tool and next convert resulted model to .bin using sphinx_lm_convert .