You are welcome to provide the data files to reproduce your problems.
Words might look the same but can be different in UTF-8 representation. Make sure that the byte values in the transcription and the dictionary are the same.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I'm trying to adapt some speech data to the zh_broadcastnews_ptm256_8000 and get to command as below:
I get the problem:
my transcription file:
公安部 交 管 局 通报 四百 家 存在 重大 安全 隐患 运输 企业 并 限期 整改(001)I can find word 公安部 in zh_broadcastnews_utf8.dic,wondering why the enginee can not find the word? please help, thanks.
You are welcome to provide the data files to reproduce your problems.
Words might look the same but can be different in UTF-8 representation. Make sure that the byte values in the transcription and the dictionary are the same.