I am trying to convert english (.wav) into text using transcriber in sphinx-4.
I have modified in the config.xml the dictionary configuration to use the WSJ model, and also the path in the manifest to the WSJ jar file under /lib.
I got everything compiled and able to run, thought it only outputs the digits.
I tried to modified the .gram file, but I am not sure how to ignore the grammar rules? Or make it to match anything? such that it will output the text of the word said in the .wav.
Thanks for the reply. However, maybe I have an unclear title or even in my post.
In particular, the digit.gram contains the digit grammar, ie: one | two |.... However, in the case which I want to include all the english word in the translation, do I need to do ie: a | aaa | aaberg | until z? <= which is a stupid solution I can think of at the moment.
Is there an easier way of doing it?
Thanks a lot.
-Francis
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hi all,
I am trying to convert english (.wav) into text using transcriber in sphinx-4.
I have modified in the config.xml the dictionary configuration to use the WSJ model, and also the path in the manifest to the WSJ jar file under /lib.
I got everything compiled and able to run, thought it only outputs the digits.
I tried to modified the .gram file, but I am not sure how to ignore the grammar rules? Or make it to match anything? such that it will output the text of the word said in the .wav.
Can anyone able give me some ideas on it?
Thanks in advance.
-Fran
Below is the modified config.xml for reference.
<config>
</component>
</config>
Could it be that here is the Problem ?
<property name="grammarName" value="digits"/>
Hi Chris,
Thanks for the reply. However, maybe I have an unclear title or even in my post.
In particular, the digit.gram contains the digit grammar, ie: one | two |.... However, in the case which I want to include all the english word in the translation, do I need to do ie: a | aaa | aaberg | until z? <= which is a stupid solution I can think of at the moment.
Is there an easier way of doing it?
Thanks a lot.
-Francis