Is there any change that I have to make in sphinx_train.cfg file to change from statistical language model to JSGF grammar?
I have the following error when trying to use JSGF grammar using the command: pocketsphinx_batch.exe -hmm xyz -cepdir feat ... -lm etc/hello.jsgf ...
INFO: ngram_model_trie.c(354): Trying to read LM in trie binary format
INFO: ngram_model_trie.c(365): Header doesn't match
INFO: ngram_model_trie.c(177): Trying to read LM in arpa format
INFO: ngram_model_trie.c(70): No \data\ mark in LM file
INFO: ngram_model_trie.c(445): Trying to read LM in dmp format
ERROR: "ngram_model_trie.c", line 458: Wrong magic header size number 234a5347: etc/hello.jsgf is not a dump file
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
a) word-internal (marked "i" in the mdef files)
b) begin-word (e.g. the triphone B (AH, AE) in the word-pair "A BAD
..". Marked "b" in the mdef file)
c) end-word (e.g. the triphone B (AE, AH) in the word-pair "GRAB A
..". Marked "e" in the mdef file)
d) single-phone-word (e.g. from the words "A" and "I", where the
entire word is only one phoneme, so all triphones are both-begin word
and end-word triphones. e.g. the triphone "AY (M, AE)" in "SAM I
AM". These are marked as "s" in the mdef file).
HTK recognizer does differentiate between small letters and capital letter characters. For instance, the phoneme /B/ is different from phoneme /b/. Does CMU sphinx consider /b/ different from /B/ or it considers same phoneme?
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
I have the following error:
...
Decoding 202 segments starting at 1812 (part 10 of 10)
0%
Aligning results to find error rate
word_align.pl failed with error code 65280 at /usr/local/lib/sphinxtrain/scripts/decode/slave.pl line 173.
Any help.
Last edit: Zain 2017-06-11
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Yes
Is there any link or tutorial how to employ "any-word" language model in CUM PocketSphinx.
Is there any change that I have to make in sphinx_train.cfg file to change from statistical language model to JSGF grammar?
I have the following error when trying to use JSGF grammar using the command:
pocketsphinx_batch.exe -hmm xyz -cepdir feat ... -lm etc/hello.jsgf ...
INFO: ngram_model_trie.c(354): Trying to read LM in trie binary format
INFO: ngram_model_trie.c(365): Header doesn't match
INFO: ngram_model_trie.c(177): Trying to read LM in arpa format
INFO: ngram_model_trie.c(70): No \data\ mark in LM file
INFO: ngram_model_trie.c(445): Trying to read LM in dmp format
ERROR: "ngram_model_trie.c", line 458: Wrong magic header size number 234a5347: etc/hello.jsgf is not a dump file
The option for jsgf is
-jsgf
, not-lm
.Is there any option to use only 1-gram or 2-gram of the 3-gram language model during decoding.
No, you have to modify the model outside for example with srilm:
Does CMU Sphinx support cross-word triphones as HTK?
Actually cmusphinx does not support within-word triphones, only cross-word ones.
We have four types of triphones --
a) word-internal (marked "i" in the mdef files)
b) begin-word (e.g. the triphone B (AH, AE) in the word-pair "A BAD
..". Marked "b" in the mdef file)
c) end-word (e.g. the triphone B (AE, AH) in the word-pair "GRAB A
..". Marked "e" in the mdef file)
d) single-phone-word (e.g. from the words "A" and "I", where the
entire word is only one phoneme, so all triphones are both-begin word
and end-word triphones. e.g. the triphone "AY (M, AE)" in "SAM I
AM". These are marked as "s" in the mdef file).
b,c and d are all cross-word triphones.
-Bhiksha
On Fri, May 19, 2017 at 2:57 PM, Zain zainzainzain3@users.sf.net wrote:
--
Bhiksha Raj
Carnegie Mellon University
Pittsburgh, PA, USA
Tel: 412 268 9826
Could you please let me know what is the difference between cmuclmtk and srilm for creating language models.
HTK recognizer does differentiate between small letters and capital letter characters. For instance, the phoneme /B/ is different from phoneme /b/. Does CMU sphinx consider /b/ different from /B/ or it considers same phoneme?
Same here
By "same", do you mean that /B/ and /b/ are same in Sphinx, or do you mean that Sphinx is same as HTK to handle them as two different phonemes?
Yes
I have the following error:
...
Decoding 202 segments starting at 1812 (part 10 of 10)
0%
Aligning results to find error rate
word_align.pl failed with error code 65280 at /usr/local/lib/sphinxtrain/scripts/decode/slave.pl line 173.
Any help.
Last edit: Zain 2017-06-11