You can subscribe to this list here.
2011 |
Jan
|
Feb
|
Mar
|
Apr
|
May
|
Jun
(4) |
Jul
|
Aug
|
Sep
(1) |
Oct
(4) |
Nov
(1) |
Dec
(14) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2012 |
Jan
(1) |
Feb
(8) |
Mar
|
Apr
(1) |
May
(3) |
Jun
(13) |
Jul
(7) |
Aug
(11) |
Sep
(6) |
Oct
(14) |
Nov
(16) |
Dec
(1) |
2013 |
Jan
(3) |
Feb
(8) |
Mar
(17) |
Apr
(21) |
May
(27) |
Jun
(11) |
Jul
(11) |
Aug
(21) |
Sep
(39) |
Oct
(17) |
Nov
(39) |
Dec
(28) |
2014 |
Jan
(36) |
Feb
(30) |
Mar
(35) |
Apr
(17) |
May
(22) |
Jun
(28) |
Jul
(23) |
Aug
(41) |
Sep
(17) |
Oct
(10) |
Nov
(22) |
Dec
(56) |
2015 |
Jan
(30) |
Feb
(32) |
Mar
(37) |
Apr
(28) |
May
(79) |
Jun
(18) |
Jul
(35) |
Aug
|
Sep
(1) |
Oct
|
Nov
|
Dec
|
From: Tvarog <tv...@gm...> - 2014-01-07 14:35:55
|
It seems that kernel 3.12 is responsible for weird behavior of src/fstbin/ binaries (no idea why though). After downgrading to 3.11 everything was ok for me. On Sun, Dec 29, 2013 at 10:31 AM, Xavier Anguera <xan...@gm...> wrote: > Hi, > I doublechecked that the svn was done correctly (if it was not before, it > is indeed now) and everything looks the same, with the same problem as I > reported above. > Comparing the error I get with another (successful) run on another > database I am suspicious of the LG.fst FST file. Is there a way to check it? > > Thanks, > > X. > > > > > On Sat, Dec 28, 2013 at 9:37 PM, Daniel Povey <dp...@gm...> wrote: > >> Could it be that you did "svn up" only in the fstbin directory and not in >> src? Do "svn up" in src/, and see if you get further updates. >> Dan >> >> >> >> On Sat, Dec 28, 2013 at 12:35 PM, Xavier Anguera <xan...@gm... >wrote: >> >>> Sure, here it is: >>> >>> The error: >>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>> exp/mono/graph_phn >>> # Started at Sat Dec 28 20:47:57 CET 2013 >>> # >>> fstminimizeencoded >>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>> data/lang_test_phn-mono/G.fst >>> fstdeterminizestar --use-log=true >>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>> 0.000358155 -0.000356635 >>> fstcomposecontext --context-size=1 --central-position=0 >>> --read-disambig-syms=data/lang_test_phn-mono/phones/ disambig.int--write-disambig-syms=data/lang_test_ >>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn- mono/tmp/ilabels_1_0 >>> WARNING (fstcomposecontext:main():fstcomposecontext.cc:130) >>> Disambiguation symbols list is empty; this likely indicates an error in >>> data preparation. >>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>> std::vector<B, std::allocator< >>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with >>> Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion >>> `subsequenti >>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && >>> phone_syms_.count(subsequential_symbol) == 0' failed. >>> utils/mkgraph.sh: line 76: 7661 Aborted >>> fstcomposecontext --context-size=$N --central-position=$P >>> --read-disambig-syms=$lang/phones/disambig. >>> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> ERROR: FstHeader::Read: Bad FST header: >>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>> Reading FST: error reading FST header from >>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>> Reading FST: error reading FST header from >>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> >>> The execution of gdb: >>> (gdb) where >>> #0 0x00007ffff6be9475 in *__GI_raise (sig=<optimized out>) at >>> ../nptl/sysdeps/unix/sysv/linux/raise.c:64 >>> #1 0x00007ffff6bec6f0 in *__GI_abort () at abort.c:92 >>> #2 0x00007ffff6be2621 in *__GI___assert_fail ( >>> assertion=0x498448 "subsequential_symbol != 0 && >>> disambig_syms_.count(subsequential_symbol) == 0 && >>> phone_syms_.count(subsequential_symbol) == 0", >>> file=<optimized out>, line=105, >>> function=0x499700 "fst::ContextFstImpl<Arc, >>> LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, >>> std::allocator<_T2> >&, const std::vector<B, std::allocator<_T2> >&, int, >>> int) [with Arc = fst::ArcTpl<fst::T"...) at assert.c:81 >>> #3 0x000000000045b419 in >>> fst::ContextFstImpl<fst::ArcTpl<fst::TropicalWeightTpl<float> >, >>> int>::ContextFstImpl (this=0x6bd520, subsequential_symbol=97, >>> phone_syms=..., disambig_syms=..., N=1, P=0) at >>> ../fstext/context-fst-inl.h:103 >>> #4 0x0000000000457610 in >>> fst::ContextFst<fst::ArcTpl<fst::TropicalWeightTpl<float> >, >>> int>::ContextFst (this=0x7fffffffd100, subsequential_symbol=97, >>> phones=..., disambig_syms=..., N=1, P=0) at >>> ../fstext/context-fst.h:223 >>> #5 0x0000000000455b95 in fst::ComposeContext (disambig_syms_in=..., >>> N=1, P=0, ifst=0x6c5be0, ofst=0x7fffffffd390, ilabels_out=0x7fffffffd3a0) >>> at ../fstext/context-fst-inl.h:522 >>> #6 0x00000000004522a3 in main (argc=7, argv=0x7fffffffdaa8) at >>> fstcomposecontext.cc:138 >>> (gdb) up >>> #1 0x00007ffff6bec6f0 in *__GI_abort () at abort.c:92 >>> 92 abort.c: No such file or directory. >>> (gdb) up >>> #2 0x00007ffff6be2621 in *__GI___assert_fail ( >>> assertion=0x498448 "subsequential_symbol != 0 && >>> disambig_syms_.count(subsequential_symbol) == 0 && >>> phone_syms_.count(subsequential_symbol) == 0", >>> file=<optimized out>, line=105, >>> function=0x499700 "fst::ContextFstImpl<Arc, >>> LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, >>> std::allocator<_T2> >&, const std::vector<B, std::allocator<_T2> >&, int, >>> int) [with Arc = fst::ArcTpl<fst::T"...) at assert.c:81 >>> 81 assert.c: No such file or directory. >>> (gdb) p subsequential_symbol >>> No symbol "subsequential_symbol" in current context. >>> (gdb) up >>> #3 0x000000000045b419 in >>> fst::ContextFstImpl<fst::ArcTpl<fst::TropicalWeightTpl<float> >, >>> int>::ContextFstImpl (this=0x6bd520, subsequential_symbol=97, >>> phone_syms=..., disambig_syms=..., N=1, P=0) at >>> ../fstext/context-fst-inl.h:103 >>> 103 assert(subsequential_symbol != 0 >>> (gdb) p subsequential_symbol >>> $1 = 97 >>> (gdb) p disambig_syms_.count(subsequential_symbol) >>> $2 = 0 >>> (gdb) p phone_syms_.count(subsequential_symbol) >>> $3 = 1 >>> (gdb) p phone_syms_.size() >>> $4 = 78 >>> (gdb) p disambig_syms_.size() >>> $5 = 0 >>> >>> >>> Thanks >>> >>> X. >>> >>> >>> >>> On Sat, Dec 28, 2013 at 9:01 PM, Daniel Povey <dp...@gm...> wrote: >>> >>>> The same error should not have happened. Can you please do the same >>>> steps in gdb as last time, and paste the screen from gdb? >>>> Dan >>>> >>>> >>>> >>>> On Sat, Dec 28, 2013 at 11:49 AM, Xavier Anguera <xan...@gm... >wrote: >>>> >>>>> Dan, >>>>> the same error occurred, just that now I got the extra Warning you >>>>> inserted. >>>>> Should I maybe modify the make_phone_bigram_lang.sh script to copy >>>>> the current disambig.* files into the new lang directory? >>>>> >>>>> Thanks, >>>>> >>>>> X. >>>>> >>>>> >>>>> >>>>> On Sat, Dec 28, 2013 at 8:03 PM, Daniel Povey <dp...@gm...>wrote: >>>>> >>>>>> OK, then try running the script with the code fix I checked in. I >>>>>> forgot about the existence of that script. Possibly it will work. I'll >>>>>> have to modify validate_lang.pl in that case. >>>>>> Dan >>>>>> >>>>>> >>>>>> >>>>>> On Sat, Dec 28, 2013 at 7:02 AM, Xavier Anguera <xan...@gm... >wrote: >>>>>> >>>>>>> Dan, >>>>>>> there must be something I do not do correctly in my current setup, >>>>>>> or you did not understand where my problem is. >>>>>>> I am currently calling the script mkgraph.sh (that is crashing) in >>>>>>> the following context: >>>>>>> >>>>>>> # Create phone-bigram grammar (unsmoothed) estimated from >>>>>>> alignments >>>>>>> utils/make_phone_bigram_lang.sh data/lang exp/mono_ali_all >>>>>>> data/lang_test_phn-mono || exit 1; >>>>>>> # Create phone recognition graph >>>>>>> $train_cmd exp/mono/graph/mkgraph_phn.log \ >>>>>>> utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>>>> exp/mono/graph_phn || exit 1 >>>>>>> >>>>>>> As you can see, first the script make_phone_bigram_lang.sh is >>>>>>> called, which takes as an input a lang directory and creates a "test" lang >>>>>>> directory. Looking into this script I see that the disambig.* files are >>>>>>> left empty in purpose in the new directory (they are not empty in the >>>>>>> original lang directory, in fact, they have the #0 #1 values you proposed >>>>>>> in the previous email). >>>>>>> Then, when calling the mkgraph.sh script with this test_lang >>>>>>> directory it complaints as stated in my previous emails. >>>>>>> The question is then whether I should modify >>>>>>> make_phone_bigram_lang.sh to copy the original disambig.* files or should I >>>>>>> pass the original lang directory to the mkgraph.sh script, or am I doing >>>>>>> something else very wrong? >>>>>>> >>>>>>> Thanks for your help. >>>>>>> >>>>>>> Xavier Anguera >>>>>>> >>>>>>> >>>>>>> On Sat, Dec 28, 2013 at 1:43 AM, Daniel Povey <dp...@gm... >wrote: >>>>>>> >>>>>>>> OK, I just committed a fix because it should not have crashed at >>>>>>>> that particular point in the code, but the underlying error is with your >>>>>>>> lang directory. You do need to have the disambiguation symbols >>>>>>>> "disambig.txt", with at least #0 and #1. You should probably be creating >>>>>>>> the lang directory with the prepare_lang.sh script, and if not, at least >>>>>>>> you should validate it with the validate_lang.pl script. Also, >>>>>>>> there is no reason to have a separate "lang" directory for the monophone >>>>>>>> setup, the same directory is valid for monophone or triphone setups. >>>>>>>> >>>>>>>> Dan >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> On Fri, Dec 27, 2013 at 4:18 PM, Xavier Anguera <xan...@gm... >>>>>>>> > wrote: >>>>>>>> >>>>>>>>> Dear Dan, >>>>>>>>> thank you for your help. >>>>>>>>> Next are the tests you asked me to perform: >>>>>>>>> >>>>>>>>> Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: >>>>>>>>> >>>>>>>>> Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... >>>>>>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/roots.int corresponds to >>>>>>>>> data/lang_test_phn-mono//phones/roots.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK >>>>>>>>> >>>>>>>>> Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... >>>>>>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/sets.int corresponds to >>>>>>>>> data/lang_test_phn-mono//phones/sets.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK >>>>>>>>> >>>>>>>>> Checking data/lang_test_phn-mono//phones/extra_questions.{txt, >>>>>>>>> int} ... >>>>>>>>> --> 9 entry/entries in >>>>>>>>> data/lang_test_phn-mono//phones/extra_questions.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/extra_questions.intcorresponds to data/lang_test_phn-mono//phones/extra_questions.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are >>>>>>>>> OK >>>>>>>>> >>>>>>>>> Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... >>>>>>>>> --> silence.txt and nonsilence.txt are disjoint >>>>>>>>> --> silence.txt and disambig.txt are disjoint >>>>>>>>> --> disambig.txt and nonsilence.txt are disjoint >>>>>>>>> --> disjoint property is OK >>>>>>>>> >>>>>>>>> Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... >>>>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty >>>>>>>>> or not exists >>>>>>>>> >>>>>>>>> Checking optional_silence.txt ... >>>>>>>>> --> reading data/lang_test_phn-mono//phones/optional_silence.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/optional_silence.txt is OK >>>>>>>>> >>>>>>>>> Checking disambiguation symbols: #0 and #1 >>>>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty >>>>>>>>> or not exists >>>>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't >>>>>>>>> have "#0" or "#1" >>>>>>>>> Checking topo ... >>>>>>>>> --> data/lang_test_phn-mono//topo's nonsilence section is OK >>>>>>>>> --> data/lang_test_phn-mono//topo's silence section is OK >>>>>>>>> --> data/lang_test_phn-mono//topo is OK >>>>>>>>> >>>>>>>>> Checking data/lang_test_phn-mono//oov.{txt, int} ... >>>>>>>>> --> ERROR: fail to open data/lang_test_phn-mono//oov.txt >>>>>>>>> >>>>>>>>> --> ERROR >>>>>>>>> >>>>>>>>> Apparently I do not have either oov.txt nore disambig.txt >>>>>>>>> Probably the test data I am using does not have any OOV in it. I >>>>>>>>> can add it artificially, but I guess this is not the main problem here... >>>>>>>>> regarding the disambig.txt file, what should it contain? >>>>>>>>> >>>>>>>>> I did run gdb as you indicated (thank you for such detailed info) >>>>>>>>> and gives me: >>>>>>>>> (gdb) p subsequential_symbol >>>>>>>>> $1 = 97 >>>>>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>>>>> $2 = 0 >>>>>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>>>>> $3 = 1 >>>>>>>>> (gdb) p phone_syms_.size() >>>>>>>>> $4 = 78 >>>>>>>>> (gdb) p disambig_syms_.size() >>>>>>>>> $5 = 0 >>>>>>>>> >>>>>>>>> Finally, the contents of cat data/lang_test_phn-mono/phones/ >>>>>>>>> disambig.int is also empty. >>>>>>>>> >>>>>>>>> Thanks again for your help! >>>>>>>>> >>>>>>>>> yours, >>>>>>>>> >>>>>>>>> Xavier Anguera >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm... >wrote: >>>>>>>>> >>>>>>>>>> Could you please do the following. [apologies if you already know >>>>>>>>>> gdb] >>>>>>>>>> >>>>>>>>>> First do utils/validate_lang.pl data/lang_test_phn-mono/ >>>>>>>>>> and let me know if it fails. >>>>>>>>>> If it doesn't fail, do: >>>>>>>>>> >>>>>>>>>> gdb --args fstcomposecontext --context-size=1 >>>>>>>>>> --central-position=0 --read-disambig-syms=data/ >>>>>>>>>> lang_test_phn-mono/phones/disambig.int >>>>>>>>>> --write-disambig-syms=data/lang_test_ >>>>>>>>>> phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>>>>>> data/lang_test_phn-mono/tmp/LG.fst >>>>>>>>>> >>>>>>>>>> (gdb) r >>>>>>>>>> # wait till it crashes >>>>>>>>>> # go up the stack by typing "up" until you get to the right >>>>>>>>>> frame; type "down" if you go too far >>>>>>>>>> >>>>>>>>>> (gdb) p subsequential_symbol >>>>>>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>>>>>> (gdb) p phone_syms_.size() >>>>>>>>>> (gdb) p disambig_syms_.size() >>>>>>>>>> (gdb) quit >>>>>>>>>> >>>>>>>>>> [I hope this works; sometimes it will fail because functions are >>>>>>>>>> inlined]. >>>>>>>>>> Anyway, send the output, and also >>>>>>>>>> cat data/lang_test_phn-mono/phones/disambig.int >>>>>>>>>> and show me that output too. >>>>>>>>>> >>>>>>>>>> Dan >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera < >>>>>>>>>> xan...@gm...> wrote: >>>>>>>>>> >>>>>>>>>>> Dear all, >>>>>>>>>>> I am encounering a problem when training mono-state NN using a >>>>>>>>>>> recipe adapted from the SWBD S5 recipe. I am able to train, decode and >>>>>>>>>>> phone-align a GMM system, but when I use these results to train the NN I >>>>>>>>>>> get the following error (see below). I have used this recipe in the past to >>>>>>>>>>> successfully train one ASR system and now the only difference is that I am >>>>>>>>>>> trying to train a similar system using graphemes are phonemes (for which I >>>>>>>>>>> have assigned the graphemes of the words as transcriptions to each word). >>>>>>>>>>> Any help is appreciated. >>>>>>>>>>> >>>>>>>>>>> This is the beginning of the file exp/mono/graph/mkgraph_phn. log: >>>>>>>>>>> >>>>>>>>>>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>>>>>>>> exp/mono/graph_phn >>>>>>>>>>> # Started at Fri Dec 27 18:57:19 CET 2013 >>>>>>>>>>> # >>>>>>>>>>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>>>>>>>>>> data/lang_test_phn-mono/G.fst >>>>>>>>>>> fstdeterminizestar --use-log=true >>>>>>>>>>> fstminimizeencoded >>>>>>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>>>>>>>>>> 0.000358155 -0.000356635 >>>>>>>>>>> fstcomposecontext --context-size=1 --central-position=0 >>>>>>>>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/ disambig.int--write-disambig-syms=data/lang_test_ >>>>>>>>>>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn- mono/tmp/ilabels_1_0 >>>>>>>>>>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>>>>>>>>>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>>>>>>>>>> std::vector<B, std::allocator< >>>>>>>>>>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) >>>>>>>>>>> [with Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: >>>>>>>>>>> Assertion `subsequenti >>>>>>>>>>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == >>>>>>>>>>> 0 && phone_syms_.count(subsequential_symbol) == 0' failed. >>>>>>>>>>> utils/mkgraph.sh: line 76: 6263 Aborted >>>>>>>>>>> fstcomposecontext --context-size=$N --central-position=$P >>>>>>>>>>> --read-disambig-syms=$lang/phones/disambig. >>>>>>>>>>> int >>>>>>>>>>> --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>>>>>>>>>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>>>>>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>>>> ERROR: FstHeader::Read: Bad FST header: >>>>>>>>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>>>> ERROR >>>>>>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/ CLG_1_0.fst >>>>>>>>>>> ERROR >>>>>>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/ CLG_1_0.fst >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> ------------------------------------------------------------ ------------------ >>>>>>>>>>> Rapidly troubleshoot problems before they affect your business. >>>>>>>>>>> Most IT >>>>>>>>>>> organizations don't have a clear picture of how application >>>>>>>>>>> performance >>>>>>>>>>> affects their revenue. With AppDynamics, you get 100% visibility >>>>>>>>>>> into your >>>>>>>>>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of >>>>>>>>>>> AppDynamics Pro! >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> _______________________________________________ >>>>>>>>>>> Kaldi-developers mailing list >>>>>>>>>>> Kal...@li... >>>>>>>>>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> > |
From: Daniel P. <dp...@gm...> - 2013-12-29 19:58:44
|
You should redo "make" and, if that fails, do "make depend" and then "make", and if that fails, "make clean" and "make". Your line numbers show that your source is out of date. Dan On Sun, Dec 29, 2013 at 10:31 AM, Xavier Anguera <xan...@gm...> wrote: > Hi, > I doublechecked that the svn was done correctly (if it was not before, it > is indeed now) and everything looks the same, with the same problem as I > reported above. > Comparing the error I get with another (successful) run on another > database I am suspicious of the LG.fst FST file. Is there a way to check it? > > Thanks, > > X. > > > > > On Sat, Dec 28, 2013 at 9:37 PM, Daniel Povey <dp...@gm...> wrote: > >> Could it be that you did "svn up" only in the fstbin directory and not in >> src? Do "svn up" in src/, and see if you get further updates. >> Dan >> >> >> >> On Sat, Dec 28, 2013 at 12:35 PM, Xavier Anguera <xan...@gm...>wrote: >> >>> Sure, here it is: >>> >>> The error: >>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>> exp/mono/graph_phn >>> # Started at Sat Dec 28 20:47:57 CET 2013 >>> # >>> fstminimizeencoded >>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>> data/lang_test_phn-mono/G.fst >>> fstdeterminizestar --use-log=true >>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>> 0.000358155 -0.000356635 >>> fstcomposecontext --context-size=1 --central-position=0 >>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >>> WARNING (fstcomposecontext:main():fstcomposecontext.cc:130) >>> Disambiguation symbols list is empty; this likely indicates an error in >>> data preparation. >>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>> std::vector<B, std::allocator< >>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with >>> Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion >>> `subsequenti >>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && >>> phone_syms_.count(subsequential_symbol) == 0' failed. >>> utils/mkgraph.sh: line 76: 7661 Aborted >>> fstcomposecontext --context-size=$N --central-position=$P >>> --read-disambig-syms=$lang/phones/disambig. >>> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> ERROR: FstHeader::Read: Bad FST header: >>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>> Reading FST: error reading FST header from >>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>> Reading FST: error reading FST header from >>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> >>> The execution of gdb: >>> (gdb) where >>> #0 0x00007ffff6be9475 in *__GI_raise (sig=<optimized out>) at >>> ../nptl/sysdeps/unix/sysv/linux/raise.c:64 >>> #1 0x00007ffff6bec6f0 in *__GI_abort () at abort.c:92 >>> #2 0x00007ffff6be2621 in *__GI___assert_fail ( >>> assertion=0x498448 "subsequential_symbol != 0 && >>> disambig_syms_.count(subsequential_symbol) == 0 && >>> phone_syms_.count(subsequential_symbol) == 0", >>> file=<optimized out>, line=105, >>> function=0x499700 "fst::ContextFstImpl<Arc, >>> LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, >>> std::allocator<_T2> >&, const std::vector<B, std::allocator<_T2> >&, int, >>> int) [with Arc = fst::ArcTpl<fst::T"...) at assert.c:81 >>> #3 0x000000000045b419 in >>> fst::ContextFstImpl<fst::ArcTpl<fst::TropicalWeightTpl<float> >, >>> int>::ContextFstImpl (this=0x6bd520, subsequential_symbol=97, >>> phone_syms=..., disambig_syms=..., N=1, P=0) at >>> ../fstext/context-fst-inl.h:103 >>> #4 0x0000000000457610 in >>> fst::ContextFst<fst::ArcTpl<fst::TropicalWeightTpl<float> >, >>> int>::ContextFst (this=0x7fffffffd100, subsequential_symbol=97, >>> phones=..., disambig_syms=..., N=1, P=0) at >>> ../fstext/context-fst.h:223 >>> #5 0x0000000000455b95 in fst::ComposeContext (disambig_syms_in=..., >>> N=1, P=0, ifst=0x6c5be0, ofst=0x7fffffffd390, ilabels_out=0x7fffffffd3a0) >>> at ../fstext/context-fst-inl.h:522 >>> #6 0x00000000004522a3 in main (argc=7, argv=0x7fffffffdaa8) at >>> fstcomposecontext.cc:138 >>> (gdb) up >>> #1 0x00007ffff6bec6f0 in *__GI_abort () at abort.c:92 >>> 92 abort.c: No such file or directory. >>> (gdb) up >>> #2 0x00007ffff6be2621 in *__GI___assert_fail ( >>> assertion=0x498448 "subsequential_symbol != 0 && >>> disambig_syms_.count(subsequential_symbol) == 0 && >>> phone_syms_.count(subsequential_symbol) == 0", >>> file=<optimized out>, line=105, >>> function=0x499700 "fst::ContextFstImpl<Arc, >>> LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, >>> std::allocator<_T2> >&, const std::vector<B, std::allocator<_T2> >&, int, >>> int) [with Arc = fst::ArcTpl<fst::T"...) at assert.c:81 >>> 81 assert.c: No such file or directory. >>> (gdb) p subsequential_symbol >>> No symbol "subsequential_symbol" in current context. >>> (gdb) up >>> #3 0x000000000045b419 in >>> fst::ContextFstImpl<fst::ArcTpl<fst::TropicalWeightTpl<float> >, >>> int>::ContextFstImpl (this=0x6bd520, subsequential_symbol=97, >>> phone_syms=..., disambig_syms=..., N=1, P=0) at >>> ../fstext/context-fst-inl.h:103 >>> 103 assert(subsequential_symbol != 0 >>> (gdb) p subsequential_symbol >>> $1 = 97 >>> (gdb) p disambig_syms_.count(subsequential_symbol) >>> $2 = 0 >>> (gdb) p phone_syms_.count(subsequential_symbol) >>> $3 = 1 >>> (gdb) p phone_syms_.size() >>> $4 = 78 >>> (gdb) p disambig_syms_.size() >>> $5 = 0 >>> >>> >>> Thanks >>> >>> X. >>> >>> >>> >>> On Sat, Dec 28, 2013 at 9:01 PM, Daniel Povey <dp...@gm...> wrote: >>> >>>> The same error should not have happened. Can you please do the same >>>> steps in gdb as last time, and paste the screen from gdb? >>>> Dan >>>> >>>> >>>> >>>> On Sat, Dec 28, 2013 at 11:49 AM, Xavier Anguera <xan...@gm...>wrote: >>>> >>>>> Dan, >>>>> the same error occurred, just that now I got the extra Warning you >>>>> inserted. >>>>> Should I maybe modify the make_phone_bigram_lang.sh script to copy >>>>> the current disambig.* files into the new lang directory? >>>>> >>>>> Thanks, >>>>> >>>>> X. >>>>> >>>>> >>>>> >>>>> On Sat, Dec 28, 2013 at 8:03 PM, Daniel Povey <dp...@gm...>wrote: >>>>> >>>>>> OK, then try running the script with the code fix I checked in. I >>>>>> forgot about the existence of that script. Possibly it will work. I'll >>>>>> have to modify validate_lang.pl in that case. >>>>>> Dan >>>>>> >>>>>> >>>>>> >>>>>> On Sat, Dec 28, 2013 at 7:02 AM, Xavier Anguera <xan...@gm...>wrote: >>>>>> >>>>>>> Dan, >>>>>>> there must be something I do not do correctly in my current setup, >>>>>>> or you did not understand where my problem is. >>>>>>> I am currently calling the script mkgraph.sh (that is crashing) in >>>>>>> the following context: >>>>>>> >>>>>>> # Create phone-bigram grammar (unsmoothed) estimated from >>>>>>> alignments >>>>>>> utils/make_phone_bigram_lang.sh data/lang exp/mono_ali_all >>>>>>> data/lang_test_phn-mono || exit 1; >>>>>>> # Create phone recognition graph >>>>>>> $train_cmd exp/mono/graph/mkgraph_phn.log \ >>>>>>> utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>>>> exp/mono/graph_phn || exit 1 >>>>>>> >>>>>>> As you can see, first the script make_phone_bigram_lang.sh is >>>>>>> called, which takes as an input a lang directory and creates a "test" lang >>>>>>> directory. Looking into this script I see that the disambig.* files are >>>>>>> left empty in purpose in the new directory (they are not empty in the >>>>>>> original lang directory, in fact, they have the #0 #1 values you proposed >>>>>>> in the previous email). >>>>>>> Then, when calling the mkgraph.sh script with this test_lang >>>>>>> directory it complaints as stated in my previous emails. >>>>>>> The question is then whether I should modify >>>>>>> make_phone_bigram_lang.sh to copy the original disambig.* files or should I >>>>>>> pass the original lang directory to the mkgraph.sh script, or am I doing >>>>>>> something else very wrong? >>>>>>> >>>>>>> Thanks for your help. >>>>>>> >>>>>>> Xavier Anguera >>>>>>> >>>>>>> >>>>>>> On Sat, Dec 28, 2013 at 1:43 AM, Daniel Povey <dp...@gm...>wrote: >>>>>>> >>>>>>>> OK, I just committed a fix because it should not have crashed at >>>>>>>> that particular point in the code, but the underlying error is with your >>>>>>>> lang directory. You do need to have the disambiguation symbols >>>>>>>> "disambig.txt", with at least #0 and #1. You should probably be creating >>>>>>>> the lang directory with the prepare_lang.sh script, and if not, at least >>>>>>>> you should validate it with the validate_lang.pl script. Also, >>>>>>>> there is no reason to have a separate "lang" directory for the monophone >>>>>>>> setup, the same directory is valid for monophone or triphone setups. >>>>>>>> >>>>>>>> Dan >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> On Fri, Dec 27, 2013 at 4:18 PM, Xavier Anguera <xan...@gm... >>>>>>>> > wrote: >>>>>>>> >>>>>>>>> Dear Dan, >>>>>>>>> thank you for your help. >>>>>>>>> Next are the tests you asked me to perform: >>>>>>>>> >>>>>>>>> Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: >>>>>>>>> >>>>>>>>> Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... >>>>>>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/roots.int corresponds to >>>>>>>>> data/lang_test_phn-mono//phones/roots.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK >>>>>>>>> >>>>>>>>> Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... >>>>>>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/sets.int corresponds to >>>>>>>>> data/lang_test_phn-mono//phones/sets.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK >>>>>>>>> >>>>>>>>> Checking data/lang_test_phn-mono//phones/extra_questions.{txt, >>>>>>>>> int} ... >>>>>>>>> --> 9 entry/entries in >>>>>>>>> data/lang_test_phn-mono//phones/extra_questions.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/extra_questions.intcorresponds to data/lang_test_phn-mono//phones/extra_questions.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are >>>>>>>>> OK >>>>>>>>> >>>>>>>>> Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... >>>>>>>>> --> silence.txt and nonsilence.txt are disjoint >>>>>>>>> --> silence.txt and disambig.txt are disjoint >>>>>>>>> --> disambig.txt and nonsilence.txt are disjoint >>>>>>>>> --> disjoint property is OK >>>>>>>>> >>>>>>>>> Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... >>>>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty >>>>>>>>> or not exists >>>>>>>>> >>>>>>>>> Checking optional_silence.txt ... >>>>>>>>> --> reading data/lang_test_phn-mono//phones/optional_silence.txt >>>>>>>>> --> data/lang_test_phn-mono//phones/optional_silence.txt is OK >>>>>>>>> >>>>>>>>> Checking disambiguation symbols: #0 and #1 >>>>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty >>>>>>>>> or not exists >>>>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't >>>>>>>>> have "#0" or "#1" >>>>>>>>> Checking topo ... >>>>>>>>> --> data/lang_test_phn-mono//topo's nonsilence section is OK >>>>>>>>> --> data/lang_test_phn-mono//topo's silence section is OK >>>>>>>>> --> data/lang_test_phn-mono//topo is OK >>>>>>>>> >>>>>>>>> Checking data/lang_test_phn-mono//oov.{txt, int} ... >>>>>>>>> --> ERROR: fail to open data/lang_test_phn-mono//oov.txt >>>>>>>>> >>>>>>>>> --> ERROR >>>>>>>>> >>>>>>>>> Apparently I do not have either oov.txt nore disambig.txt >>>>>>>>> Probably the test data I am using does not have any OOV in it. I >>>>>>>>> can add it artificially, but I guess this is not the main problem here... >>>>>>>>> regarding the disambig.txt file, what should it contain? >>>>>>>>> >>>>>>>>> I did run gdb as you indicated (thank you for such detailed info) >>>>>>>>> and gives me: >>>>>>>>> (gdb) p subsequential_symbol >>>>>>>>> $1 = 97 >>>>>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>>>>> $2 = 0 >>>>>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>>>>> $3 = 1 >>>>>>>>> (gdb) p phone_syms_.size() >>>>>>>>> $4 = 78 >>>>>>>>> (gdb) p disambig_syms_.size() >>>>>>>>> $5 = 0 >>>>>>>>> >>>>>>>>> Finally, the contents of cat data/lang_test_phn-mono/phones/ >>>>>>>>> disambig.int is also empty. >>>>>>>>> >>>>>>>>> Thanks again for your help! >>>>>>>>> >>>>>>>>> yours, >>>>>>>>> >>>>>>>>> Xavier Anguera >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm...>wrote: >>>>>>>>> >>>>>>>>>> Could you please do the following. [apologies if you already know >>>>>>>>>> gdb] >>>>>>>>>> >>>>>>>>>> First do utils/validate_lang.pl data/lang_test_phn-mono/ >>>>>>>>>> and let me know if it fails. >>>>>>>>>> If it doesn't fail, do: >>>>>>>>>> >>>>>>>>>> gdb --args fstcomposecontext --context-size=1 >>>>>>>>>> --central-position=0 --read-disambig-syms=data/ >>>>>>>>>> lang_test_phn-mono/phones/disambig.int >>>>>>>>>> --write-disambig-syms=data/lang_test_ >>>>>>>>>> phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>>>>>> data/lang_test_phn-mono/tmp/LG.fst >>>>>>>>>> >>>>>>>>>> (gdb) r >>>>>>>>>> # wait till it crashes >>>>>>>>>> # go up the stack by typing "up" until you get to the right >>>>>>>>>> frame; type "down" if you go too far >>>>>>>>>> >>>>>>>>>> (gdb) p subsequential_symbol >>>>>>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>>>>>> (gdb) p phone_syms_.size() >>>>>>>>>> (gdb) p disambig_syms_.size() >>>>>>>>>> (gdb) quit >>>>>>>>>> >>>>>>>>>> [I hope this works; sometimes it will fail because functions are >>>>>>>>>> inlined]. >>>>>>>>>> Anyway, send the output, and also >>>>>>>>>> cat data/lang_test_phn-mono/phones/disambig.int >>>>>>>>>> and show me that output too. >>>>>>>>>> >>>>>>>>>> Dan >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera < >>>>>>>>>> xan...@gm...> wrote: >>>>>>>>>> >>>>>>>>>>> Dear all, >>>>>>>>>>> I am encounering a problem when training mono-state NN using a >>>>>>>>>>> recipe adapted from the SWBD S5 recipe. I am able to train, decode and >>>>>>>>>>> phone-align a GMM system, but when I use these results to train the NN I >>>>>>>>>>> get the following error (see below). I have used this recipe in the past to >>>>>>>>>>> successfully train one ASR system and now the only difference is that I am >>>>>>>>>>> trying to train a similar system using graphemes are phonemes (for which I >>>>>>>>>>> have assigned the graphemes of the words as transcriptions to each word). >>>>>>>>>>> Any help is appreciated. >>>>>>>>>>> >>>>>>>>>>> This is the beginning of the file exp/mono/graph/mkgraph_phn.log: >>>>>>>>>>> >>>>>>>>>>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>>>>>>>> exp/mono/graph_phn >>>>>>>>>>> # Started at Fri Dec 27 18:57:19 CET 2013 >>>>>>>>>>> # >>>>>>>>>>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>>>>>>>>>> data/lang_test_phn-mono/G.fst >>>>>>>>>>> fstdeterminizestar --use-log=true >>>>>>>>>>> fstminimizeencoded >>>>>>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>>>>>>>>>> 0.000358155 -0.000356635 >>>>>>>>>>> fstcomposecontext --context-size=1 --central-position=0 >>>>>>>>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >>>>>>>>>>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>>>>>>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>>>>>>>>>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>>>>>>>>>> std::vector<B, std::allocator< >>>>>>>>>>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) >>>>>>>>>>> [with Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: >>>>>>>>>>> Assertion `subsequenti >>>>>>>>>>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == >>>>>>>>>>> 0 && phone_syms_.count(subsequential_symbol) == 0' failed. >>>>>>>>>>> utils/mkgraph.sh: line 76: 6263 Aborted >>>>>>>>>>> fstcomposecontext --context-size=$N --central-position=$P >>>>>>>>>>> --read-disambig-syms=$lang/phones/disambig. >>>>>>>>>>> int >>>>>>>>>>> --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>>>>>>>>>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>>>>>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>>>> ERROR: FstHeader::Read: Bad FST header: >>>>>>>>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>>>> ERROR >>>>>>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>>>> ERROR >>>>>>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> ------------------------------------------------------------------------------ >>>>>>>>>>> Rapidly troubleshoot problems before they affect your business. >>>>>>>>>>> Most IT >>>>>>>>>>> organizations don't have a clear picture of how application >>>>>>>>>>> performance >>>>>>>>>>> affects their revenue. With AppDynamics, you get 100% visibility >>>>>>>>>>> into your >>>>>>>>>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of >>>>>>>>>>> AppDynamics Pro! >>>>>>>>>>> >>>>>>>>>>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk >>>>>>>>>>> _______________________________________________ >>>>>>>>>>> Kaldi-developers mailing list >>>>>>>>>>> Kal...@li... >>>>>>>>>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> > |
From: Xavier A. <xan...@gm...> - 2013-12-29 18:31:46
|
Hi, I doublechecked that the svn was done correctly (if it was not before, it is indeed now) and everything looks the same, with the same problem as I reported above. Comparing the error I get with another (successful) run on another database I am suspicious of the LG.fst FST file. Is there a way to check it? Thanks, X. On Sat, Dec 28, 2013 at 9:37 PM, Daniel Povey <dp...@gm...> wrote: > Could it be that you did "svn up" only in the fstbin directory and not in > src? Do "svn up" in src/, and see if you get further updates. > Dan > > > > On Sat, Dec 28, 2013 at 12:35 PM, Xavier Anguera <xan...@gm...>wrote: > >> Sure, here it is: >> >> The error: >> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >> exp/mono/graph_phn >> # Started at Sat Dec 28 20:47:57 CET 2013 >> # >> fstminimizeencoded >> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >> data/lang_test_phn-mono/G.fst >> fstdeterminizestar --use-log=true >> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >> 0.000358155 -0.000356635 >> fstcomposecontext --context-size=1 --central-position=0 >> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >> WARNING (fstcomposecontext:main():fstcomposecontext.cc:130) >> Disambiguation symbols list is empty; this likely indicates an error in >> data preparation. >> fstcomposecontext: ../fstext/context-fst-inl.h:105: >> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >> std::vector<B, std::allocator< >> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with Arc >> = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion >> `subsequenti >> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && >> phone_syms_.count(subsequential_symbol) == 0' failed. >> utils/mkgraph.sh: line 76: 7661 Aborted >> fstcomposecontext --context-size=$N --central-position=$P >> --read-disambig-syms=$lang/phones/disambig. >> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >> ERROR: FstHeader::Read: Bad FST header: >> data/lang_test_phn-mono/tmp/CLG_1_0.fst >> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >> Reading FST: error reading FST header from >> data/lang_test_phn-mono/tmp/CLG_1_0.fst >> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >> Reading FST: error reading FST header from >> data/lang_test_phn-mono/tmp/CLG_1_0.fst >> >> The execution of gdb: >> (gdb) where >> #0 0x00007ffff6be9475 in *__GI_raise (sig=<optimized out>) at >> ../nptl/sysdeps/unix/sysv/linux/raise.c:64 >> #1 0x00007ffff6bec6f0 in *__GI_abort () at abort.c:92 >> #2 0x00007ffff6be2621 in *__GI___assert_fail ( >> assertion=0x498448 "subsequential_symbol != 0 && >> disambig_syms_.count(subsequential_symbol) == 0 && >> phone_syms_.count(subsequential_symbol) == 0", >> file=<optimized out>, line=105, >> function=0x499700 "fst::ContextFstImpl<Arc, >> LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, >> std::allocator<_T2> >&, const std::vector<B, std::allocator<_T2> >&, int, >> int) [with Arc = fst::ArcTpl<fst::T"...) at assert.c:81 >> #3 0x000000000045b419 in >> fst::ContextFstImpl<fst::ArcTpl<fst::TropicalWeightTpl<float> >, >> int>::ContextFstImpl (this=0x6bd520, subsequential_symbol=97, >> phone_syms=..., disambig_syms=..., N=1, P=0) at >> ../fstext/context-fst-inl.h:103 >> #4 0x0000000000457610 in >> fst::ContextFst<fst::ArcTpl<fst::TropicalWeightTpl<float> >, >> int>::ContextFst (this=0x7fffffffd100, subsequential_symbol=97, >> phones=..., disambig_syms=..., N=1, P=0) at >> ../fstext/context-fst.h:223 >> #5 0x0000000000455b95 in fst::ComposeContext (disambig_syms_in=..., N=1, >> P=0, ifst=0x6c5be0, ofst=0x7fffffffd390, ilabels_out=0x7fffffffd3a0) >> at ../fstext/context-fst-inl.h:522 >> #6 0x00000000004522a3 in main (argc=7, argv=0x7fffffffdaa8) at >> fstcomposecontext.cc:138 >> (gdb) up >> #1 0x00007ffff6bec6f0 in *__GI_abort () at abort.c:92 >> 92 abort.c: No such file or directory. >> (gdb) up >> #2 0x00007ffff6be2621 in *__GI___assert_fail ( >> assertion=0x498448 "subsequential_symbol != 0 && >> disambig_syms_.count(subsequential_symbol) == 0 && >> phone_syms_.count(subsequential_symbol) == 0", >> file=<optimized out>, line=105, >> function=0x499700 "fst::ContextFstImpl<Arc, >> LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, >> std::allocator<_T2> >&, const std::vector<B, std::allocator<_T2> >&, int, >> int) [with Arc = fst::ArcTpl<fst::T"...) at assert.c:81 >> 81 assert.c: No such file or directory. >> (gdb) p subsequential_symbol >> No symbol "subsequential_symbol" in current context. >> (gdb) up >> #3 0x000000000045b419 in >> fst::ContextFstImpl<fst::ArcTpl<fst::TropicalWeightTpl<float> >, >> int>::ContextFstImpl (this=0x6bd520, subsequential_symbol=97, >> phone_syms=..., disambig_syms=..., N=1, P=0) at >> ../fstext/context-fst-inl.h:103 >> 103 assert(subsequential_symbol != 0 >> (gdb) p subsequential_symbol >> $1 = 97 >> (gdb) p disambig_syms_.count(subsequential_symbol) >> $2 = 0 >> (gdb) p phone_syms_.count(subsequential_symbol) >> $3 = 1 >> (gdb) p phone_syms_.size() >> $4 = 78 >> (gdb) p disambig_syms_.size() >> $5 = 0 >> >> >> Thanks >> >> X. >> >> >> >> On Sat, Dec 28, 2013 at 9:01 PM, Daniel Povey <dp...@gm...> wrote: >> >>> The same error should not have happened. Can you please do the same >>> steps in gdb as last time, and paste the screen from gdb? >>> Dan >>> >>> >>> >>> On Sat, Dec 28, 2013 at 11:49 AM, Xavier Anguera <xan...@gm...>wrote: >>> >>>> Dan, >>>> the same error occurred, just that now I got the extra Warning you >>>> inserted. >>>> Should I maybe modify the make_phone_bigram_lang.sh script to copy the >>>> current disambig.* files into the new lang directory? >>>> >>>> Thanks, >>>> >>>> X. >>>> >>>> >>>> >>>> On Sat, Dec 28, 2013 at 8:03 PM, Daniel Povey <dp...@gm...> wrote: >>>> >>>>> OK, then try running the script with the code fix I checked in. I >>>>> forgot about the existence of that script. Possibly it will work. I'll >>>>> have to modify validate_lang.pl in that case. >>>>> Dan >>>>> >>>>> >>>>> >>>>> On Sat, Dec 28, 2013 at 7:02 AM, Xavier Anguera <xan...@gm...>wrote: >>>>> >>>>>> Dan, >>>>>> there must be something I do not do correctly in my current setup, or >>>>>> you did not understand where my problem is. >>>>>> I am currently calling the script mkgraph.sh (that is crashing) in >>>>>> the following context: >>>>>> >>>>>> # Create phone-bigram grammar (unsmoothed) estimated from >>>>>> alignments >>>>>> utils/make_phone_bigram_lang.sh data/lang exp/mono_ali_all >>>>>> data/lang_test_phn-mono || exit 1; >>>>>> # Create phone recognition graph >>>>>> $train_cmd exp/mono/graph/mkgraph_phn.log \ >>>>>> utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>>> exp/mono/graph_phn || exit 1 >>>>>> >>>>>> As you can see, first the script make_phone_bigram_lang.sh is called, >>>>>> which takes as an input a lang directory and creates a "test" lang >>>>>> directory. Looking into this script I see that the disambig.* files are >>>>>> left empty in purpose in the new directory (they are not empty in the >>>>>> original lang directory, in fact, they have the #0 #1 values you proposed >>>>>> in the previous email). >>>>>> Then, when calling the mkgraph.sh script with this test_lang >>>>>> directory it complaints as stated in my previous emails. >>>>>> The question is then whether I should modify >>>>>> make_phone_bigram_lang.sh to copy the original disambig.* files or should I >>>>>> pass the original lang directory to the mkgraph.sh script, or am I doing >>>>>> something else very wrong? >>>>>> >>>>>> Thanks for your help. >>>>>> >>>>>> Xavier Anguera >>>>>> >>>>>> >>>>>> On Sat, Dec 28, 2013 at 1:43 AM, Daniel Povey <dp...@gm...>wrote: >>>>>> >>>>>>> OK, I just committed a fix because it should not have crashed at >>>>>>> that particular point in the code, but the underlying error is with your >>>>>>> lang directory. You do need to have the disambiguation symbols >>>>>>> "disambig.txt", with at least #0 and #1. You should probably be creating >>>>>>> the lang directory with the prepare_lang.sh script, and if not, at least >>>>>>> you should validate it with the validate_lang.pl script. Also, >>>>>>> there is no reason to have a separate "lang" directory for the monophone >>>>>>> setup, the same directory is valid for monophone or triphone setups. >>>>>>> >>>>>>> Dan >>>>>>> >>>>>>> >>>>>>> >>>>>>> On Fri, Dec 27, 2013 at 4:18 PM, Xavier Anguera <xan...@gm...>wrote: >>>>>>> >>>>>>>> Dear Dan, >>>>>>>> thank you for your help. >>>>>>>> Next are the tests you asked me to perform: >>>>>>>> >>>>>>>> Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: >>>>>>>> >>>>>>>> Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... >>>>>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt >>>>>>>> --> data/lang_test_phn-mono//phones/roots.int corresponds to >>>>>>>> data/lang_test_phn-mono//phones/roots.txt >>>>>>>> --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK >>>>>>>> >>>>>>>> Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... >>>>>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt >>>>>>>> --> data/lang_test_phn-mono//phones/sets.int corresponds to >>>>>>>> data/lang_test_phn-mono//phones/sets.txt >>>>>>>> --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK >>>>>>>> >>>>>>>> Checking data/lang_test_phn-mono//phones/extra_questions.{txt, int} >>>>>>>> ... >>>>>>>> --> 9 entry/entries in >>>>>>>> data/lang_test_phn-mono//phones/extra_questions.txt >>>>>>>> --> data/lang_test_phn-mono//phones/extra_questions.intcorresponds to data/lang_test_phn-mono//phones/extra_questions.txt >>>>>>>> --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are >>>>>>>> OK >>>>>>>> >>>>>>>> Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... >>>>>>>> --> silence.txt and nonsilence.txt are disjoint >>>>>>>> --> silence.txt and disambig.txt are disjoint >>>>>>>> --> disambig.txt and nonsilence.txt are disjoint >>>>>>>> --> disjoint property is OK >>>>>>>> >>>>>>>> Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... >>>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or >>>>>>>> not exists >>>>>>>> >>>>>>>> Checking optional_silence.txt ... >>>>>>>> --> reading data/lang_test_phn-mono//phones/optional_silence.txt >>>>>>>> --> data/lang_test_phn-mono//phones/optional_silence.txt is OK >>>>>>>> >>>>>>>> Checking disambiguation symbols: #0 and #1 >>>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or >>>>>>>> not exists >>>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't >>>>>>>> have "#0" or "#1" >>>>>>>> Checking topo ... >>>>>>>> --> data/lang_test_phn-mono//topo's nonsilence section is OK >>>>>>>> --> data/lang_test_phn-mono//topo's silence section is OK >>>>>>>> --> data/lang_test_phn-mono//topo is OK >>>>>>>> >>>>>>>> Checking data/lang_test_phn-mono//oov.{txt, int} ... >>>>>>>> --> ERROR: fail to open data/lang_test_phn-mono//oov.txt >>>>>>>> >>>>>>>> --> ERROR >>>>>>>> >>>>>>>> Apparently I do not have either oov.txt nore disambig.txt >>>>>>>> Probably the test data I am using does not have any OOV in it. I >>>>>>>> can add it artificially, but I guess this is not the main problem here... >>>>>>>> regarding the disambig.txt file, what should it contain? >>>>>>>> >>>>>>>> I did run gdb as you indicated (thank you for such detailed info) >>>>>>>> and gives me: >>>>>>>> (gdb) p subsequential_symbol >>>>>>>> $1 = 97 >>>>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>>>> $2 = 0 >>>>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>>>> $3 = 1 >>>>>>>> (gdb) p phone_syms_.size() >>>>>>>> $4 = 78 >>>>>>>> (gdb) p disambig_syms_.size() >>>>>>>> $5 = 0 >>>>>>>> >>>>>>>> Finally, the contents of cat data/lang_test_phn-mono/phones/ >>>>>>>> disambig.int is also empty. >>>>>>>> >>>>>>>> Thanks again for your help! >>>>>>>> >>>>>>>> yours, >>>>>>>> >>>>>>>> Xavier Anguera >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm...>wrote: >>>>>>>> >>>>>>>>> Could you please do the following. [apologies if you already know >>>>>>>>> gdb] >>>>>>>>> >>>>>>>>> First do utils/validate_lang.pl data/lang_test_phn-mono/ >>>>>>>>> and let me know if it fails. >>>>>>>>> If it doesn't fail, do: >>>>>>>>> >>>>>>>>> gdb --args fstcomposecontext --context-size=1 >>>>>>>>> --central-position=0 --read-disambig-syms=data/ >>>>>>>>> lang_test_phn-mono/phones/disambig.int --write-disambig-syms=data/ >>>>>>>>> lang_test_ >>>>>>>>> phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>>>>> data/lang_test_phn-mono/tmp/LG.fst >>>>>>>>> >>>>>>>>> (gdb) r >>>>>>>>> # wait till it crashes >>>>>>>>> # go up the stack by typing "up" until you get to the right frame; >>>>>>>>> type "down" if you go too far >>>>>>>>> >>>>>>>>> (gdb) p subsequential_symbol >>>>>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>>>>> (gdb) p phone_syms_.size() >>>>>>>>> (gdb) p disambig_syms_.size() >>>>>>>>> (gdb) quit >>>>>>>>> >>>>>>>>> [I hope this works; sometimes it will fail because functions are >>>>>>>>> inlined]. >>>>>>>>> Anyway, send the output, and also >>>>>>>>> cat data/lang_test_phn-mono/phones/disambig.int >>>>>>>>> and show me that output too. >>>>>>>>> >>>>>>>>> Dan >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera < >>>>>>>>> xan...@gm...> wrote: >>>>>>>>> >>>>>>>>>> Dear all, >>>>>>>>>> I am encounering a problem when training mono-state NN using a >>>>>>>>>> recipe adapted from the SWBD S5 recipe. I am able to train, decode and >>>>>>>>>> phone-align a GMM system, but when I use these results to train the NN I >>>>>>>>>> get the following error (see below). I have used this recipe in the past to >>>>>>>>>> successfully train one ASR system and now the only difference is that I am >>>>>>>>>> trying to train a similar system using graphemes are phonemes (for which I >>>>>>>>>> have assigned the graphemes of the words as transcriptions to each word). >>>>>>>>>> Any help is appreciated. >>>>>>>>>> >>>>>>>>>> This is the beginning of the file exp/mono/graph/mkgraph_phn.log: >>>>>>>>>> >>>>>>>>>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>>>>>>> exp/mono/graph_phn >>>>>>>>>> # Started at Fri Dec 27 18:57:19 CET 2013 >>>>>>>>>> # >>>>>>>>>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>>>>>>>>> data/lang_test_phn-mono/G.fst >>>>>>>>>> fstdeterminizestar --use-log=true >>>>>>>>>> fstminimizeencoded >>>>>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>>>>>>>>> 0.000358155 -0.000356635 >>>>>>>>>> fstcomposecontext --context-size=1 --central-position=0 >>>>>>>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >>>>>>>>>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>>>>>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>>>>>>>>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>>>>>>>>> std::vector<B, std::allocator< >>>>>>>>>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) >>>>>>>>>> [with Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: >>>>>>>>>> Assertion `subsequenti >>>>>>>>>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 >>>>>>>>>> && phone_syms_.count(subsequential_symbol) == 0' failed. >>>>>>>>>> utils/mkgraph.sh: line 76: 6263 Aborted >>>>>>>>>> fstcomposecontext --context-size=$N --central-position=$P >>>>>>>>>> --read-disambig-syms=$lang/phones/disambig. >>>>>>>>>> int >>>>>>>>>> --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>>>>>>>>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>>>>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>>> ERROR: FstHeader::Read: Bad FST header: >>>>>>>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>>> ERROR >>>>>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>>> ERROR >>>>>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> ------------------------------------------------------------------------------ >>>>>>>>>> Rapidly troubleshoot problems before they affect your business. >>>>>>>>>> Most IT >>>>>>>>>> organizations don't have a clear picture of how application >>>>>>>>>> performance >>>>>>>>>> affects their revenue. With AppDynamics, you get 100% visibility >>>>>>>>>> into your >>>>>>>>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of >>>>>>>>>> AppDynamics Pro! >>>>>>>>>> >>>>>>>>>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk >>>>>>>>>> _______________________________________________ >>>>>>>>>> Kaldi-developers mailing list >>>>>>>>>> Kal...@li... >>>>>>>>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>>>>>>>> >>>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> > |
From: Daniel P. <dp...@gm...> - 2013-12-28 20:37:34
|
Could it be that you did "svn up" only in the fstbin directory and not in src? Do "svn up" in src/, and see if you get further updates. Dan On Sat, Dec 28, 2013 at 12:35 PM, Xavier Anguera <xan...@gm...> wrote: > Sure, here it is: > > The error: > # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono > exp/mono/graph_phn > # Started at Sat Dec 28 20:47:57 CET 2013 > # > fstminimizeencoded > fsttablecompose data/lang_test_phn-mono/L_disambig.fst > data/lang_test_phn-mono/G.fst > fstdeterminizestar --use-log=true > fstisstochastic data/lang_test_phn-mono/tmp/LG.fst > 0.000358155 -0.000356635 > fstcomposecontext --context-size=1 --central-position=0 > --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ > phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 > WARNING (fstcomposecontext:main():fstcomposecontext.cc:130) Disambiguation > symbols list is empty; this likely indicates an error in data preparation. > fstcomposecontext: ../fstext/context-fst-inl.h:105: > fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const > std::vector<B, std::allocator< > _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with Arc > = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion > `subsequenti > al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && > phone_syms_.count(subsequential_symbol) == 0' failed. > utils/mkgraph.sh: line 76: 7661 Aborted fstcomposecontext > --context-size=$N --central-position=$P > --read-disambig-syms=$lang/phones/disambig. > int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int > $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg > fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst > ERROR: FstHeader::Read: Bad FST header: > data/lang_test_phn-mono/tmp/CLG_1_0.fst > ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) > Reading FST: error reading FST header from > data/lang_test_phn-mono/tmp/CLG_1_0.fst > ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) > Reading FST: error reading FST header from > data/lang_test_phn-mono/tmp/CLG_1_0.fst > > The execution of gdb: > (gdb) where > #0 0x00007ffff6be9475 in *__GI_raise (sig=<optimized out>) at > ../nptl/sysdeps/unix/sysv/linux/raise.c:64 > #1 0x00007ffff6bec6f0 in *__GI_abort () at abort.c:92 > #2 0x00007ffff6be2621 in *__GI___assert_fail ( > assertion=0x498448 "subsequential_symbol != 0 && > disambig_syms_.count(subsequential_symbol) == 0 && > phone_syms_.count(subsequential_symbol) == 0", > file=<optimized out>, line=105, > function=0x499700 "fst::ContextFstImpl<Arc, > LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, > std::allocator<_T2> >&, const std::vector<B, std::allocator<_T2> >&, int, > int) [with Arc = fst::ArcTpl<fst::T"...) at assert.c:81 > #3 0x000000000045b419 in > fst::ContextFstImpl<fst::ArcTpl<fst::TropicalWeightTpl<float> >, > int>::ContextFstImpl (this=0x6bd520, subsequential_symbol=97, > phone_syms=..., disambig_syms=..., N=1, P=0) at > ../fstext/context-fst-inl.h:103 > #4 0x0000000000457610 in > fst::ContextFst<fst::ArcTpl<fst::TropicalWeightTpl<float> >, > int>::ContextFst (this=0x7fffffffd100, subsequential_symbol=97, > phones=..., disambig_syms=..., N=1, P=0) at ../fstext/context-fst.h:223 > #5 0x0000000000455b95 in fst::ComposeContext (disambig_syms_in=..., N=1, > P=0, ifst=0x6c5be0, ofst=0x7fffffffd390, ilabels_out=0x7fffffffd3a0) > at ../fstext/context-fst-inl.h:522 > #6 0x00000000004522a3 in main (argc=7, argv=0x7fffffffdaa8) at > fstcomposecontext.cc:138 > (gdb) up > #1 0x00007ffff6bec6f0 in *__GI_abort () at abort.c:92 > 92 abort.c: No such file or directory. > (gdb) up > #2 0x00007ffff6be2621 in *__GI___assert_fail ( > assertion=0x498448 "subsequential_symbol != 0 && > disambig_syms_.count(subsequential_symbol) == 0 && > phone_syms_.count(subsequential_symbol) == 0", > file=<optimized out>, line=105, > function=0x499700 "fst::ContextFstImpl<Arc, > LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, > std::allocator<_T2> >&, const std::vector<B, std::allocator<_T2> >&, int, > int) [with Arc = fst::ArcTpl<fst::T"...) at assert.c:81 > 81 assert.c: No such file or directory. > (gdb) p subsequential_symbol > No symbol "subsequential_symbol" in current context. > (gdb) up > #3 0x000000000045b419 in > fst::ContextFstImpl<fst::ArcTpl<fst::TropicalWeightTpl<float> >, > int>::ContextFstImpl (this=0x6bd520, subsequential_symbol=97, > phone_syms=..., disambig_syms=..., N=1, P=0) at > ../fstext/context-fst-inl.h:103 > 103 assert(subsequential_symbol != 0 > (gdb) p subsequential_symbol > $1 = 97 > (gdb) p disambig_syms_.count(subsequential_symbol) > $2 = 0 > (gdb) p phone_syms_.count(subsequential_symbol) > $3 = 1 > (gdb) p phone_syms_.size() > $4 = 78 > (gdb) p disambig_syms_.size() > $5 = 0 > > > Thanks > > X. > > > > On Sat, Dec 28, 2013 at 9:01 PM, Daniel Povey <dp...@gm...> wrote: > >> The same error should not have happened. Can you please do the same >> steps in gdb as last time, and paste the screen from gdb? >> Dan >> >> >> >> On Sat, Dec 28, 2013 at 11:49 AM, Xavier Anguera <xan...@gm...>wrote: >> >>> Dan, >>> the same error occurred, just that now I got the extra Warning you >>> inserted. >>> Should I maybe modify the make_phone_bigram_lang.sh script to copy the >>> current disambig.* files into the new lang directory? >>> >>> Thanks, >>> >>> X. >>> >>> >>> >>> On Sat, Dec 28, 2013 at 8:03 PM, Daniel Povey <dp...@gm...> wrote: >>> >>>> OK, then try running the script with the code fix I checked in. I >>>> forgot about the existence of that script. Possibly it will work. I'll >>>> have to modify validate_lang.pl in that case. >>>> Dan >>>> >>>> >>>> >>>> On Sat, Dec 28, 2013 at 7:02 AM, Xavier Anguera <xan...@gm...>wrote: >>>> >>>>> Dan, >>>>> there must be something I do not do correctly in my current setup, or >>>>> you did not understand where my problem is. >>>>> I am currently calling the script mkgraph.sh (that is crashing) in the >>>>> following context: >>>>> >>>>> # Create phone-bigram grammar (unsmoothed) estimated from >>>>> alignments >>>>> utils/make_phone_bigram_lang.sh data/lang exp/mono_ali_all >>>>> data/lang_test_phn-mono || exit 1; >>>>> # Create phone recognition graph >>>>> $train_cmd exp/mono/graph/mkgraph_phn.log \ >>>>> utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>> exp/mono/graph_phn || exit 1 >>>>> >>>>> As you can see, first the script make_phone_bigram_lang.sh is called, >>>>> which takes as an input a lang directory and creates a "test" lang >>>>> directory. Looking into this script I see that the disambig.* files are >>>>> left empty in purpose in the new directory (they are not empty in the >>>>> original lang directory, in fact, they have the #0 #1 values you proposed >>>>> in the previous email). >>>>> Then, when calling the mkgraph.sh script with this test_lang directory >>>>> it complaints as stated in my previous emails. >>>>> The question is then whether I should modify make_phone_bigram_lang.sh >>>>> to copy the original disambig.* files or should I pass the original lang >>>>> directory to the mkgraph.sh script, or am I doing something else very >>>>> wrong? >>>>> >>>>> Thanks for your help. >>>>> >>>>> Xavier Anguera >>>>> >>>>> >>>>> On Sat, Dec 28, 2013 at 1:43 AM, Daniel Povey <dp...@gm...>wrote: >>>>> >>>>>> OK, I just committed a fix because it should not have crashed at that >>>>>> particular point in the code, but the underlying error is with your lang >>>>>> directory. You do need to have the disambiguation symbols "disambig.txt", >>>>>> with at least #0 and #1. You should probably be creating the lang >>>>>> directory with the prepare_lang.sh script, and if not, at least you should >>>>>> validate it with the validate_lang.pl script. Also, there is no >>>>>> reason to have a separate "lang" directory for the monophone setup, the >>>>>> same directory is valid for monophone or triphone setups. >>>>>> >>>>>> Dan >>>>>> >>>>>> >>>>>> >>>>>> On Fri, Dec 27, 2013 at 4:18 PM, Xavier Anguera <xan...@gm...>wrote: >>>>>> >>>>>>> Dear Dan, >>>>>>> thank you for your help. >>>>>>> Next are the tests you asked me to perform: >>>>>>> >>>>>>> Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: >>>>>>> >>>>>>> Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... >>>>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt >>>>>>> --> data/lang_test_phn-mono//phones/roots.int corresponds to >>>>>>> data/lang_test_phn-mono//phones/roots.txt >>>>>>> --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK >>>>>>> >>>>>>> Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... >>>>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt >>>>>>> --> data/lang_test_phn-mono//phones/sets.int corresponds to >>>>>>> data/lang_test_phn-mono//phones/sets.txt >>>>>>> --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK >>>>>>> >>>>>>> Checking data/lang_test_phn-mono//phones/extra_questions.{txt, int} >>>>>>> ... >>>>>>> --> 9 entry/entries in >>>>>>> data/lang_test_phn-mono//phones/extra_questions.txt >>>>>>> --> data/lang_test_phn-mono//phones/extra_questions.int corresponds >>>>>>> to data/lang_test_phn-mono//phones/extra_questions.txt >>>>>>> --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are OK >>>>>>> >>>>>>> Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... >>>>>>> --> silence.txt and nonsilence.txt are disjoint >>>>>>> --> silence.txt and disambig.txt are disjoint >>>>>>> --> disambig.txt and nonsilence.txt are disjoint >>>>>>> --> disjoint property is OK >>>>>>> >>>>>>> Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... >>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or >>>>>>> not exists >>>>>>> >>>>>>> Checking optional_silence.txt ... >>>>>>> --> reading data/lang_test_phn-mono//phones/optional_silence.txt >>>>>>> --> data/lang_test_phn-mono//phones/optional_silence.txt is OK >>>>>>> >>>>>>> Checking disambiguation symbols: #0 and #1 >>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or >>>>>>> not exists >>>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't have >>>>>>> "#0" or "#1" >>>>>>> Checking topo ... >>>>>>> --> data/lang_test_phn-mono//topo's nonsilence section is OK >>>>>>> --> data/lang_test_phn-mono//topo's silence section is OK >>>>>>> --> data/lang_test_phn-mono//topo is OK >>>>>>> >>>>>>> Checking data/lang_test_phn-mono//oov.{txt, int} ... >>>>>>> --> ERROR: fail to open data/lang_test_phn-mono//oov.txt >>>>>>> >>>>>>> --> ERROR >>>>>>> >>>>>>> Apparently I do not have either oov.txt nore disambig.txt >>>>>>> Probably the test data I am using does not have any OOV in it. I can >>>>>>> add it artificially, but I guess this is not the main problem here... >>>>>>> regarding the disambig.txt file, what should it contain? >>>>>>> >>>>>>> I did run gdb as you indicated (thank you for such detailed info) >>>>>>> and gives me: >>>>>>> (gdb) p subsequential_symbol >>>>>>> $1 = 97 >>>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>>> $2 = 0 >>>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>>> $3 = 1 >>>>>>> (gdb) p phone_syms_.size() >>>>>>> $4 = 78 >>>>>>> (gdb) p disambig_syms_.size() >>>>>>> $5 = 0 >>>>>>> >>>>>>> Finally, the contents of cat data/lang_test_phn-mono/phones/ >>>>>>> disambig.int is also empty. >>>>>>> >>>>>>> Thanks again for your help! >>>>>>> >>>>>>> yours, >>>>>>> >>>>>>> Xavier Anguera >>>>>>> >>>>>>> >>>>>>> >>>>>>> On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm...>wrote: >>>>>>> >>>>>>>> Could you please do the following. [apologies if you already know >>>>>>>> gdb] >>>>>>>> >>>>>>>> First do utils/validate_lang.pl data/lang_test_phn-mono/ >>>>>>>> and let me know if it fails. >>>>>>>> If it doesn't fail, do: >>>>>>>> >>>>>>>> gdb --args fstcomposecontext --context-size=1 --central-position=0 >>>>>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int >>>>>>>> --write-disambig-syms=data/lang_test_ >>>>>>>> phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>>>> data/lang_test_phn-mono/tmp/LG.fst >>>>>>>> >>>>>>>> (gdb) r >>>>>>>> # wait till it crashes >>>>>>>> # go up the stack by typing "up" until you get to the right frame; >>>>>>>> type "down" if you go too far >>>>>>>> >>>>>>>> (gdb) p subsequential_symbol >>>>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>>>> (gdb) p phone_syms_.size() >>>>>>>> (gdb) p disambig_syms_.size() >>>>>>>> (gdb) quit >>>>>>>> >>>>>>>> [I hope this works; sometimes it will fail because functions are >>>>>>>> inlined]. >>>>>>>> Anyway, send the output, and also >>>>>>>> cat data/lang_test_phn-mono/phones/disambig.int >>>>>>>> and show me that output too. >>>>>>>> >>>>>>>> Dan >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera < >>>>>>>> xan...@gm...> wrote: >>>>>>>> >>>>>>>>> Dear all, >>>>>>>>> I am encounering a problem when training mono-state NN using a >>>>>>>>> recipe adapted from the SWBD S5 recipe. I am able to train, decode and >>>>>>>>> phone-align a GMM system, but when I use these results to train the NN I >>>>>>>>> get the following error (see below). I have used this recipe in the past to >>>>>>>>> successfully train one ASR system and now the only difference is that I am >>>>>>>>> trying to train a similar system using graphemes are phonemes (for which I >>>>>>>>> have assigned the graphemes of the words as transcriptions to each word). >>>>>>>>> Any help is appreciated. >>>>>>>>> >>>>>>>>> This is the beginning of the file exp/mono/graph/mkgraph_phn.log: >>>>>>>>> >>>>>>>>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>>>>>> exp/mono/graph_phn >>>>>>>>> # Started at Fri Dec 27 18:57:19 CET 2013 >>>>>>>>> # >>>>>>>>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>>>>>>>> data/lang_test_phn-mono/G.fst >>>>>>>>> fstdeterminizestar --use-log=true >>>>>>>>> fstminimizeencoded >>>>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>>>>>>>> 0.000358155 -0.000356635 >>>>>>>>> fstcomposecontext --context-size=1 --central-position=0 >>>>>>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >>>>>>>>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>>>>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>>>>>>>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>>>>>>>> std::vector<B, std::allocator< >>>>>>>>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) >>>>>>>>> [with Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: >>>>>>>>> Assertion `subsequenti >>>>>>>>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 >>>>>>>>> && phone_syms_.count(subsequential_symbol) == 0' failed. >>>>>>>>> utils/mkgraph.sh: line 76: 6263 Aborted >>>>>>>>> fstcomposecontext --context-size=$N --central-position=$P >>>>>>>>> --read-disambig-syms=$lang/phones/disambig. >>>>>>>>> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>>>>>>>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>>>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>> ERROR: FstHeader::Read: Bad FST header: >>>>>>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>> ERROR >>>>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>> ERROR >>>>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> ------------------------------------------------------------------------------ >>>>>>>>> Rapidly troubleshoot problems before they affect your business. >>>>>>>>> Most IT >>>>>>>>> organizations don't have a clear picture of how application >>>>>>>>> performance >>>>>>>>> affects their revenue. With AppDynamics, you get 100% visibility >>>>>>>>> into your >>>>>>>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of >>>>>>>>> AppDynamics Pro! >>>>>>>>> >>>>>>>>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk >>>>>>>>> _______________________________________________ >>>>>>>>> Kaldi-developers mailing list >>>>>>>>> Kal...@li... >>>>>>>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> > |
From: Xavier A. <xan...@gm...> - 2013-12-28 20:35:37
|
Sure, here it is: The error: # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono exp/mono/graph_phn # Started at Sat Dec 28 20:47:57 CET 2013 # fstminimizeencoded fsttablecompose data/lang_test_phn-mono/L_disambig.fst data/lang_test_phn-mono/G.fst fstdeterminizestar --use-log=true fstisstochastic data/lang_test_phn-mono/tmp/LG.fst 0.000358155 -0.000356635 fstcomposecontext --context-size=1 --central-position=0 --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 WARNING (fstcomposecontext:main():fstcomposecontext.cc:130) Disambiguation symbols list is empty; this likely indicates an error in data preparation. fstcomposecontext: ../fstext/context-fst-inl.h:105: fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, std::allocator< _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion `subsequenti al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && phone_syms_.count(subsequential_symbol) == 0' failed. utils/mkgraph.sh: line 76: 7661 Aborted fstcomposecontext --context-size=$N --central-position=$P --read-disambig-syms=$lang/phones/disambig. int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst ERROR: FstHeader::Read: Bad FST header: data/lang_test_phn-mono/tmp/CLG_1_0.fst ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst The execution of gdb: (gdb) where #0 0x00007ffff6be9475 in *__GI_raise (sig=<optimized out>) at ../nptl/sysdeps/unix/sysv/linux/raise.c:64 #1 0x00007ffff6bec6f0 in *__GI_abort () at abort.c:92 #2 0x00007ffff6be2621 in *__GI___assert_fail ( assertion=0x498448 "subsequential_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && phone_syms_.count(subsequential_symbol) == 0", file=<optimized out>, line=105, function=0x499700 "fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, std::allocator<_T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with Arc = fst::ArcTpl<fst::T"...) at assert.c:81 #3 0x000000000045b419 in fst::ContextFstImpl<fst::ArcTpl<fst::TropicalWeightTpl<float> >, int>::ContextFstImpl (this=0x6bd520, subsequential_symbol=97, phone_syms=..., disambig_syms=..., N=1, P=0) at ../fstext/context-fst-inl.h:103 #4 0x0000000000457610 in fst::ContextFst<fst::ArcTpl<fst::TropicalWeightTpl<float> >, int>::ContextFst (this=0x7fffffffd100, subsequential_symbol=97, phones=..., disambig_syms=..., N=1, P=0) at ../fstext/context-fst.h:223 #5 0x0000000000455b95 in fst::ComposeContext (disambig_syms_in=..., N=1, P=0, ifst=0x6c5be0, ofst=0x7fffffffd390, ilabels_out=0x7fffffffd3a0) at ../fstext/context-fst-inl.h:522 #6 0x00000000004522a3 in main (argc=7, argv=0x7fffffffdaa8) at fstcomposecontext.cc:138 (gdb) up #1 0x00007ffff6bec6f0 in *__GI_abort () at abort.c:92 92 abort.c: No such file or directory. (gdb) up #2 0x00007ffff6be2621 in *__GI___assert_fail ( assertion=0x498448 "subsequential_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && phone_syms_.count(subsequential_symbol) == 0", file=<optimized out>, line=105, function=0x499700 "fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, std::allocator<_T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with Arc = fst::ArcTpl<fst::T"...) at assert.c:81 81 assert.c: No such file or directory. (gdb) p subsequential_symbol No symbol "subsequential_symbol" in current context. (gdb) up #3 0x000000000045b419 in fst::ContextFstImpl<fst::ArcTpl<fst::TropicalWeightTpl<float> >, int>::ContextFstImpl (this=0x6bd520, subsequential_symbol=97, phone_syms=..., disambig_syms=..., N=1, P=0) at ../fstext/context-fst-inl.h:103 103 assert(subsequential_symbol != 0 (gdb) p subsequential_symbol $1 = 97 (gdb) p disambig_syms_.count(subsequential_symbol) $2 = 0 (gdb) p phone_syms_.count(subsequential_symbol) $3 = 1 (gdb) p phone_syms_.size() $4 = 78 (gdb) p disambig_syms_.size() $5 = 0 Thanks X. On Sat, Dec 28, 2013 at 9:01 PM, Daniel Povey <dp...@gm...> wrote: > The same error should not have happened. Can you please do the same steps > in gdb as last time, and paste the screen from gdb? > Dan > > > > On Sat, Dec 28, 2013 at 11:49 AM, Xavier Anguera <xan...@gm...>wrote: > >> Dan, >> the same error occurred, just that now I got the extra Warning you >> inserted. >> Should I maybe modify the make_phone_bigram_lang.sh script to copy the >> current disambig.* files into the new lang directory? >> >> Thanks, >> >> X. >> >> >> >> On Sat, Dec 28, 2013 at 8:03 PM, Daniel Povey <dp...@gm...> wrote: >> >>> OK, then try running the script with the code fix I checked in. I >>> forgot about the existence of that script. Possibly it will work. I'll >>> have to modify validate_lang.pl in that case. >>> Dan >>> >>> >>> >>> On Sat, Dec 28, 2013 at 7:02 AM, Xavier Anguera <xan...@gm...>wrote: >>> >>>> Dan, >>>> there must be something I do not do correctly in my current setup, or >>>> you did not understand where my problem is. >>>> I am currently calling the script mkgraph.sh (that is crashing) in the >>>> following context: >>>> >>>> # Create phone-bigram grammar (unsmoothed) estimated from >>>> alignments >>>> utils/make_phone_bigram_lang.sh data/lang exp/mono_ali_all >>>> data/lang_test_phn-mono || exit 1; >>>> # Create phone recognition graph >>>> $train_cmd exp/mono/graph/mkgraph_phn.log \ >>>> utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>> exp/mono/graph_phn || exit 1 >>>> >>>> As you can see, first the script make_phone_bigram_lang.sh is called, >>>> which takes as an input a lang directory and creates a "test" lang >>>> directory. Looking into this script I see that the disambig.* files are >>>> left empty in purpose in the new directory (they are not empty in the >>>> original lang directory, in fact, they have the #0 #1 values you proposed >>>> in the previous email). >>>> Then, when calling the mkgraph.sh script with this test_lang directory >>>> it complaints as stated in my previous emails. >>>> The question is then whether I should modify make_phone_bigram_lang.sh >>>> to copy the original disambig.* files or should I pass the original lang >>>> directory to the mkgraph.sh script, or am I doing something else very >>>> wrong? >>>> >>>> Thanks for your help. >>>> >>>> Xavier Anguera >>>> >>>> >>>> On Sat, Dec 28, 2013 at 1:43 AM, Daniel Povey <dp...@gm...> wrote: >>>> >>>>> OK, I just committed a fix because it should not have crashed at that >>>>> particular point in the code, but the underlying error is with your lang >>>>> directory. You do need to have the disambiguation symbols "disambig.txt", >>>>> with at least #0 and #1. You should probably be creating the lang >>>>> directory with the prepare_lang.sh script, and if not, at least you should >>>>> validate it with the validate_lang.pl script. Also, there is no >>>>> reason to have a separate "lang" directory for the monophone setup, the >>>>> same directory is valid for monophone or triphone setups. >>>>> >>>>> Dan >>>>> >>>>> >>>>> >>>>> On Fri, Dec 27, 2013 at 4:18 PM, Xavier Anguera <xan...@gm...>wrote: >>>>> >>>>>> Dear Dan, >>>>>> thank you for your help. >>>>>> Next are the tests you asked me to perform: >>>>>> >>>>>> Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: >>>>>> >>>>>> Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... >>>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt >>>>>> --> data/lang_test_phn-mono//phones/roots.int corresponds to >>>>>> data/lang_test_phn-mono//phones/roots.txt >>>>>> --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK >>>>>> >>>>>> Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... >>>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt >>>>>> --> data/lang_test_phn-mono//phones/sets.int corresponds to >>>>>> data/lang_test_phn-mono//phones/sets.txt >>>>>> --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK >>>>>> >>>>>> Checking data/lang_test_phn-mono//phones/extra_questions.{txt, int} >>>>>> ... >>>>>> --> 9 entry/entries in >>>>>> data/lang_test_phn-mono//phones/extra_questions.txt >>>>>> --> data/lang_test_phn-mono//phones/extra_questions.int corresponds >>>>>> to data/lang_test_phn-mono//phones/extra_questions.txt >>>>>> --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are OK >>>>>> >>>>>> Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... >>>>>> --> silence.txt and nonsilence.txt are disjoint >>>>>> --> silence.txt and disambig.txt are disjoint >>>>>> --> disambig.txt and nonsilence.txt are disjoint >>>>>> --> disjoint property is OK >>>>>> >>>>>> Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... >>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or >>>>>> not exists >>>>>> >>>>>> Checking optional_silence.txt ... >>>>>> --> reading data/lang_test_phn-mono//phones/optional_silence.txt >>>>>> --> data/lang_test_phn-mono//phones/optional_silence.txt is OK >>>>>> >>>>>> Checking disambiguation symbols: #0 and #1 >>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or >>>>>> not exists >>>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't have >>>>>> "#0" or "#1" >>>>>> Checking topo ... >>>>>> --> data/lang_test_phn-mono//topo's nonsilence section is OK >>>>>> --> data/lang_test_phn-mono//topo's silence section is OK >>>>>> --> data/lang_test_phn-mono//topo is OK >>>>>> >>>>>> Checking data/lang_test_phn-mono//oov.{txt, int} ... >>>>>> --> ERROR: fail to open data/lang_test_phn-mono//oov.txt >>>>>> >>>>>> --> ERROR >>>>>> >>>>>> Apparently I do not have either oov.txt nore disambig.txt >>>>>> Probably the test data I am using does not have any OOV in it. I can >>>>>> add it artificially, but I guess this is not the main problem here... >>>>>> regarding the disambig.txt file, what should it contain? >>>>>> >>>>>> I did run gdb as you indicated (thank you for such detailed info) and >>>>>> gives me: >>>>>> (gdb) p subsequential_symbol >>>>>> $1 = 97 >>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>> $2 = 0 >>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>> $3 = 1 >>>>>> (gdb) p phone_syms_.size() >>>>>> $4 = 78 >>>>>> (gdb) p disambig_syms_.size() >>>>>> $5 = 0 >>>>>> >>>>>> Finally, the contents of cat data/lang_test_phn-mono/phones/ >>>>>> disambig.int is also empty. >>>>>> >>>>>> Thanks again for your help! >>>>>> >>>>>> yours, >>>>>> >>>>>> Xavier Anguera >>>>>> >>>>>> >>>>>> >>>>>> On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm...>wrote: >>>>>> >>>>>>> Could you please do the following. [apologies if you already know >>>>>>> gdb] >>>>>>> >>>>>>> First do utils/validate_lang.pl data/lang_test_phn-mono/ >>>>>>> and let me know if it fails. >>>>>>> If it doesn't fail, do: >>>>>>> >>>>>>> gdb --args fstcomposecontext --context-size=1 --central-position=0 >>>>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int >>>>>>> --write-disambig-syms=data/lang_test_ >>>>>>> phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>>> data/lang_test_phn-mono/tmp/LG.fst >>>>>>> >>>>>>> (gdb) r >>>>>>> # wait till it crashes >>>>>>> # go up the stack by typing "up" until you get to the right frame; >>>>>>> type "down" if you go too far >>>>>>> >>>>>>> (gdb) p subsequential_symbol >>>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>>> (gdb) p phone_syms_.size() >>>>>>> (gdb) p disambig_syms_.size() >>>>>>> (gdb) quit >>>>>>> >>>>>>> [I hope this works; sometimes it will fail because functions are >>>>>>> inlined]. >>>>>>> Anyway, send the output, and also >>>>>>> cat data/lang_test_phn-mono/phones/disambig.int >>>>>>> and show me that output too. >>>>>>> >>>>>>> Dan >>>>>>> >>>>>>> >>>>>>> >>>>>>> >>>>>>> On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera <xan...@gm... >>>>>>> > wrote: >>>>>>> >>>>>>>> Dear all, >>>>>>>> I am encounering a problem when training mono-state NN using a >>>>>>>> recipe adapted from the SWBD S5 recipe. I am able to train, decode and >>>>>>>> phone-align a GMM system, but when I use these results to train the NN I >>>>>>>> get the following error (see below). I have used this recipe in the past to >>>>>>>> successfully train one ASR system and now the only difference is that I am >>>>>>>> trying to train a similar system using graphemes are phonemes (for which I >>>>>>>> have assigned the graphemes of the words as transcriptions to each word). >>>>>>>> Any help is appreciated. >>>>>>>> >>>>>>>> This is the beginning of the file exp/mono/graph/mkgraph_phn.log: >>>>>>>> >>>>>>>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>>>>> exp/mono/graph_phn >>>>>>>> # Started at Fri Dec 27 18:57:19 CET 2013 >>>>>>>> # >>>>>>>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>>>>>>> data/lang_test_phn-mono/G.fst >>>>>>>> fstdeterminizestar --use-log=true >>>>>>>> fstminimizeencoded >>>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>>>>>>> 0.000358155 -0.000356635 >>>>>>>> fstcomposecontext --context-size=1 --central-position=0 >>>>>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >>>>>>>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>>>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>>>>>>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>>>>>>> std::vector<B, std::allocator< >>>>>>>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) >>>>>>>> [with Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: >>>>>>>> Assertion `subsequenti >>>>>>>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 >>>>>>>> && phone_syms_.count(subsequential_symbol) == 0' failed. >>>>>>>> utils/mkgraph.sh: line 76: 6263 Aborted >>>>>>>> fstcomposecontext --context-size=$N --central-position=$P >>>>>>>> --read-disambig-syms=$lang/phones/disambig. >>>>>>>> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>>>>>>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>> ERROR: FstHeader::Read: Bad FST header: >>>>>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>> ERROR >>>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>> ERROR >>>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> ------------------------------------------------------------------------------ >>>>>>>> Rapidly troubleshoot problems before they affect your business. >>>>>>>> Most IT >>>>>>>> organizations don't have a clear picture of how application >>>>>>>> performance >>>>>>>> affects their revenue. With AppDynamics, you get 100% visibility >>>>>>>> into your >>>>>>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of >>>>>>>> AppDynamics Pro! >>>>>>>> >>>>>>>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk >>>>>>>> _______________________________________________ >>>>>>>> Kaldi-developers mailing list >>>>>>>> Kal...@li... >>>>>>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> > |
From: Daniel P. <dp...@gm...> - 2013-12-28 20:01:48
|
The same error should not have happened. Can you please do the same steps in gdb as last time, and paste the screen from gdb? Dan On Sat, Dec 28, 2013 at 11:49 AM, Xavier Anguera <xan...@gm...> wrote: > Dan, > the same error occurred, just that now I got the extra Warning you > inserted. > Should I maybe modify the make_phone_bigram_lang.sh script to copy the > current disambig.* files into the new lang directory? > > Thanks, > > X. > > > > On Sat, Dec 28, 2013 at 8:03 PM, Daniel Povey <dp...@gm...> wrote: > >> OK, then try running the script with the code fix I checked in. I forgot >> about the existence of that script. Possibly it will work. I'll have to >> modify validate_lang.pl in that case. >> Dan >> >> >> >> On Sat, Dec 28, 2013 at 7:02 AM, Xavier Anguera <xan...@gm...>wrote: >> >>> Dan, >>> there must be something I do not do correctly in my current setup, or >>> you did not understand where my problem is. >>> I am currently calling the script mkgraph.sh (that is crashing) in the >>> following context: >>> >>> # Create phone-bigram grammar (unsmoothed) estimated from >>> alignments >>> utils/make_phone_bigram_lang.sh data/lang exp/mono_ali_all >>> data/lang_test_phn-mono || exit 1; >>> # Create phone recognition graph >>> $train_cmd exp/mono/graph/mkgraph_phn.log \ >>> utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>> exp/mono/graph_phn || exit 1 >>> >>> As you can see, first the script make_phone_bigram_lang.sh is called, >>> which takes as an input a lang directory and creates a "test" lang >>> directory. Looking into this script I see that the disambig.* files are >>> left empty in purpose in the new directory (they are not empty in the >>> original lang directory, in fact, they have the #0 #1 values you proposed >>> in the previous email). >>> Then, when calling the mkgraph.sh script with this test_lang directory >>> it complaints as stated in my previous emails. >>> The question is then whether I should modify make_phone_bigram_lang.sh >>> to copy the original disambig.* files or should I pass the original lang >>> directory to the mkgraph.sh script, or am I doing something else very >>> wrong? >>> >>> Thanks for your help. >>> >>> Xavier Anguera >>> >>> >>> On Sat, Dec 28, 2013 at 1:43 AM, Daniel Povey <dp...@gm...> wrote: >>> >>>> OK, I just committed a fix because it should not have crashed at that >>>> particular point in the code, but the underlying error is with your lang >>>> directory. You do need to have the disambiguation symbols "disambig.txt", >>>> with at least #0 and #1. You should probably be creating the lang >>>> directory with the prepare_lang.sh script, and if not, at least you should >>>> validate it with the validate_lang.pl script. Also, there is no >>>> reason to have a separate "lang" directory for the monophone setup, the >>>> same directory is valid for monophone or triphone setups. >>>> >>>> Dan >>>> >>>> >>>> >>>> On Fri, Dec 27, 2013 at 4:18 PM, Xavier Anguera <xan...@gm...>wrote: >>>> >>>>> Dear Dan, >>>>> thank you for your help. >>>>> Next are the tests you asked me to perform: >>>>> >>>>> Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: >>>>> >>>>> Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... >>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt >>>>> --> data/lang_test_phn-mono//phones/roots.int corresponds to >>>>> data/lang_test_phn-mono//phones/roots.txt >>>>> --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK >>>>> >>>>> Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... >>>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt >>>>> --> data/lang_test_phn-mono//phones/sets.int corresponds to >>>>> data/lang_test_phn-mono//phones/sets.txt >>>>> --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK >>>>> >>>>> Checking data/lang_test_phn-mono//phones/extra_questions.{txt, int} ... >>>>> --> 9 entry/entries in >>>>> data/lang_test_phn-mono//phones/extra_questions.txt >>>>> --> data/lang_test_phn-mono//phones/extra_questions.int corresponds >>>>> to data/lang_test_phn-mono//phones/extra_questions.txt >>>>> --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are OK >>>>> >>>>> Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... >>>>> --> silence.txt and nonsilence.txt are disjoint >>>>> --> silence.txt and disambig.txt are disjoint >>>>> --> disambig.txt and nonsilence.txt are disjoint >>>>> --> disjoint property is OK >>>>> >>>>> Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... >>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or >>>>> not exists >>>>> >>>>> Checking optional_silence.txt ... >>>>> --> reading data/lang_test_phn-mono//phones/optional_silence.txt >>>>> --> data/lang_test_phn-mono//phones/optional_silence.txt is OK >>>>> >>>>> Checking disambiguation symbols: #0 and #1 >>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or >>>>> not exists >>>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't have >>>>> "#0" or "#1" >>>>> Checking topo ... >>>>> --> data/lang_test_phn-mono//topo's nonsilence section is OK >>>>> --> data/lang_test_phn-mono//topo's silence section is OK >>>>> --> data/lang_test_phn-mono//topo is OK >>>>> >>>>> Checking data/lang_test_phn-mono//oov.{txt, int} ... >>>>> --> ERROR: fail to open data/lang_test_phn-mono//oov.txt >>>>> >>>>> --> ERROR >>>>> >>>>> Apparently I do not have either oov.txt nore disambig.txt >>>>> Probably the test data I am using does not have any OOV in it. I can >>>>> add it artificially, but I guess this is not the main problem here... >>>>> regarding the disambig.txt file, what should it contain? >>>>> >>>>> I did run gdb as you indicated (thank you for such detailed info) and >>>>> gives me: >>>>> (gdb) p subsequential_symbol >>>>> $1 = 97 >>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>> $2 = 0 >>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>> $3 = 1 >>>>> (gdb) p phone_syms_.size() >>>>> $4 = 78 >>>>> (gdb) p disambig_syms_.size() >>>>> $5 = 0 >>>>> >>>>> Finally, the contents of cat data/lang_test_phn-mono/phones/ >>>>> disambig.int is also empty. >>>>> >>>>> Thanks again for your help! >>>>> >>>>> yours, >>>>> >>>>> Xavier Anguera >>>>> >>>>> >>>>> >>>>> On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm...>wrote: >>>>> >>>>>> Could you please do the following. [apologies if you already know gdb] >>>>>> >>>>>> First do utils/validate_lang.pl data/lang_test_phn-mono/ >>>>>> and let me know if it fails. >>>>>> If it doesn't fail, do: >>>>>> >>>>>> gdb --args fstcomposecontext --context-size=1 --central-position=0 >>>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int >>>>>> --write-disambig-syms=data/lang_test_ >>>>>> phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>> data/lang_test_phn-mono/tmp/LG.fst >>>>>> >>>>>> (gdb) r >>>>>> # wait till it crashes >>>>>> # go up the stack by typing "up" until you get to the right frame; >>>>>> type "down" if you go too far >>>>>> >>>>>> (gdb) p subsequential_symbol >>>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>>> (gdb) p phone_syms_.size() >>>>>> (gdb) p disambig_syms_.size() >>>>>> (gdb) quit >>>>>> >>>>>> [I hope this works; sometimes it will fail because functions are >>>>>> inlined]. >>>>>> Anyway, send the output, and also >>>>>> cat data/lang_test_phn-mono/phones/disambig.int >>>>>> and show me that output too. >>>>>> >>>>>> Dan >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera <xan...@gm...>wrote: >>>>>> >>>>>>> Dear all, >>>>>>> I am encounering a problem when training mono-state NN using a >>>>>>> recipe adapted from the SWBD S5 recipe. I am able to train, decode and >>>>>>> phone-align a GMM system, but when I use these results to train the NN I >>>>>>> get the following error (see below). I have used this recipe in the past to >>>>>>> successfully train one ASR system and now the only difference is that I am >>>>>>> trying to train a similar system using graphemes are phonemes (for which I >>>>>>> have assigned the graphemes of the words as transcriptions to each word). >>>>>>> Any help is appreciated. >>>>>>> >>>>>>> This is the beginning of the file exp/mono/graph/mkgraph_phn.log: >>>>>>> >>>>>>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>>>> exp/mono/graph_phn >>>>>>> # Started at Fri Dec 27 18:57:19 CET 2013 >>>>>>> # >>>>>>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>>>>>> data/lang_test_phn-mono/G.fst >>>>>>> fstdeterminizestar --use-log=true >>>>>>> fstminimizeencoded >>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>>>>>> 0.000358155 -0.000356635 >>>>>>> fstcomposecontext --context-size=1 --central-position=0 >>>>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >>>>>>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>>>>>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>>>>>> std::vector<B, std::allocator< >>>>>>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) >>>>>>> [with Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: >>>>>>> Assertion `subsequenti >>>>>>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && >>>>>>> phone_syms_.count(subsequential_symbol) == 0' failed. >>>>>>> utils/mkgraph.sh: line 76: 6263 Aborted >>>>>>> fstcomposecontext --context-size=$N --central-position=$P >>>>>>> --read-disambig-syms=$lang/phones/disambig. >>>>>>> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>>>>>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>>>>>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>> ERROR: FstHeader::Read: Bad FST header: >>>>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>> ERROR >>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>> ERROR >>>>>>> (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading >>>>>>> FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>>> >>>>>>> >>>>>>> >>>>>>> ------------------------------------------------------------------------------ >>>>>>> Rapidly troubleshoot problems before they affect your business. Most >>>>>>> IT >>>>>>> organizations don't have a clear picture of how application >>>>>>> performance >>>>>>> affects their revenue. With AppDynamics, you get 100% visibility >>>>>>> into your >>>>>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of >>>>>>> AppDynamics Pro! >>>>>>> >>>>>>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk >>>>>>> _______________________________________________ >>>>>>> Kaldi-developers mailing list >>>>>>> Kal...@li... >>>>>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> > |
From: Xavier A. <xan...@gm...> - 2013-12-28 19:49:58
|
Dan, the same error occurred, just that now I got the extra Warning you inserted. Should I maybe modify the make_phone_bigram_lang.sh script to copy the current disambig.* files into the new lang directory? Thanks, X. On Sat, Dec 28, 2013 at 8:03 PM, Daniel Povey <dp...@gm...> wrote: > OK, then try running the script with the code fix I checked in. I forgot > about the existence of that script. Possibly it will work. I'll have to > modify validate_lang.pl in that case. > Dan > > > > On Sat, Dec 28, 2013 at 7:02 AM, Xavier Anguera <xan...@gm...>wrote: > >> Dan, >> there must be something I do not do correctly in my current setup, or you >> did not understand where my problem is. >> I am currently calling the script mkgraph.sh (that is crashing) in the >> following context: >> >> # Create phone-bigram grammar (unsmoothed) estimated from >> alignments >> utils/make_phone_bigram_lang.sh data/lang exp/mono_ali_all >> data/lang_test_phn-mono || exit 1; >> # Create phone recognition graph >> $train_cmd exp/mono/graph/mkgraph_phn.log \ >> utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >> exp/mono/graph_phn || exit 1 >> >> As you can see, first the script make_phone_bigram_lang.sh is called, >> which takes as an input a lang directory and creates a "test" lang >> directory. Looking into this script I see that the disambig.* files are >> left empty in purpose in the new directory (they are not empty in the >> original lang directory, in fact, they have the #0 #1 values you proposed >> in the previous email). >> Then, when calling the mkgraph.sh script with this test_lang directory it >> complaints as stated in my previous emails. >> The question is then whether I should modify make_phone_bigram_lang.sh to >> copy the original disambig.* files or should I pass the original lang >> directory to the mkgraph.sh script, or am I doing something else very >> wrong? >> >> Thanks for your help. >> >> Xavier Anguera >> >> >> On Sat, Dec 28, 2013 at 1:43 AM, Daniel Povey <dp...@gm...> wrote: >> >>> OK, I just committed a fix because it should not have crashed at that >>> particular point in the code, but the underlying error is with your lang >>> directory. You do need to have the disambiguation symbols "disambig.txt", >>> with at least #0 and #1. You should probably be creating the lang >>> directory with the prepare_lang.sh script, and if not, at least you should >>> validate it with the validate_lang.pl script. Also, there is no reason >>> to have a separate "lang" directory for the monophone setup, the same >>> directory is valid for monophone or triphone setups. >>> >>> Dan >>> >>> >>> >>> On Fri, Dec 27, 2013 at 4:18 PM, Xavier Anguera <xan...@gm...>wrote: >>> >>>> Dear Dan, >>>> thank you for your help. >>>> Next are the tests you asked me to perform: >>>> >>>> Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: >>>> >>>> Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... >>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt >>>> --> data/lang_test_phn-mono//phones/roots.int corresponds to >>>> data/lang_test_phn-mono//phones/roots.txt >>>> --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK >>>> >>>> Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... >>>> --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt >>>> --> data/lang_test_phn-mono//phones/sets.int corresponds to >>>> data/lang_test_phn-mono//phones/sets.txt >>>> --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK >>>> >>>> Checking data/lang_test_phn-mono//phones/extra_questions.{txt, int} ... >>>> --> 9 entry/entries in >>>> data/lang_test_phn-mono//phones/extra_questions.txt >>>> --> data/lang_test_phn-mono//phones/extra_questions.int corresponds to >>>> data/lang_test_phn-mono//phones/extra_questions.txt >>>> --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are OK >>>> >>>> Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... >>>> --> silence.txt and nonsilence.txt are disjoint >>>> --> silence.txt and disambig.txt are disjoint >>>> --> disambig.txt and nonsilence.txt are disjoint >>>> --> disjoint property is OK >>>> >>>> Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... >>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or not >>>> exists >>>> >>>> Checking optional_silence.txt ... >>>> --> reading data/lang_test_phn-mono//phones/optional_silence.txt >>>> --> data/lang_test_phn-mono//phones/optional_silence.txt is OK >>>> >>>> Checking disambiguation symbols: #0 and #1 >>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or not >>>> exists >>>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't have >>>> "#0" or "#1" >>>> Checking topo ... >>>> --> data/lang_test_phn-mono//topo's nonsilence section is OK >>>> --> data/lang_test_phn-mono//topo's silence section is OK >>>> --> data/lang_test_phn-mono//topo is OK >>>> >>>> Checking data/lang_test_phn-mono//oov.{txt, int} ... >>>> --> ERROR: fail to open data/lang_test_phn-mono//oov.txt >>>> >>>> --> ERROR >>>> >>>> Apparently I do not have either oov.txt nore disambig.txt >>>> Probably the test data I am using does not have any OOV in it. I can >>>> add it artificially, but I guess this is not the main problem here... >>>> regarding the disambig.txt file, what should it contain? >>>> >>>> I did run gdb as you indicated (thank you for such detailed info) and >>>> gives me: >>>> (gdb) p subsequential_symbol >>>> $1 = 97 >>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>> $2 = 0 >>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>> $3 = 1 >>>> (gdb) p phone_syms_.size() >>>> $4 = 78 >>>> (gdb) p disambig_syms_.size() >>>> $5 = 0 >>>> >>>> Finally, the contents of cat data/lang_test_phn-mono/phones/ >>>> disambig.int is also empty. >>>> >>>> Thanks again for your help! >>>> >>>> yours, >>>> >>>> Xavier Anguera >>>> >>>> >>>> >>>> On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm...>wrote: >>>> >>>>> Could you please do the following. [apologies if you already know gdb] >>>>> >>>>> First do utils/validate_lang.pl data/lang_test_phn-mono/ >>>>> and let me know if it fails. >>>>> If it doesn't fail, do: >>>>> >>>>> gdb --args fstcomposecontext --context-size=1 --central-position=0 >>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int >>>>> --write-disambig-syms=data/lang_test_ >>>>> phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 >>>>> data/lang_test_phn-mono/tmp/LG.fst >>>>> >>>>> (gdb) r >>>>> # wait till it crashes >>>>> # go up the stack by typing "up" until you get to the right frame; >>>>> type "down" if you go too far >>>>> >>>>> (gdb) p subsequential_symbol >>>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>>> (gdb) p phone_syms_.size() >>>>> (gdb) p disambig_syms_.size() >>>>> (gdb) quit >>>>> >>>>> [I hope this works; sometimes it will fail because functions are >>>>> inlined]. >>>>> Anyway, send the output, and also >>>>> cat data/lang_test_phn-mono/phones/disambig.int >>>>> and show me that output too. >>>>> >>>>> Dan >>>>> >>>>> >>>>> >>>>> >>>>> On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera <xan...@gm...>wrote: >>>>> >>>>>> Dear all, >>>>>> I am encounering a problem when training mono-state NN using a recipe >>>>>> adapted from the SWBD S5 recipe. I am able to train, decode and phone-align >>>>>> a GMM system, but when I use these results to train the NN I get the >>>>>> following error (see below). I have used this recipe in the past to >>>>>> successfully train one ASR system and now the only difference is that I am >>>>>> trying to train a similar system using graphemes are phonemes (for which I >>>>>> have assigned the graphemes of the words as transcriptions to each word). >>>>>> Any help is appreciated. >>>>>> >>>>>> This is the beginning of the file exp/mono/graph/mkgraph_phn.log: >>>>>> >>>>>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>>> exp/mono/graph_phn >>>>>> # Started at Fri Dec 27 18:57:19 CET 2013 >>>>>> # >>>>>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>>>>> data/lang_test_phn-mono/G.fst >>>>>> fstdeterminizestar --use-log=true >>>>>> fstminimizeencoded >>>>>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>>>>> 0.000358155 -0.000356635 >>>>>> fstcomposecontext --context-size=1 --central-position=0 >>>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >>>>>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >>>>>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>>>>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>>>>> std::vector<B, std::allocator< >>>>>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with >>>>>> Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion >>>>>> `subsequenti >>>>>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && >>>>>> phone_syms_.count(subsequential_symbol) == 0' failed. >>>>>> utils/mkgraph.sh: line 76: 6263 Aborted >>>>>> fstcomposecontext --context-size=$N --central-position=$P >>>>>> --read-disambig-syms=$lang/phones/disambig. >>>>>> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>>>>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>>>>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>> ERROR: FstHeader::Read: Bad FST header: >>>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>>>>> Reading FST: error reading FST header from >>>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>>>>> Reading FST: error reading FST header from >>>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>>> >>>>>> >>>>>> >>>>>> ------------------------------------------------------------------------------ >>>>>> Rapidly troubleshoot problems before they affect your business. Most >>>>>> IT >>>>>> organizations don't have a clear picture of how application >>>>>> performance >>>>>> affects their revenue. With AppDynamics, you get 100% visibility into >>>>>> your >>>>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of >>>>>> AppDynamics Pro! >>>>>> >>>>>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk >>>>>> _______________________________________________ >>>>>> Kaldi-developers mailing list >>>>>> Kal...@li... >>>>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>>>> >>>>>> >>>>> >>>> >>> >> > |
From: Daniel P. <dp...@gm...> - 2013-12-28 19:03:33
|
OK, then try running the script with the code fix I checked in. I forgot about the existence of that script. Possibly it will work. I'll have to modify validate_lang.pl in that case. Dan On Sat, Dec 28, 2013 at 7:02 AM, Xavier Anguera <xan...@gm...> wrote: > Dan, > there must be something I do not do correctly in my current setup, or you > did not understand where my problem is. > I am currently calling the script mkgraph.sh (that is crashing) in the > following context: > > # Create phone-bigram grammar (unsmoothed) estimated from > alignments > utils/make_phone_bigram_lang.sh data/lang exp/mono_ali_all > data/lang_test_phn-mono || exit 1; > # Create phone recognition graph > $train_cmd exp/mono/graph/mkgraph_phn.log \ > utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono > exp/mono/graph_phn || exit 1 > > As you can see, first the script make_phone_bigram_lang.sh is called, > which takes as an input a lang directory and creates a "test" lang > directory. Looking into this script I see that the disambig.* files are > left empty in purpose in the new directory (they are not empty in the > original lang directory, in fact, they have the #0 #1 values you proposed > in the previous email). > Then, when calling the mkgraph.sh script with this test_lang directory it > complaints as stated in my previous emails. > The question is then whether I should modify make_phone_bigram_lang.sh to > copy the original disambig.* files or should I pass the original lang > directory to the mkgraph.sh script, or am I doing something else very > wrong? > > Thanks for your help. > > Xavier Anguera > > > On Sat, Dec 28, 2013 at 1:43 AM, Daniel Povey <dp...@gm...> wrote: > >> OK, I just committed a fix because it should not have crashed at that >> particular point in the code, but the underlying error is with your lang >> directory. You do need to have the disambiguation symbols "disambig.txt", >> with at least #0 and #1. You should probably be creating the lang >> directory with the prepare_lang.sh script, and if not, at least you should >> validate it with the validate_lang.pl script. Also, there is no reason >> to have a separate "lang" directory for the monophone setup, the same >> directory is valid for monophone or triphone setups. >> >> Dan >> >> >> >> On Fri, Dec 27, 2013 at 4:18 PM, Xavier Anguera <xan...@gm...>wrote: >> >>> Dear Dan, >>> thank you for your help. >>> Next are the tests you asked me to perform: >>> >>> Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: >>> >>> Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... >>> --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt >>> --> data/lang_test_phn-mono//phones/roots.int corresponds to >>> data/lang_test_phn-mono//phones/roots.txt >>> --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK >>> >>> Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... >>> --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt >>> --> data/lang_test_phn-mono//phones/sets.int corresponds to >>> data/lang_test_phn-mono//phones/sets.txt >>> --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK >>> >>> Checking data/lang_test_phn-mono//phones/extra_questions.{txt, int} ... >>> --> 9 entry/entries in >>> data/lang_test_phn-mono//phones/extra_questions.txt >>> --> data/lang_test_phn-mono//phones/extra_questions.int corresponds to >>> data/lang_test_phn-mono//phones/extra_questions.txt >>> --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are OK >>> >>> Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... >>> --> silence.txt and nonsilence.txt are disjoint >>> --> silence.txt and disambig.txt are disjoint >>> --> disambig.txt and nonsilence.txt are disjoint >>> --> disjoint property is OK >>> >>> Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... >>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or not >>> exists >>> >>> Checking optional_silence.txt ... >>> --> reading data/lang_test_phn-mono//phones/optional_silence.txt >>> --> data/lang_test_phn-mono//phones/optional_silence.txt is OK >>> >>> Checking disambiguation symbols: #0 and #1 >>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or not >>> exists >>> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't have >>> "#0" or "#1" >>> Checking topo ... >>> --> data/lang_test_phn-mono//topo's nonsilence section is OK >>> --> data/lang_test_phn-mono//topo's silence section is OK >>> --> data/lang_test_phn-mono//topo is OK >>> >>> Checking data/lang_test_phn-mono//oov.{txt, int} ... >>> --> ERROR: fail to open data/lang_test_phn-mono//oov.txt >>> >>> --> ERROR >>> >>> Apparently I do not have either oov.txt nore disambig.txt >>> Probably the test data I am using does not have any OOV in it. I can add >>> it artificially, but I guess this is not the main problem here... regarding >>> the disambig.txt file, what should it contain? >>> >>> I did run gdb as you indicated (thank you for such detailed info) and >>> gives me: >>> (gdb) p subsequential_symbol >>> $1 = 97 >>> (gdb) p disambig_syms_.count(subsequential_symbol) >>> $2 = 0 >>> (gdb) p phone_syms_.count(subsequential_symbol) >>> $3 = 1 >>> (gdb) p phone_syms_.size() >>> $4 = 78 >>> (gdb) p disambig_syms_.size() >>> $5 = 0 >>> >>> Finally, the contents of cat data/lang_test_phn-mono/phones/disambig.int is >>> also empty. >>> >>> Thanks again for your help! >>> >>> yours, >>> >>> Xavier Anguera >>> >>> >>> >>> On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm...> wrote: >>> >>>> Could you please do the following. [apologies if you already know gdb] >>>> >>>> First do utils/validate_lang.pl data/lang_test_phn-mono/ >>>> and let me know if it fails. >>>> If it doesn't fail, do: >>>> >>>> gdb --args fstcomposecontext --context-size=1 --central-position=0 >>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int >>>> --write-disambig-syms=data/lang_test_ >>>> phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 >>>> data/lang_test_phn-mono/tmp/LG.fst >>>> >>>> (gdb) r >>>> # wait till it crashes >>>> # go up the stack by typing "up" until you get to the right frame; type >>>> "down" if you go too far >>>> >>>> (gdb) p subsequential_symbol >>>> (gdb) p disambig_syms_.count(subsequential_symbol) >>>> (gdb) p phone_syms_.count(subsequential_symbol) >>>> (gdb) p phone_syms_.size() >>>> (gdb) p disambig_syms_.size() >>>> (gdb) quit >>>> >>>> [I hope this works; sometimes it will fail because functions are >>>> inlined]. >>>> Anyway, send the output, and also >>>> cat data/lang_test_phn-mono/phones/disambig.int >>>> and show me that output too. >>>> >>>> Dan >>>> >>>> >>>> >>>> >>>> On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera <xan...@gm...>wrote: >>>> >>>>> Dear all, >>>>> I am encounering a problem when training mono-state NN using a recipe >>>>> adapted from the SWBD S5 recipe. I am able to train, decode and phone-align >>>>> a GMM system, but when I use these results to train the NN I get the >>>>> following error (see below). I have used this recipe in the past to >>>>> successfully train one ASR system and now the only difference is that I am >>>>> trying to train a similar system using graphemes are phonemes (for which I >>>>> have assigned the graphemes of the words as transcriptions to each word). >>>>> Any help is appreciated. >>>>> >>>>> This is the beginning of the file exp/mono/graph/mkgraph_phn.log: >>>>> >>>>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>>> exp/mono/graph_phn >>>>> # Started at Fri Dec 27 18:57:19 CET 2013 >>>>> # >>>>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>>>> data/lang_test_phn-mono/G.fst >>>>> fstdeterminizestar --use-log=true >>>>> fstminimizeencoded >>>>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>>>> 0.000358155 -0.000356635 >>>>> fstcomposecontext --context-size=1 --central-position=0 >>>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >>>>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >>>>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>>>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>>>> std::vector<B, std::allocator< >>>>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with >>>>> Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion >>>>> `subsequenti >>>>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && >>>>> phone_syms_.count(subsequential_symbol) == 0' failed. >>>>> utils/mkgraph.sh: line 76: 6263 Aborted >>>>> fstcomposecontext --context-size=$N --central-position=$P >>>>> --read-disambig-syms=$lang/phones/disambig. >>>>> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>>>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>>>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>> ERROR: FstHeader::Read: Bad FST header: >>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>>>> Reading FST: error reading FST header from >>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>>>> Reading FST: error reading FST header from >>>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>>> >>>>> >>>>> >>>>> ------------------------------------------------------------------------------ >>>>> Rapidly troubleshoot problems before they affect your business. Most IT >>>>> organizations don't have a clear picture of how application performance >>>>> affects their revenue. With AppDynamics, you get 100% visibility into >>>>> your >>>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of >>>>> AppDynamics Pro! >>>>> >>>>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk >>>>> _______________________________________________ >>>>> Kaldi-developers mailing list >>>>> Kal...@li... >>>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>>> >>>>> >>>> >>> >> > |
From: Xavier A. <xan...@gm...> - 2013-12-28 15:02:36
|
Dan, there must be something I do not do correctly in my current setup, or you did not understand where my problem is. I am currently calling the script mkgraph.sh (that is crashing) in the following context: # Create phone-bigram grammar (unsmoothed) estimated from alignments utils/make_phone_bigram_lang.sh data/lang exp/mono_ali_all data/lang_test_phn-mono || exit 1; # Create phone recognition graph $train_cmd exp/mono/graph/mkgraph_phn.log \ utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono exp/mono/graph_phn || exit 1 As you can see, first the script make_phone_bigram_lang.sh is called, which takes as an input a lang directory and creates a "test" lang directory. Looking into this script I see that the disambig.* files are left empty in purpose in the new directory (they are not empty in the original lang directory, in fact, they have the #0 #1 values you proposed in the previous email). Then, when calling the mkgraph.sh script with this test_lang directory it complaints as stated in my previous emails. The question is then whether I should modify make_phone_bigram_lang.sh to copy the original disambig.* files or should I pass the original lang directory to the mkgraph.sh script, or am I doing something else very wrong? Thanks for your help. Xavier Anguera On Sat, Dec 28, 2013 at 1:43 AM, Daniel Povey <dp...@gm...> wrote: > OK, I just committed a fix because it should not have crashed at that > particular point in the code, but the underlying error is with your lang > directory. You do need to have the disambiguation symbols "disambig.txt", > with at least #0 and #1. You should probably be creating the lang > directory with the prepare_lang.sh script, and if not, at least you should > validate it with the validate_lang.pl script. Also, there is no reason > to have a separate "lang" directory for the monophone setup, the same > directory is valid for monophone or triphone setups. > > Dan > > > > On Fri, Dec 27, 2013 at 4:18 PM, Xavier Anguera <xan...@gm...>wrote: > >> Dear Dan, >> thank you for your help. >> Next are the tests you asked me to perform: >> >> Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: >> >> Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... >> --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt >> --> data/lang_test_phn-mono//phones/roots.int corresponds to >> data/lang_test_phn-mono//phones/roots.txt >> --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK >> >> Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... >> --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt >> --> data/lang_test_phn-mono//phones/sets.int corresponds to >> data/lang_test_phn-mono//phones/sets.txt >> --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK >> >> Checking data/lang_test_phn-mono//phones/extra_questions.{txt, int} ... >> --> 9 entry/entries in data/lang_test_phn-mono//phones/extra_questions.txt >> --> data/lang_test_phn-mono//phones/extra_questions.int corresponds to >> data/lang_test_phn-mono//phones/extra_questions.txt >> --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are OK >> >> Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... >> --> silence.txt and nonsilence.txt are disjoint >> --> silence.txt and disambig.txt are disjoint >> --> disambig.txt and nonsilence.txt are disjoint >> --> disjoint property is OK >> >> Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... >> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or not >> exists >> >> Checking optional_silence.txt ... >> --> reading data/lang_test_phn-mono//phones/optional_silence.txt >> --> data/lang_test_phn-mono//phones/optional_silence.txt is OK >> >> Checking disambiguation symbols: #0 and #1 >> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or not >> exists >> --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't have "#0" >> or "#1" >> Checking topo ... >> --> data/lang_test_phn-mono//topo's nonsilence section is OK >> --> data/lang_test_phn-mono//topo's silence section is OK >> --> data/lang_test_phn-mono//topo is OK >> >> Checking data/lang_test_phn-mono//oov.{txt, int} ... >> --> ERROR: fail to open data/lang_test_phn-mono//oov.txt >> >> --> ERROR >> >> Apparently I do not have either oov.txt nore disambig.txt >> Probably the test data I am using does not have any OOV in it. I can add >> it artificially, but I guess this is not the main problem here... regarding >> the disambig.txt file, what should it contain? >> >> I did run gdb as you indicated (thank you for such detailed info) and >> gives me: >> (gdb) p subsequential_symbol >> $1 = 97 >> (gdb) p disambig_syms_.count(subsequential_symbol) >> $2 = 0 >> (gdb) p phone_syms_.count(subsequential_symbol) >> $3 = 1 >> (gdb) p phone_syms_.size() >> $4 = 78 >> (gdb) p disambig_syms_.size() >> $5 = 0 >> >> Finally, the contents of cat data/lang_test_phn-mono/phones/disambig.int is >> also empty. >> >> Thanks again for your help! >> >> yours, >> >> Xavier Anguera >> >> >> >> On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm...> wrote: >> >>> Could you please do the following. [apologies if you already know gdb] >>> >>> First do utils/validate_lang.pl data/lang_test_phn-mono/ >>> and let me know if it fails. >>> If it doesn't fail, do: >>> >>> gdb --args fstcomposecontext --context-size=1 --central-position=0 >>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int >>> --write-disambig-syms=data/lang_test_ >>> phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 >>> data/lang_test_phn-mono/tmp/LG.fst >>> >>> (gdb) r >>> # wait till it crashes >>> # go up the stack by typing "up" until you get to the right frame; type >>> "down" if you go too far >>> >>> (gdb) p subsequential_symbol >>> (gdb) p disambig_syms_.count(subsequential_symbol) >>> (gdb) p phone_syms_.count(subsequential_symbol) >>> (gdb) p phone_syms_.size() >>> (gdb) p disambig_syms_.size() >>> (gdb) quit >>> >>> [I hope this works; sometimes it will fail because functions are >>> inlined]. >>> Anyway, send the output, and also >>> cat data/lang_test_phn-mono/phones/disambig.int >>> and show me that output too. >>> >>> Dan >>> >>> >>> >>> >>> On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera <xan...@gm...>wrote: >>> >>>> Dear all, >>>> I am encounering a problem when training mono-state NN using a recipe >>>> adapted from the SWBD S5 recipe. I am able to train, decode and phone-align >>>> a GMM system, but when I use these results to train the NN I get the >>>> following error (see below). I have used this recipe in the past to >>>> successfully train one ASR system and now the only difference is that I am >>>> trying to train a similar system using graphemes are phonemes (for which I >>>> have assigned the graphemes of the words as transcriptions to each word). >>>> Any help is appreciated. >>>> >>>> This is the beginning of the file exp/mono/graph/mkgraph_phn.log: >>>> >>>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>>> exp/mono/graph_phn >>>> # Started at Fri Dec 27 18:57:19 CET 2013 >>>> # >>>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>>> data/lang_test_phn-mono/G.fst >>>> fstdeterminizestar --use-log=true >>>> fstminimizeencoded >>>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>>> 0.000358155 -0.000356635 >>>> fstcomposecontext --context-size=1 --central-position=0 >>>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >>>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >>>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>>> std::vector<B, std::allocator< >>>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with >>>> Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion >>>> `subsequenti >>>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && >>>> phone_syms_.count(subsequential_symbol) == 0' failed. >>>> utils/mkgraph.sh: line 76: 6263 Aborted >>>> fstcomposecontext --context-size=$N --central-position=$P >>>> --read-disambig-syms=$lang/phones/disambig. >>>> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>> ERROR: FstHeader::Read: Bad FST header: >>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>>> Reading FST: error reading FST header from >>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>>> Reading FST: error reading FST header from >>>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>>> >>>> >>>> >>>> ------------------------------------------------------------------------------ >>>> Rapidly troubleshoot problems before they affect your business. Most IT >>>> organizations don't have a clear picture of how application performance >>>> affects their revenue. With AppDynamics, you get 100% visibility into >>>> your >>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of >>>> AppDynamics Pro! >>>> >>>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk >>>> _______________________________________________ >>>> Kaldi-developers mailing list >>>> Kal...@li... >>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>> >>>> >>> >> > |
From: Daniel P. <dp...@gm...> - 2013-12-28 00:43:37
|
OK, I just committed a fix because it should not have crashed at that particular point in the code, but the underlying error is with your lang directory. You do need to have the disambiguation symbols "disambig.txt", with at least #0 and #1. You should probably be creating the lang directory with the prepare_lang.sh script, and if not, at least you should validate it with the validate_lang.pl script. Also, there is no reason to have a separate "lang" directory for the monophone setup, the same directory is valid for monophone or triphone setups. Dan On Fri, Dec 27, 2013 at 4:18 PM, Xavier Anguera <xan...@gm...> wrote: > Dear Dan, > thank you for your help. > Next are the tests you asked me to perform: > > Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: > > Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... > --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt > --> data/lang_test_phn-mono//phones/roots.int corresponds to > data/lang_test_phn-mono//phones/roots.txt > --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK > > Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... > --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt > --> data/lang_test_phn-mono//phones/sets.int corresponds to > data/lang_test_phn-mono//phones/sets.txt > --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK > > Checking data/lang_test_phn-mono//phones/extra_questions.{txt, int} ... > --> 9 entry/entries in data/lang_test_phn-mono//phones/extra_questions.txt > --> data/lang_test_phn-mono//phones/extra_questions.int corresponds to > data/lang_test_phn-mono//phones/extra_questions.txt > --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are OK > > Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... > --> silence.txt and nonsilence.txt are disjoint > --> silence.txt and disambig.txt are disjoint > --> disambig.txt and nonsilence.txt are disjoint > --> disjoint property is OK > > Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... > --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or not > exists > > Checking optional_silence.txt ... > --> reading data/lang_test_phn-mono//phones/optional_silence.txt > --> data/lang_test_phn-mono//phones/optional_silence.txt is OK > > Checking disambiguation symbols: #0 and #1 > --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or not > exists > --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't have "#0" > or "#1" > Checking topo ... > --> data/lang_test_phn-mono//topo's nonsilence section is OK > --> data/lang_test_phn-mono//topo's silence section is OK > --> data/lang_test_phn-mono//topo is OK > > Checking data/lang_test_phn-mono//oov.{txt, int} ... > --> ERROR: fail to open data/lang_test_phn-mono//oov.txt > > --> ERROR > > Apparently I do not have either oov.txt nore disambig.txt > Probably the test data I am using does not have any OOV in it. I can add > it artificially, but I guess this is not the main problem here... regarding > the disambig.txt file, what should it contain? > > I did run gdb as you indicated (thank you for such detailed info) and > gives me: > (gdb) p subsequential_symbol > $1 = 97 > (gdb) p disambig_syms_.count(subsequential_symbol) > $2 = 0 > (gdb) p phone_syms_.count(subsequential_symbol) > $3 = 1 > (gdb) p phone_syms_.size() > $4 = 78 > (gdb) p disambig_syms_.size() > $5 = 0 > > Finally, the contents of cat data/lang_test_phn-mono/phones/disambig.int is > also empty. > > Thanks again for your help! > > yours, > > Xavier Anguera > > > > On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm...> wrote: > >> Could you please do the following. [apologies if you already know gdb] >> >> First do utils/validate_lang.pl data/lang_test_phn-mono/ >> and let me know if it fails. >> If it doesn't fail, do: >> >> gdb --args fstcomposecontext --context-size=1 --central-position=0 >> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int >> --write-disambig-syms=data/lang_test_ >> phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 >> data/lang_test_phn-mono/tmp/LG.fst >> >> (gdb) r >> # wait till it crashes >> # go up the stack by typing "up" until you get to the right frame; type >> "down" if you go too far >> >> (gdb) p subsequential_symbol >> (gdb) p disambig_syms_.count(subsequential_symbol) >> (gdb) p phone_syms_.count(subsequential_symbol) >> (gdb) p phone_syms_.size() >> (gdb) p disambig_syms_.size() >> (gdb) quit >> >> [I hope this works; sometimes it will fail because functions are inlined]. >> Anyway, send the output, and also >> cat data/lang_test_phn-mono/phones/disambig.int >> and show me that output too. >> >> Dan >> >> >> >> >> On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera <xan...@gm...>wrote: >> >>> Dear all, >>> I am encounering a problem when training mono-state NN using a recipe >>> adapted from the SWBD S5 recipe. I am able to train, decode and phone-align >>> a GMM system, but when I use these results to train the NN I get the >>> following error (see below). I have used this recipe in the past to >>> successfully train one ASR system and now the only difference is that I am >>> trying to train a similar system using graphemes are phonemes (for which I >>> have assigned the graphemes of the words as transcriptions to each word). >>> Any help is appreciated. >>> >>> This is the beginning of the file exp/mono/graph/mkgraph_phn.log: >>> >>> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >>> exp/mono/graph_phn >>> # Started at Fri Dec 27 18:57:19 CET 2013 >>> # >>> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >>> data/lang_test_phn-mono/G.fst >>> fstdeterminizestar --use-log=true >>> fstminimizeencoded >>> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >>> 0.000358155 -0.000356635 >>> fstcomposecontext --context-size=1 --central-position=0 >>> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >>> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >>> fstcomposecontext: ../fstext/context-fst-inl.h:105: >>> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >>> std::vector<B, std::allocator< >>> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with >>> Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion >>> `subsequenti >>> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && >>> phone_syms_.count(subsequential_symbol) == 0' failed. >>> utils/mkgraph.sh: line 76: 6263 Aborted >>> fstcomposecontext --context-size=$N --central-position=$P >>> --read-disambig-syms=$lang/phones/disambig. >>> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >>> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >>> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> ERROR: FstHeader::Read: Bad FST header: >>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>> Reading FST: error reading FST header from >>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >>> Reading FST: error reading FST header from >>> data/lang_test_phn-mono/tmp/CLG_1_0.fst >>> >>> >>> >>> ------------------------------------------------------------------------------ >>> Rapidly troubleshoot problems before they affect your business. Most IT >>> organizations don't have a clear picture of how application performance >>> affects their revenue. With AppDynamics, you get 100% visibility into >>> your >>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of >>> AppDynamics Pro! >>> >>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk >>> _______________________________________________ >>> Kaldi-developers mailing list >>> Kal...@li... >>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>> >>> >> > |
From: Xavier A. <xan...@gm...> - 2013-12-28 00:18:49
|
Dear Dan, thank you for your help. Next are the tests you asked me to perform: Running utils/validate_lang.pl data/lang_test_phn-mono/ gives: Checking data/lang_test_phn-mono//phones/roots.{txt, int} ... --> 30 entry/entries in data/lang_test_phn-mono//phones/roots.txt --> data/lang_test_phn-mono//phones/roots.int corresponds to data/lang_test_phn-mono//phones/roots.txt --> data/lang_test_phn-mono//phones/roots.{txt, int} are OK Checking data/lang_test_phn-mono//phones/sets.{txt, int} ... --> 30 entry/entries in data/lang_test_phn-mono//phones/sets.txt --> data/lang_test_phn-mono//phones/sets.int corresponds to data/lang_test_phn-mono//phones/sets.txt --> data/lang_test_phn-mono//phones/sets.{txt, int} are OK Checking data/lang_test_phn-mono//phones/extra_questions.{txt, int} ... --> 9 entry/entries in data/lang_test_phn-mono//phones/extra_questions.txt --> data/lang_test_phn-mono//phones/extra_questions.int corresponds to data/lang_test_phn-mono//phones/extra_questions.txt --> data/lang_test_phn-mono//phones/extra_questions.{txt, int} are OK Checking disjoint: silence.txt, nosilenct.txt, disambig.txt ... --> silence.txt and nonsilence.txt are disjoint --> silence.txt and disambig.txt are disjoint --> disambig.txt and nonsilence.txt are disjoint --> disjoint property is OK Checking sumation: silence.txt, nonsilence.txt, disambig.txt ... --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or not exists Checking optional_silence.txt ... --> reading data/lang_test_phn-mono//phones/optional_silence.txt --> data/lang_test_phn-mono//phones/optional_silence.txt is OK Checking disambiguation symbols: #0 and #1 --> ERROR: data/lang_test_phn-mono//phones/disambig.txt is empty or not exists --> ERROR: data/lang_test_phn-mono//phones/disambig.txt doesn't have "#0" or "#1" Checking topo ... --> data/lang_test_phn-mono//topo's nonsilence section is OK --> data/lang_test_phn-mono//topo's silence section is OK --> data/lang_test_phn-mono//topo is OK Checking data/lang_test_phn-mono//oov.{txt, int} ... --> ERROR: fail to open data/lang_test_phn-mono//oov.txt --> ERROR Apparently I do not have either oov.txt nore disambig.txt Probably the test data I am using does not have any OOV in it. I can add it artificially, but I guess this is not the main problem here... regarding the disambig.txt file, what should it contain? I did run gdb as you indicated (thank you for such detailed info) and gives me: (gdb) p subsequential_symbol $1 = 97 (gdb) p disambig_syms_.count(subsequential_symbol) $2 = 0 (gdb) p phone_syms_.count(subsequential_symbol) $3 = 1 (gdb) p phone_syms_.size() $4 = 78 (gdb) p disambig_syms_.size() $5 = 0 Finally, the contents of cat data/lang_test_phn-mono/phones/disambig.int is also empty. Thanks again for your help! yours, Xavier Anguera On Fri, Dec 27, 2013 at 10:26 PM, Daniel Povey <dp...@gm...> wrote: > Could you please do the following. [apologies if you already know gdb] > > First do utils/validate_lang.pl data/lang_test_phn-mono/ > and let me know if it fails. > If it doesn't fail, do: > > gdb --args fstcomposecontext --context-size=1 --central-position=0 > --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int > --write-disambig-syms=data/lang_test_ > phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 > data/lang_test_phn-mono/tmp/LG.fst > > (gdb) r > # wait till it crashes > # go up the stack by typing "up" until you get to the right frame; type > "down" if you go too far > > (gdb) p subsequential_symbol > (gdb) p disambig_syms_.count(subsequential_symbol) > (gdb) p phone_syms_.count(subsequential_symbol) > (gdb) p phone_syms_.size() > (gdb) p disambig_syms_.size() > (gdb) quit > > [I hope this works; sometimes it will fail because functions are inlined]. > Anyway, send the output, and also > cat data/lang_test_phn-mono/phones/disambig.int > and show me that output too. > > Dan > > > > > On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera <xan...@gm...>wrote: > >> Dear all, >> I am encounering a problem when training mono-state NN using a recipe >> adapted from the SWBD S5 recipe. I am able to train, decode and phone-align >> a GMM system, but when I use these results to train the NN I get the >> following error (see below). I have used this recipe in the past to >> successfully train one ASR system and now the only difference is that I am >> trying to train a similar system using graphemes are phonemes (for which I >> have assigned the graphemes of the words as transcriptions to each word). >> Any help is appreciated. >> >> This is the beginning of the file exp/mono/graph/mkgraph_phn.log: >> >> # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono >> exp/mono/graph_phn >> # Started at Fri Dec 27 18:57:19 CET 2013 >> # >> fsttablecompose data/lang_test_phn-mono/L_disambig.fst >> data/lang_test_phn-mono/G.fst >> fstdeterminizestar --use-log=true >> fstminimizeencoded >> fstisstochastic data/lang_test_phn-mono/tmp/LG.fst >> 0.000358155 -0.000356635 >> fstcomposecontext --context-size=1 --central-position=0 >> --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ >> phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 >> fstcomposecontext: ../fstext/context-fst-inl.h:105: >> fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const >> std::vector<B, std::allocator< >> _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with Arc >> = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion >> `subsequenti >> al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && >> phone_syms_.count(subsequential_symbol) == 0' failed. >> utils/mkgraph.sh: line 76: 6263 Aborted >> fstcomposecontext --context-size=$N --central-position=$P >> --read-disambig-syms=$lang/phones/disambig. >> int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int >> $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg >> fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst >> ERROR: FstHeader::Read: Bad FST header: >> data/lang_test_phn-mono/tmp/CLG_1_0.fst >> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >> Reading FST: error reading FST header from >> data/lang_test_phn-mono/tmp/CLG_1_0.fst >> ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) >> Reading FST: error reading FST header from >> data/lang_test_phn-mono/tmp/CLG_1_0.fst >> >> >> >> ------------------------------------------------------------------------------ >> Rapidly troubleshoot problems before they affect your business. Most IT >> organizations don't have a clear picture of how application performance >> affects their revenue. With AppDynamics, you get 100% visibility into your >> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics >> Pro! >> >> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk >> _______________________________________________ >> Kaldi-developers mailing list >> Kal...@li... >> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >> >> > |
From: Daniel P. <dp...@gm...> - 2013-12-27 21:26:11
|
Could you please do the following. [apologies if you already know gdb] First do utils/validate_lang.pl data/lang_test_phn-mono/ and let me know if it fails. If it doesn't fail, do: gdb --args fstcomposecontext --context-size=1 --central-position=0 --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int --write-disambig-syms=data/lang_test_ phn-mono/tmp/disambig_ilabels_1_0.int data/lang_test_phn-mono/tmp/ilabels_1_0 data/lang_test_phn-mono/tmp/LG.fst (gdb) r # wait till it crashes # go up the stack by typing "up" until you get to the right frame; type "down" if you go too far (gdb) p subsequential_symbol (gdb) p disambig_syms_.count(subsequential_symbol) (gdb) p phone_syms_.count(subsequential_symbol) (gdb) p phone_syms_.size() (gdb) p disambig_syms_.size() (gdb) quit [I hope this works; sometimes it will fail because functions are inlined]. Anyway, send the output, and also cat data/lang_test_phn-mono/phones/disambig.int and show me that output too. Dan On Fri, Dec 27, 2013 at 10:23 AM, Xavier Anguera <xan...@gm...> wrote: > Dear all, > I am encounering a problem when training mono-state NN using a recipe > adapted from the SWBD S5 recipe. I am able to train, decode and phone-align > a GMM system, but when I use these results to train the NN I get the > following error (see below). I have used this recipe in the past to > successfully train one ASR system and now the only difference is that I am > trying to train a similar system using graphemes are phonemes (for which I > have assigned the graphemes of the words as transcriptions to each word). > Any help is appreciated. > > This is the beginning of the file exp/mono/graph/mkgraph_phn.log: > > # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono > exp/mono/graph_phn > # Started at Fri Dec 27 18:57:19 CET 2013 > # > fsttablecompose data/lang_test_phn-mono/L_disambig.fst > data/lang_test_phn-mono/G.fst > fstdeterminizestar --use-log=true > fstminimizeencoded > fstisstochastic data/lang_test_phn-mono/tmp/LG.fst > 0.000358155 -0.000356635 > fstcomposecontext --context-size=1 --central-position=0 > --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ > phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 > fstcomposecontext: ../fstext/context-fst-inl.h:105: > fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const > std::vector<B, std::allocator< > _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with Arc > = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion > `subsequenti > al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && > phone_syms_.count(subsequential_symbol) == 0' failed. > utils/mkgraph.sh: line 76: 6263 Aborted fstcomposecontext > --context-size=$N --central-position=$P > --read-disambig-syms=$lang/phones/disambig. > int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int > $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg > fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst > ERROR: FstHeader::Read: Bad FST header: > data/lang_test_phn-mono/tmp/CLG_1_0.fst > ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) > Reading FST: error reading FST header from > data/lang_test_phn-mono/tmp/CLG_1_0.fst > ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) > Reading FST: error reading FST header from > data/lang_test_phn-mono/tmp/CLG_1_0.fst > > > > ------------------------------------------------------------------------------ > Rapidly troubleshoot problems before they affect your business. Most IT > organizations don't have a clear picture of how application performance > affects their revenue. With AppDynamics, you get 100% visibility into your > Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics > Pro! > http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: Xavier A. <xan...@gm...> - 2013-12-27 18:24:05
|
Dear all, I am encounering a problem when training mono-state NN using a recipe adapted from the SWBD S5 recipe. I am able to train, decode and phone-align a GMM system, but when I use these results to train the NN I get the following error (see below). I have used this recipe in the past to successfully train one ASR system and now the only difference is that I am trying to train a similar system using graphemes are phonemes (for which I have assigned the graphemes of the words as transcriptions to each word). Any help is appreciated. This is the beginning of the file exp/mono/graph/mkgraph_phn.log: # utils/mkgraph.sh --mono data/lang_test_phn-mono exp/mono exp/mono/graph_phn # Started at Fri Dec 27 18:57:19 CET 2013 # fsttablecompose data/lang_test_phn-mono/L_disambig.fst data/lang_test_phn-mono/G.fst fstdeterminizestar --use-log=true fstminimizeencoded fstisstochastic data/lang_test_phn-mono/tmp/LG.fst 0.000358155 -0.000356635 fstcomposecontext --context-size=1 --central-position=0 --read-disambig-syms=data/lang_test_phn-mono/phones/disambig.int--write-disambig-syms=data/lang_test_ phn-mono/tmp/disambig_ilabels_1_0.intdata/lang_test_phn-mono/tmp/ilabels_1_0 fstcomposecontext: ../fstext/context-fst-inl.h:105: fst::ContextFstImpl<Arc, LabelT>::ContextFstImpl(typename Arc::Label, const std::vector<B, std::allocator< _T2> >&, const std::vector<B, std::allocator<_T2> >&, int, int) [with Arc = fst::ArcTpl<fst::TropicalWeightTpl<float> >, LabelT = int]: Assertion `subsequenti al_symbol != 0 && disambig_syms_.count(subsequential_symbol) == 0 && phone_syms_.count(subsequential_symbol) == 0' failed. utils/mkgraph.sh: line 76: 6263 Aborted fstcomposecontext --context-size=$N --central-position=$P --read-disambig-syms=$lang/phones/disambig. int --write-disambig-syms=$lang/tmp/disambig_ilabels_${N}_${P}.int $lang/tmp/ilabels_${N}_${P} < $lang/tmp/LG.fst > $clg fstisstochastic data/lang_test_phn-mono/tmp/CLG_1_0.fst ERROR: FstHeader::Read: Bad FST header: data/lang_test_phn-mono/tmp/CLG_1_0.fst ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst ERROR (fstisstochastic:ReadFstKaldi():fstext/fstext-utils-inl.h:1183) Reading FST: error reading FST header from data/lang_test_phn-mono/tmp/CLG_1_0.fst |
From: Christopher K. <chr...@a2...> - 2013-12-27 09:25:18
|
Hi, yes it is fixed since : *Revision 3352 by danielpovey <http://jenkins.a2ialab.com/jenkins/user/danielpovey/>: * trunk: Compilation fix to test code; minor cosmetic fix. -- Christophe Le 12/26/2013 02:07 AM, Daniel Povey a écrit : > Yes, I think it works now. The build.log was attached to the email > from jenkins and it has what you need. > Dan > > > > On Wed, Dec 25, 2013 at 5:02 PM, Ho Yin Chan > <ric...@gm... <mailto:ric...@gm...>> wrote: > > > Does it work now ? > > BTW , can't access your console output . > > > ---------------------------------------------------------------------- > > Message: 1 > Date: Wed, 25 Dec 2013 04:53:39 +0100 (CET) > From: jen...@a2... <mailto:jen...@a2...> > Subject: [Kaldi-developers] [Jenkins] Kaldi automatic build > and test > To: kal...@li... > <mailto:kal...@li...> > Message-ID: > > <176...@ks... > <mailto:176...@ks...>> > Content-Type: text/plain; charset="utf-8" > > > Kaldi - Build # 330 - Still Failing: > > Check console output at > http://jenkins.a2ialab.com/jenkins/job/Kaldi/330/ to view the > results. > > > ------------------------------------------------------------------------------ > Rapidly troubleshoot problems before they affect your business. > Most IT > organizations don't have a clear picture of how application > performance > affects their revenue. With AppDynamics, you get 100% visibility > into your > Java,.NET, & PHP application. Start your 15-day FREE TRIAL of > AppDynamics Pro! > http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > <mailto:Kal...@li...> > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > -- Christopher Kermorvant R&D Manager - A2iA - France 39 rue de la Bienfaisance - 75008 Paris +33 (0) 144 420 086 / +33 (0) 689 513 601 (mobile) |
From: Daniel P. <dp...@gm...> - 2013-12-26 01:07:29
|
Yes, I think it works now. The build.log was attached to the email from jenkins and it has what you need. Dan On Wed, Dec 25, 2013 at 5:02 PM, Ho Yin Chan <ric...@gm...>wrote: > > Does it work now ? > > BTW , can't access your console output . > > > ---------------------------------------------------------------------- >> >> Message: 1 >> Date: Wed, 25 Dec 2013 04:53:39 +0100 (CET) >> From: jen...@a2... >> Subject: [Kaldi-developers] [Jenkins] Kaldi automatic build and test >> To: kal...@li... >> Message-ID: >> <176...@ks...> >> Content-Type: text/plain; charset="utf-8" >> >> >> Kaldi - Build # 330 - Still Failing: >> >> Check console output at http://jenkins.a2ialab.com/jenkins/job/Kaldi/330/to view the results. >> >> > > ------------------------------------------------------------------------------ > Rapidly troubleshoot problems before they affect your business. Most IT > organizations don't have a clear picture of how application performance > affects their revenue. With AppDynamics, you get 100% visibility into your > Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics > Pro! > http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: Ho Y. C. <ric...@gm...> - 2013-12-26 01:02:45
|
Does it work now ? BTW , can't access your console output . ---------------------------------------------------------------------- > > Message: 1 > Date: Wed, 25 Dec 2013 04:53:39 +0100 (CET) > From: jen...@a2... > Subject: [Kaldi-developers] [Jenkins] Kaldi automatic build and test > To: kal...@li... > Message-ID: > <176...@ks...> > Content-Type: text/plain; charset="utf-8" > > Kaldi - Build # 330 - Still Failing: > > Check console output at http://jenkins.a2ialab.com/jenkins/job/Kaldi/330/to view the results. > > |
From: <jen...@a2...> - 2013-12-25 09:42:59
|
Kaldi - Build # 329 - Still Failing: Check console output at http://jenkins.a2ialab.com/jenkins/job/Kaldi/329/ to view the results. |
From: <jen...@a2...> - 2013-12-25 05:12:59
|
Kaldi - Build # 330 - Still Failing: Check console output at http://jenkins.a2ialab.com/jenkins/job/Kaldi/330/ to view the results. |
From: <jen...@a2...> - 2013-12-25 02:18:51
|
Kaldi - Build # 328 - Failure: Check console output at http://jenkins.a2ialab.com/jenkins/job/Kaldi/328/ to view the results. |
From: Daniel P. <dp...@gm...> - 2013-12-24 23:56:17
|
Guys, when looking at the code I discovered the memory leak. It wasn't detectable by valgrind, and the memory was deleted at the end of the program, but it had the same effect as a memory leak. It's fixed in the trunk now. Dan On Wed, Aug 28, 2013 at 9:01 AM, Daniel Povey <dp...@gm...> wrote: > That does sound like a memory leak-- try running it for just a few > utterances with valgrind. It's possible the memory leak happens only > under certain rare circumstances, so if you don't see it there you may > have to run for a thousand utterances or so and see if valgrind > reports a leak. > Dan > > > On Wed, Aug 28, 2013 at 3:48 PM, Li Peng <lip...@gm...> wrote: > > Sorry for that I don't know the details of how the program works, so I'm > not > > sure how to tune the options properly. > > > > The memory consumption grows little by little as the number of processed > > utterances increases. When the program runs for one day, it grows from > the > > starting 100MB to several GB. > > > > 在 2013年8月28日 下午9:33,"Arnab Ghoshal" <ar...@gm...>写道: > > > >> It is possible for the lattice generation to take a lot of memory. > >> Have you tried changing the --max-mem, --lattice-beam, --beam options > >> to see if it runs properly? Also, does this happen for a particular > >> utterance or all utterances? > >> -Arnab > >> > >> On Wed, Aug 28, 2013 at 7:22 AM, Li Peng <lip...@gm...> wrote: > >> > Hi, > >> > > >> > When I used latgen-faster-mapped with DNN model to generate lattices, > I > >> > observed that the memory usage keeps growing and exhausts the system's > >> > memory at last. I tried to use valgrind to find if there's a memory > >> > leak, > >> > but got no clues. So I write to report this problem, but I'm not sure > if > >> > there is a bug or it is just my own case. > >> > > >> > Best regards, > >> > > >> > Li Peng > >> > > >> > > >> > > ------------------------------------------------------------------------------ > >> > Learn the latest--Visual Studio 2012, SharePoint 2013, SQL 2012, more! > >> > Discover the easy way to master current and previous Microsoft > >> > technologies > >> > and advance your career. Get an incredible 1,500+ hours of > step-by-step > >> > tutorial videos with LearnDevNow. Subscribe today and save! > >> > > >> > > http://pubads.g.doubleclick.net/gampad/clk?id=58040911&iu=/4140/ostg.clktrk > >> > _______________________________________________ > >> > Kaldi-developers mailing list > >> > Kal...@li... > >> > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > >> > > > > > > > > ------------------------------------------------------------------------------ > > Learn the latest--Visual Studio 2012, SharePoint 2013, SQL 2012, more! > > Discover the easy way to master current and previous Microsoft > technologies > > and advance your career. Get an incredible 1,500+ hours of step-by-step > > tutorial videos with LearnDevNow. Subscribe today and save! > > > http://pubads.g.doubleclick.net/gampad/clk?id=58040911&iu=/4140/ostg.clktrk > > _______________________________________________ > > Kaldi-developers mailing list > > Kal...@li... > > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > > |
From: Daniel P. <dp...@gm...> - 2013-12-12 18:07:55
|
Hi-- Sorry, this was an oversight. A dependency on thread/ by gmm/ was introduced, and kaldi-thread.a was not added in the dependencies in the Makefile in the onlinebin/ directory. It's added now. Dan On Thu, Dec 12, 2013 at 11:59 AM, Eamonn Kenny <Eam...@cs...>wrote: > Dear Developers, > > For 3 weeks now I've found that I cannot build the onlinebin executables > whereas it was easy to do it before them. I've build scripts to do > everything so nothing has changed on my side. > > I'm running the following for a number of months now under Debian Wheezy > 7.2: > > cd tools > make > ./install_portaudio.sh > cd ../src > ./configure --shared > make depend > make > make ext > > everything goes well until the following error in 'make ext': > > make -C online > make[1]: Entering directory > `/home/ekenny/eclipse/DI2/build/kaldi_trunk/src/online' > make[1]: Nothing to be done for `all'. > make[1]: Leaving directory > `/home/ekenny/eclipse/DI2/build/kaldi_trunk/src/online' > make -C onlinebin > make[1]: Entering directory > `/home/ekenny/eclipse/DI2/build/kaldi_trunk/src/onlinebin' > g++ -rdynamic > -Wl,-rpath=/home/ekenny/eclipse/DI2/build/kaldi_trunk/tools/openfst/lib > -Wl,-rpath=/home/ekenny/eclipse/DI2/build/kaldi_trunk/src/lib -L. > -L../online/ -L../lat/ -L../decoder/ -L../feat/ -L../transform/ > -L../gmm/ -L../hmm/ -L../tree/ -L../matrix/ -L../util/ > -L../base/ online-net-client.o ../online//libkaldi-online.so > ../lat//libkaldi-lat.so ../decoder//libkaldi-decoder.so > ../feat//libkaldi-feat.so ../transform//libkaldi-transform.so > ../gmm//libkaldi-gmm.so ../hmm//libkaldi-hmm.so > ../tree//libkaldi-tree.so ../matrix//libkaldi-matrix.so > ../util//libkaldi-util.so ../base//libkaldi-base.so > ../../tools/portaudio/install/lib/libportaudio.a -lasound -lrt > -L/home/ekenny/eclipse/DI2/build/kaldi_trunk/tools/openfst/lib -lfst > -L/usr/lib -llapack_atlas -lcblas -latlas -lf77blas -lm -lpthread -ldl > -lkaldi-online -lkaldi-lat -lkaldi-decoder -lkaldi-feat > -lkaldi-transform -lkaldi-gmm -lkaldi-hmm -lkaldi-tree > -lkaldi-matrix -lkaldi-util -lkaldi-base -o online-net-client > ../gmm//libkaldi-gmm.so: undefined reference to `typeinfo for > kaldi::MultiThreadable' > ../gmm//libkaldi-gmm.so: undefined reference to > `kaldi::MultiThreadable::~MultiThreadable()' > ../gmm//libkaldi-gmm.so: undefined reference to `vtable for > kaldi::MultiThreadable' > collect2: error: ld returned 1 exit status > make[1]: *** [online-net-client] Error 1 > make[1]: Leaving directory > `/home/ekenny/eclipse/DI2/build/kaldi_trunk/src/onlinebin' > make: *** [onlinebin] Error 2 > > Does anyone know what the error refers to? I can't trace it back to its > source. > > -- > Best Regards, > Eamonn Kenny > > + + + > Email Eam...@cs... CNGL/Speech Communication Lab, > Tel: 00+353-1-8961797 Dept. of Computer Science, > F.34, O'Reilly Institute, > http://www.cs.tcd.ie/Eamonn.Kenny Trinity College Dublin, > http://eamonnmkenny.wordpress.com Dublin 2, Ireland. > + + + > > > > ------------------------------------------------------------------------------ > Rapidly troubleshoot problems before they affect your business. Most IT > organizations don't have a clear picture of how application performance > affects their revenue. With AppDynamics, you get 100% visibility into your > Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics > Pro! > http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > |
From: Eamonn K. <Eam...@cs...> - 2013-12-12 17:00:30
|
Dear Developers, For 3 weeks now I've found that I cannot build the onlinebin executables whereas it was easy to do it before them. I've build scripts to do everything so nothing has changed on my side. I'm running the following for a number of months now under Debian Wheezy 7.2: cd tools make ./install_portaudio.sh cd ../src ./configure --shared make depend make make ext everything goes well until the following error in 'make ext': make -C online make[1]: Entering directory `/home/ekenny/eclipse/DI2/build/kaldi_trunk/src/online' make[1]: Nothing to be done for `all'. make[1]: Leaving directory `/home/ekenny/eclipse/DI2/build/kaldi_trunk/src/online' make -C onlinebin make[1]: Entering directory `/home/ekenny/eclipse/DI2/build/kaldi_trunk/src/onlinebin' g++ -rdynamic -Wl,-rpath=/home/ekenny/eclipse/DI2/build/kaldi_trunk/tools/openfst/lib -Wl,-rpath=/home/ekenny/eclipse/DI2/build/kaldi_trunk/src/lib -L. -L../online/ -L../lat/ -L../decoder/ -L../feat/ -L../transform/ -L../gmm/ -L../hmm/ -L../tree/ -L../matrix/ -L../util/ -L../base/ online-net-client.o ../online//libkaldi-online.so ../lat//libkaldi-lat.so ../decoder//libkaldi-decoder.so ../feat//libkaldi-feat.so ../transform//libkaldi-transform.so ../gmm//libkaldi-gmm.so ../hmm//libkaldi-hmm.so ../tree//libkaldi-tree.so ../matrix//libkaldi-matrix.so ../util//libkaldi-util.so ../base//libkaldi-base.so ../../tools/portaudio/install/lib/libportaudio.a -lasound -lrt -L/home/ekenny/eclipse/DI2/build/kaldi_trunk/tools/openfst/lib -lfst -L/usr/lib -llapack_atlas -lcblas -latlas -lf77blas -lm -lpthread -ldl -lkaldi-online -lkaldi-lat -lkaldi-decoder -lkaldi-feat -lkaldi-transform -lkaldi-gmm -lkaldi-hmm -lkaldi-tree -lkaldi-matrix -lkaldi-util -lkaldi-base -o online-net-client ../gmm//libkaldi-gmm.so: undefined reference to `typeinfo for kaldi::MultiThreadable' ../gmm//libkaldi-gmm.so: undefined reference to `kaldi::MultiThreadable::~MultiThreadable()' ../gmm//libkaldi-gmm.so: undefined reference to `vtable for kaldi::MultiThreadable' collect2: error: ld returned 1 exit status make[1]: *** [online-net-client] Error 1 make[1]: Leaving directory `/home/ekenny/eclipse/DI2/build/kaldi_trunk/src/onlinebin' make: *** [onlinebin] Error 2 Does anyone know what the error refers to? I can't trace it back to its source. -- Best Regards, Eamonn Kenny + + + Email Eam...@cs... CNGL/Speech Communication Lab, Tel: 00+353-1-8961797 Dept. of Computer Science, F.34, O'Reilly Institute, http://www.cs.tcd.ie/Eamonn.Kenny Trinity College Dublin, http://eamonnmkenny.wordpress.com Dublin 2, Ireland. + + + |
From: Daniel P. <dp...@gm...> - 2013-12-11 16:55:16
|
Sorry, I don't think Kaldi is the right toolkit in your case, likely it would be easier for you to get started in Matlab or something like that. Once you are familiar with all the issues and need to scale up, maybe you could use Kaldi. Dan On Tue, Dec 10, 2013 at 10:48 PM, 雪莲 <amy...@16...> wrote: > Dear sir or miss : > Sorry to bother you! > I want to know whether kaldi could do linear regression instead of > softmax classification? I am a postgraduate student in China, I need to do > acoustic-articulatory inversion, I've modified the code in pre-traindbn.sh > to get a pretrained dbn, now there left is how to do linear regression on > this dbn, I'm not good at programming languages and kaldi toolkit is the > most familiar toolkit for me, so I've leaned almost all my work on kaldi. > I am in a rush to find out how to do it, I really hope you could > help me with this , if kaldi can perform linear regression, please tell me > which tool I should use. Thank you very much!!! > I am really grateful for your help! > > Amy > > > > > ------------------------------------------------------------------------------ > Rapidly troubleshoot problems before they affect your business. Most IT > organizations don't have a clear picture of how application performance > affects their revenue. With AppDynamics, you get 100% visibility into your > Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics > Pro! > http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > |
From: 雪莲 <amy...@16...> - 2013-12-11 03:48:37
|
Dear sir or miss : Sorry to bother you! I want to know whether kaldi could do linear regression instead of softmax classification? I am a postgraduate student in China, I need to do acoustic-articulatory inversion, I've modified the code in pre-traindbn.sh to get a pretrained dbn, now there left is how to do linear regression on this dbn, I'm not good at programming languages and kaldi toolkit is the most familiar toolkit for me, so I've leaned almost all my work on kaldi. I am in a rush to find out how to do it, I really hope you could help me with this , if kaldi can perform linear regression, please tell me which tool I should use. Thank you very much!!! I am really grateful for your help! Amy |
From: Daniel P. <dp...@gm...> - 2013-12-03 16:47:03
|
Thanks for pointing it out, it's fixed now. Dan On Tue, Dec 3, 2013 at 4:25 AM, Serg Re <re...@gm...> wrote: > Hi. > > After update building was stopped in fgmmbin with linking error. It > seems that in src/fgmmbin/Makefile in ADDLIBS missing library > ../thread/kaldi-thread.a. In my case adding this library is solve > problem. > > (BTW in src/gmmbin/Makefile in ADDLIBS library ../gmm/kaldi-gmm.a is > referred twice) > > Best regards, > Sergey > > > ------------------------------------------------------------------------------ > Rapidly troubleshoot problems before they affect your business. Most IT > organizations don't have a clear picture of how application performance > affects their revenue. With AppDynamics, you get 100% visibility into your > Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics > Pro! > http://pubads.g.doubleclick.net/gampad/clk?id=84349351&iu=/4140/ostg.clktrk > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > |