From: Daniel P. <dp...@gm...> - 2011-12-07 00:05:02
BTW, I just dug up some comments/notes I made for some others who wanted to do this. This is a diff against egs/wsj/s1/run.sh in kaldi-v1.0, but the general ideas should carry over to whatever version and data set you are using.

Dan

apps0:kaldi-v1.0: svn diff egs/wsj/s1/run.sh
Index: egs/wsj/s1/run.sh
===================================================================
--- egs/wsj/s1/run.sh (revision 1893)
+++ egs/wsj/s1/run.sh (working copy)
@@ -192,6 +192,20 @@
 steps/train_tri2a.sh || exit 1;

+# Command for Geoff + George:
+# note: tri2a uses 3500 utterances, about half the total.
+# tri3 (not run yet) uses them all.
+. path.sh
+ali-to-pdf exp/tri2a/final.mdl 'ark:gunzip -c exp/tri2a/cur?.ali.gz|' 'ark,t:|gzip -c > exp/tri2a/pdf_level_alignments.gz'
+
+# Command for Geoff + George to get high-dimensional
+# features on this same subset of data (example)
+scripts/filter_scp.pl exp/tri2a/train.scp data/train_wav.scp | \
+  ../../../src/featbin/compute-mfcc-feats --num-ceps=23 --verbose=2 \
+    --config=conf/mfcc.conf scp:- ark:- | \
+  ../../../src/featbin/add-deltas \
+    ark:- ark,t:- | head
+
 (scripts/mkgraph.sh data/G_tg_pruned.fst exp/tri2a/tree exp/tri2a/final.mdl exp/graph_tri2a_tg_pruned || exit 1;
  scripts/decode.sh exp/decode_tri2a_tgpr_eval92 exp/graph_tri2a_tg_pruned/HCLG.fst steps/decode_tri2a.sh data/eval_nov92.scp
  scripts/decode.sh exp/decode_tri2a_tgpr_eval93 exp/graph_tri2a_tg_pruned/HCLG.fst steps/decode_tri2a.sh data/eval_nov93.scp
@@ -217,6 +231,45 @@
  scripts/decode.sh exp/decode_tri3a_tgpr_uttdfmllr_eval92 exp/graph_tri3a_tg_pruned/HCLG.fst steps/decode_tri3a_diag_fmllr.sh data/eval_nov92.scp
 )&
+
+# Command for Geoff + George:
+. path.sh
+ali-to-pdf exp/tri3a/final.mdl 'ark:gunzip -c exp/tri3a/cur?.ali.gz|' 'ark,t:|gzip -c > exp/tri3a/pdf_level_alignments.gz'
+
+# Command for Geoff + George to get high-dimensional
+# features on this same subset of data (example)
+scripts/filter_scp.pl exp/tri3a/train.scp data/train_wav.scp | \
+  ../../../src/featbin/compute-mfcc-feats --num-ceps=23 --verbose=2 \
+    --config=conf/mfcc.conf scp:- ark:- | \
+  ../../../src/featbin/add-deltas \
+    ark:- ark,t:- | head
+
+# Command for George: decoding with scores obtained from a pipe.
+dir=exp/decode_tri3a_tgpr_eval92_pipe
+mkdir ${dir}
+scripts/split_scp.pl data/eval_nov92.scp ${dir}/{1,2,3,4,5,6,7,8}.scp
+. path.sh
+for n in 1 2 3 4 5 6 7 8; do
+  gmm-compute-likes exp/tri3a/final.mdl "ark:add-deltas --print-args=false scp:${dir}/$n.scp ark:- |" \
+    ark,t:- | \
+  decode-faster --beam=13.0 --max-active=7000 --acoustic-scale=0.0625 \
+    --word-symbol-table=data/words.txt exp/tri3a/final.mdl exp/graph_tri3a_tg_pruned/HCLG.fst \
+    ark:- ark,t:${dir}/$n.tra \
+    ark,t:${dir}/$n.ali 2> ${dir}/decode$n.log &
+done
+wait
+
+cat data/eval_nov92.txt | sed 's:<NOISE>::g' | sed 's:<SPOKEN_NOISE>::g' > $dir/test_trans.filt
+
+cat $dir/{1,2,3,4,5,6,7,8}.tra | \
+  scripts/int2sym.pl --ignore-first-field data/words.txt | \
+  sed 's:<s>::' | sed 's:</s>::' | sed 's:<UNK>::g' | \
+  compute-wer --text --mode=present ark:$dir/test_trans.filt ark,p:- >& $dir/wer
+# End command for George.
+
 # will delete:
 ## scripts/decode_queue_fmllr.sh exp/graph_tri3a_tg_pruned exp/tri3a/final.mdl exp/decode_tri3a_tg_pruned_fmllr &
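The "decoding with scores obtained from a pipe" recipe above works because decode-faster reads its acoustic scores through a text rspecifier (ark,t:-): a stream of per-utterance matrices, one row of scores per frame. Any external tool (Matlab, Python, etc.) can produce this stream. Below is a minimal sketch, assuming Kaldi's plain-text archive format for matrices ("utt-id  [", rows, closing "]"); write_text_archive is a hypothetical helper name and the utterance IDs and scores are made-up illustration data.

```python
import io
import sys

def write_text_archive(matrices, out):
    """Write {utterance-id: list of rows} as a Kaldi text matrix archive.

    Each matrix is printed as: <utt-id>  [ newline, one row per frame,
    with " ]" closing the last row -- the form decode-faster can read
    via an "ark,t:-" rspecifier (format assumed, not verified here).
    """
    for utt_id, rows in matrices.items():
        out.write(utt_id + "  [\n")
        for i, row in enumerate(rows):
            line = "  " + " ".join("%.6f" % v for v in row)
            # The last row closes the matrix with " ]".
            out.write(line + (" ]\n" if i == len(rows) - 1 else "\n"))

if __name__ == "__main__":
    # 2 frames x 2 pdf-ids of fake log-likelihood-like scores.
    scores = {"utt001": [[-1.5, -0.2], [-0.9, -2.1]]}
    write_text_archive(scores, sys.stdout)
```

In practice such a script would sit at the head of a pipe, e.g. `my_scores.py | decode-faster ...`, in place of gmm-compute-likes in the loop above.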
From: Troy L. <tro...@gm...> - 2011-12-06 08:05:11
Hi Dan,

Great! I have checked out that version. Thanks so much!

Regards,
Troy
From: Daniel P. <dp...@gm...> - 2011-12-06 07:45:04
That sounds correct. sandbox/karel is an alternative to trunk that you could use when checking it out: e.g. check out a different version of kaldi, replacing "trunk" with "sandbox/karel".

Dan
From: Troy L. <tro...@gm...> - 2011-12-06 07:42:26
Hi Karel,

Thanks so much for the detailed explanations. I'm currently using your TNet package for neural network training. It's the best tool I have ever used for training neural nets for speech recognition.

Since I don't have access to the directory sandbox/karel/ (which is not available in the trunk), from what I know so far the general steps for working with TNet and Kaldi should be as follows:
1) Align the training transcriptions with compile-train-graphs and gmm-align-compiled
2) Convert the alignments to pdf-id labels using ali-to-pdf
3) Train the neural net on the pdf-id labels using TNet
4) Decode the neural-net log-posteriors with decode-faster-mapped

Am I correct? Thanks!

Regards,
Troy
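Step 2 above, when ali-to-pdf is given a text wspecifier such as "ark,t:-", produces one line per utterance: the utterance-id followed by one integer pdf-id per frame. A small sketch of turning that output into per-frame training labels; parse_pdf_alignments is a hypothetical helper and the plain-text integer-vector archive format is an assumption based on Kaldi's text archives.

```python
def parse_pdf_alignments(lines):
    """Parse ali-to-pdf text output into {utt-id: [pdf-id per frame]}.

    Each input line is assumed to look like: "utt001 3 3 3 41 41 7",
    i.e. an utterance-id followed by one integer pdf-id per frame.
    """
    labels = {}
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        utt_id = fields[0]
        labels[utt_id] = [int(f) for f in fields[1:]]
    return labels

if __name__ == "__main__":
    # Made-up example lines, as if read from the ali-to-pdf output.
    example = ["utt001 3 3 3 41 41 7", "utt002 0 0 12"]
    ali = parse_pdf_alignments(example)
    print(sorted(ali))  # utterance ids found
```

These per-frame pdf-id lists are exactly the targets a frame-level NN trainer (TNet in step 3) would need.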
From: Karel V. <ive...@fi...> - 2011-12-05 20:39:22
Hi Troy,

For an example of how neural-network output decoding works, see the script:
/kaldi/sandbox/karel/egs/rm/s2/steps/decode_nnet_tri2a_s3.sh

At the beginning a long feature-processing pipeline is built, which ends with nnet-forward producing log-posteriors (optionally divided by priors); these are then passed to decode-faster-mapped, which decodes a matrix of size "Nframes x Npdf" (decode-faster would decode a matrix of size "Nframes x Ntransition_id").

The trunk contains only a CPU implementation of neural-network training; the GPU version is in sandbox/karel/.

Karel
From: Daniel P. <dp...@gm...> - 2011-12-05 06:19:27
|
OK-- BTW, it should be best to get scores that are somehow comparable to HMM likelihoods-- e.g. divide by the prior of each state first, if you're using the log-posteriors of states. Dan On Sun, Dec 4, 2011 at 7:16 PM, Troy Lee <tro...@gm...> wrote: > Hi Guys, > > Thanks so much for all your suggestions! I would more prefer to pipe those > neural net scores to existing Kaldi decoders and I'm testing it. > > Regards, > Troy > > On Sun, Dec 4, 2011 at 5:38 AM, Daniel Povey <dp...@gm...> wrote: > >> BTW, the basic way I recommend to decode with neural nets is to use the >> neural net to produce scores for all clustered states (pdf-ids) [as a >> matrix for each utterance], and pipe these into "decode-faster". Probably >> the scripts I pointed to below use this approach. This method can be used >> for any type of neural net. Basically you can have a Matlab program print >> out, for each utterance, the utterance-id and then a matrix of scores, e.g. >> comparable to log-likelihoods, in Matlab format (one row per frame), and >> then pipe this into decode-faster. >> >> Dan >> >> >> On Sat, Dec 3, 2011 at 11:51 AM, Daniel Povey <dp...@gm...> wrote: >> >>> Also-- in sandbox/karel/egs/rm/s2, I think there are examples of how to >>> train and decode with neural nets. >>> This stuff has not been merged back into the trunk yet, AFAIK. >>> >>> Dan >>> >>> >>> On Sat, Dec 3, 2011 at 1:53 AM, Arnab Ghoshal <ar...@gm...> wrote: >>> >>>> Hi Troy, >>>> >>>> there is currently no support for decoding with NN, but that is pretty >>>> easy to add. The decoder works with a "decodable" interface that is >>>> defined in itf/decodable-itf.h. Any acoustic modeling class needs to >>>> provide its implementation of the DecodableInterface. You can see how >>>> the implementations are for few different acoustic models (regular >>>> diag GMMs, semi-cont models, and SGMMs) in the >>>> decoder/decodable-am-*.{h,cc} files. 
The main thing needed from the >>>> acoustic model is that it is able to provide a score (log likelihood) >>>> for a given feature vector and a state in the model. In practice, a >>>> decodable class in the decoder directory does not directly call the >>>> LogLikelihood function of the corresponding acoustic model class, but >>>> reimplements it to take advantage of caching. >>>> >>>> I am not sure if you can actually do acoustic modeling with the >>>> current neural network code in Kaldi. Karel, who wrote the the neural >>>> network code, can give you more details about the NN code. But if you >>>> have your favorite C++ implementation of, say, deep belief networks, >>>> that should be fairly straightforward to use with the kaldi decoder. >>>> >>>> -Arnab >>>> >>>> On Thu, Dec 1, 2011 at 6:54 AM, Troy Lee <tro...@gm...> >>>> wrote: >>>> > Hi, >>>> > >>>> > I'm new to the Kaldi package, and just saw there is a module in the >>>> source >>>> > code called "nnet", which probably deals with Neural Network (NN) >>>> stuff. I'm >>>> > thus wondering whether there is a direct support for decoding with >>>> > likelihoods generated by neural network acoustic models in the Kaldi >>>> > decoder? Otherwise, what would be the easiest way to do so? Thanks! >>>> > >>>> > Regards, >>>> > Troy >>>> > >>>> > >>>> ------------------------------------------------------------------------------ >>>> > All the data continuously generated in your IT infrastructure >>>> > contains a definitive record of customers, application performance, >>>> > security threats, fraudulent activity, and more. Splunk takes this >>>> > data and makes sense of it. IT sense. And common sense. >>>> > http://p.sf.net/sfu/splunk-novd2d >>>> > _______________________________________________ >>>> > Kaldi-developers mailing list >>>> > Kal...@li... 
>>>> > https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>> > >>>> >>>> >>>> ------------------------------------------------------------------------------ >>>> All the data continuously generated in your IT infrastructure >>>> contains a definitive record of customers, application performance, >>>> security threats, fraudulent activity, and more. Splunk takes this >>>> data and makes sense of it. IT sense. And common sense. >>>> http://p.sf.net/sfu/splunk-novd2d >>>> _______________________________________________ >>>> Kaldi-developers mailing list >>>> Kal...@li... >>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>> >>> >>> >> > |
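[Editor's note] Dan's suggestion above, dividing the network's state posteriors by the state priors to get scores comparable to HMM likelihoods, can be sketched as follows. This is a minimal illustration, not Kaldi code; the function name and data layout are ours.

```cpp
#include <cassert>
#include <cmath>
#include <vector>

// Turn one frame of NN log-posteriors log p(state | x) into pseudo
// log-likelihoods comparable to HMM scores: by Bayes' rule,
// log p(x | state) = log p(state | x) - log p(state) + log p(x),
// and the log p(x) term is a per-frame constant the decoder can ignore.
std::vector<double> PosteriorsToPseudoLoglikes(
    const std::vector<double> &log_posts,  // one entry per state
    const std::vector<double> &priors) {   // p(state), e.g. from alignment counts
  assert(log_posts.size() == priors.size());
  std::vector<double> out(log_posts.size());
  for (size_t s = 0; s < log_posts.size(); ++s)
    out[s] = log_posts[s] - std::log(priors[s]);
  return out;
}
```

In practice the priors are usually estimated from state occupation counts in the training alignments.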
From: Troy L. <tro...@gm...> - 2011-12-05 03:16:49
|
Hi Guys, Thanks so much for all your suggestions! I would more prefer to pipe those neural net scores to existing Kaldi decoders and I'm testing it. Regards, Troy On Sun, Dec 4, 2011 at 5:38 AM, Daniel Povey <dp...@gm...> wrote: > BTW, the basic way I recommend to decode with neural nets is to use the > neural net to produce scores for all clustered states (pdf-ids) [as a > matrix for each utterance], and pipe these into "decode-faster". Probably > the scripts I pointed to below use this approach. This method can be used > for any type of neural net. Basically you can have a Matlab program print > out, for each utterance, the utterance-id and then a matrix of scores, e.g. > comparable to log-likelihoods, in Matlab format (one row per frame), and > then pipe this into decode-faster. > > Dan > > > On Sat, Dec 3, 2011 at 11:51 AM, Daniel Povey <dp...@gm...> wrote: > >> Also-- in sandbox/karel/egs/rm/s2, I think there are examples of how to >> train and decode with neural nets. >> This stuff has not been merged back into the trunk yet, AFAIK. >> >> Dan >> >> >> On Sat, Dec 3, 2011 at 1:53 AM, Arnab Ghoshal <ar...@gm...> wrote: >> >>> Hi Troy, >>> >>> there is currently no support for decoding with NN, but that is pretty >>> easy to add. The decoder works with a "decodable" interface that is >>> defined in itf/decodable-itf.h. Any acoustic modeling class needs to >>> provide its implementation of the DecodableInterface. You can see how >>> the implementations are for few different acoustic models (regular >>> diag GMMs, semi-cont models, and SGMMs) in the >>> decoder/decodable-am-*.{h,cc} files. The main thing needed from the >>> acoustic model is that it is able to provide a score (log likelihood) >>> for a given feature vector and a state in the model. In practice, a >>> decodable class in the decoder directory does not directly call the >>> LogLikelihood function of the corresponding acoustic model class, but >>> reimplements it to take advantage of caching. 
>>> >>> I am not sure if you can actually do acoustic modeling with the >>> current neural network code in Kaldi. Karel, who wrote the the neural >>> network code, can give you more details about the NN code. But if you >>> have your favorite C++ implementation of, say, deep belief networks, >>> that should be fairly straightforward to use with the kaldi decoder. >>> >>> -Arnab >>> >>> On Thu, Dec 1, 2011 at 6:54 AM, Troy Lee <tro...@gm...> wrote: >>> > Hi, >>> > >>> > I'm new to the Kaldi package, and just saw there is a module in the >>> source >>> > code called "nnet", which probably deals with Neural Network (NN) >>> stuff. I'm >>> > thus wondering whether there is a direct support for decoding with >>> > likelihoods generated by neural network acoustic models in the Kaldi >>> > decoder? Otherwise, what would be the easiest way to do so? Thanks! >>> > >>> > Regards, >>> > Troy >>> > >>> > >>> ------------------------------------------------------------------------------ >>> > All the data continuously generated in your IT infrastructure >>> > contains a definitive record of customers, application performance, >>> > security threats, fraudulent activity, and more. Splunk takes this >>> > data and makes sense of it. IT sense. And common sense. >>> > http://p.sf.net/sfu/splunk-novd2d >>> > _______________________________________________ >>> > Kaldi-developers mailing list >>> > Kal...@li... >>> > https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>> > >>> >>> >>> ------------------------------------------------------------------------------ >>> All the data continuously generated in your IT infrastructure >>> contains a definitive record of customers, application performance, >>> security threats, fraudulent activity, and more. Splunk takes this >>> data and makes sense of it. IT sense. And common sense. >>> http://p.sf.net/sfu/splunk-novd2d >>> _______________________________________________ >>> Kaldi-developers mailing list >>> Kal...@li... 
>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>> >> >> > |
From: Daniel P. <dp...@gm...> - 2011-12-03 21:38:48
|
BTW, the basic way I recommend to decode with neural nets is to use the neural net to produce scores for all clustered states (pdf-ids) [as a matrix for each utterance], and pipe these into "decode-faster". Probably the scripts I pointed to below use this approach. This method can be used for any type of neural net. Basically you can have a Matlab program print out, for each utterance, the utterance-id and then a matrix of scores, e.g. comparable to log-likelihoods, in Matlab format (one row per frame), and then pipe this into decode-faster. Dan On Sat, Dec 3, 2011 at 11:51 AM, Daniel Povey <dp...@gm...> wrote: > Also-- in sandbox/karel/egs/rm/s2, I think there are examples of how to > train and decode with neural nets. > This stuff has not been merged back into the trunk yet, AFAIK. > > Dan > > > On Sat, Dec 3, 2011 at 1:53 AM, Arnab Ghoshal <ar...@gm...> wrote: > >> Hi Troy, >> >> there is currently no support for decoding with NN, but that is pretty >> easy to add. The decoder works with a "decodable" interface that is >> defined in itf/decodable-itf.h. Any acoustic modeling class needs to >> provide its implementation of the DecodableInterface. You can see how >> the implementations are for few different acoustic models (regular >> diag GMMs, semi-cont models, and SGMMs) in the >> decoder/decodable-am-*.{h,cc} files. The main thing needed from the >> acoustic model is that it is able to provide a score (log likelihood) >> for a given feature vector and a state in the model. In practice, a >> decodable class in the decoder directory does not directly call the >> LogLikelihood function of the corresponding acoustic model class, but >> reimplements it to take advantage of caching. >> >> I am not sure if you can actually do acoustic modeling with the >> current neural network code in Kaldi. Karel, who wrote the the neural >> network code, can give you more details about the NN code. 
But if you >> have your favorite C++ implementation of, say, deep belief networks, >> that should be fairly straightforward to use with the kaldi decoder. >> >> -Arnab >> >> On Thu, Dec 1, 2011 at 6:54 AM, Troy Lee <tro...@gm...> wrote: >> > Hi, >> > >> > I'm new to the Kaldi package, and just saw there is a module in the >> source >> > code called "nnet", which probably deals with Neural Network (NN) >> stuff. I'm >> > thus wondering whether there is a direct support for decoding with >> > likelihoods generated by neural network acoustic models in the Kaldi >> > decoder? Otherwise, what would be the easiest way to do so? Thanks! >> > >> > Regards, >> > Troy >> > >> > >> ------------------------------------------------------------------------------ >> > All the data continuously generated in your IT infrastructure >> > contains a definitive record of customers, application performance, >> > security threats, fraudulent activity, and more. Splunk takes this >> > data and makes sense of it. IT sense. And common sense. >> > http://p.sf.net/sfu/splunk-novd2d >> > _______________________________________________ >> > Kaldi-developers mailing list >> > Kal...@li... >> > https://lists.sourceforge.net/lists/listinfo/kaldi-developers >> > >> >> >> ------------------------------------------------------------------------------ >> All the data continuously generated in your IT infrastructure >> contains a definitive record of customers, application performance, >> security threats, fraudulent activity, and more. Splunk takes this >> data and makes sense of it. IT sense. And common sense. >> http://p.sf.net/sfu/splunk-novd2d >> _______________________________________________ >> Kaldi-developers mailing list >> Kal...@li... >> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >> > > |
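[Editor's note] The "print a matrix of scores per utterance and pipe it into decode-faster" idea Dan describes could be sketched like this. The text layout (utterance id followed by rows between brackets) is our assumption about Kaldi's text-mode matrix archives and should be checked against your Kaldi version.

```cpp
#include <cassert>
#include <iostream>
#include <sstream>
#include <string>
#include <vector>

// Write one utterance's score matrix (one row per frame, one column per
// clustered state / pdf-id) as a text archive entry: the utterance id,
// then the rows between "[" and "]". Piping such entries into
// decode-faster is the approach outlined above; the exact layout here
// is an assumption to verify against your Kaldi version.
void WriteScoreMatrix(std::ostream &os, const std::string &utt_id,
                      const std::vector<std::vector<double> > &scores) {
  os << utt_id << " [";
  for (size_t r = 0; r < scores.size(); ++r) {
    os << "\n ";
    for (size_t c = 0; c < scores[r].size(); ++c)
      os << " " << scores[r][c];
  }
  os << " ]\n";
}
```

As Dan notes, a Matlab script printing the same layout works just as well; the key is one row per frame and one score per pdf-id.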
From: Daniel P. <dp...@gm...> - 2011-12-03 19:51:13
|
Also-- in sandbox/karel/egs/rm/s2, I think there are examples of how to train and decode with neural nets. This stuff has not been merged back into the trunk yet, AFAIK. Dan On Sat, Dec 3, 2011 at 1:53 AM, Arnab Ghoshal <ar...@gm...> wrote: > Hi Troy, > > there is currently no support for decoding with NN, but that is pretty > easy to add. The decoder works with a "decodable" interface that is > defined in itf/decodable-itf.h. Any acoustic modeling class needs to > provide its implementation of the DecodableInterface. You can see how > the implementations are for few different acoustic models (regular > diag GMMs, semi-cont models, and SGMMs) in the > decoder/decodable-am-*.{h,cc} files. The main thing needed from the > acoustic model is that it is able to provide a score (log likelihood) > for a given feature vector and a state in the model. In practice, a > decodable class in the decoder directory does not directly call the > LogLikelihood function of the corresponding acoustic model class, but > reimplements it to take advantage of caching. > > I am not sure if you can actually do acoustic modeling with the > current neural network code in Kaldi. Karel, who wrote the the neural > network code, can give you more details about the NN code. But if you > have your favorite C++ implementation of, say, deep belief networks, > that should be fairly straightforward to use with the kaldi decoder. > > -Arnab > > On Thu, Dec 1, 2011 at 6:54 AM, Troy Lee <tro...@gm...> wrote: > > Hi, > > > > I'm new to the Kaldi package, and just saw there is a module in the > source > > code called "nnet", which probably deals with Neural Network (NN) stuff. > I'm > > thus wondering whether there is a direct support for decoding with > > likelihoods generated by neural network acoustic models in the Kaldi > > decoder? Otherwise, what would be the easiest way to do so? Thanks! 
> > > > Regards, > > Troy > > > > > ------------------------------------------------------------------------------ > > All the data continuously generated in your IT infrastructure > > contains a definitive record of customers, application performance, > > security threats, fraudulent activity, and more. Splunk takes this > > data and makes sense of it. IT sense. And common sense. > > http://p.sf.net/sfu/splunk-novd2d > > _______________________________________________ > > Kaldi-developers mailing list > > Kal...@li... > > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > > > > > ------------------------------------------------------------------------------ > All the data continuously generated in your IT infrastructure > contains a definitive record of customers, application performance, > security threats, fraudulent activity, and more. Splunk takes this > data and makes sense of it. IT sense. And common sense. > http://p.sf.net/sfu/splunk-novd2d > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > |
From: Arnab G. <ar...@gm...> - 2011-12-03 09:53:51
|
Hi Troy, there is currently no support for decoding with NN, but that is pretty easy to add. The decoder works with a "decodable" interface that is defined in itf/decodable-itf.h. Any acoustic modeling class needs to provide its implementation of the DecodableInterface. You can see how the implementations are for a few different acoustic models (regular diag GMMs, semi-cont models, and SGMMs) in the decoder/decodable-am-*.{h,cc} files. The main thing needed from the acoustic model is that it is able to provide a score (log likelihood) for a given feature vector and a state in the model. In practice, a decodable class in the decoder directory does not directly call the LogLikelihood function of the corresponding acoustic model class, but reimplements it to take advantage of caching. I am not sure if you can actually do acoustic modeling with the current neural network code in Kaldi. Karel, who wrote the neural network code, can give you more details about the NN code. But if you have your favorite C++ implementation of, say, deep belief networks, that should be fairly straightforward to use with the Kaldi decoder. -Arnab On Thu, Dec 1, 2011 at 6:54 AM, Troy Lee <tro...@gm...> wrote: > Hi, > > I'm new to the Kaldi package, and just saw there is a module in the source > code called "nnet", which probably deals with Neural Network (NN) stuff. I'm > thus wondering whether there is a direct support for decoding with > likelihoods generated by neural network acoustic models in the Kaldi > decoder? Otherwise, what would be the easiest way to do so? Thanks! > > Regards, > Troy > > ------------------------------------------------------------------------------ > All the data continuously generated in your IT infrastructure > contains a definitive record of customers, application performance, > security threats, fraudulent activity, and more. Splunk takes this > data and makes sense of it. IT sense. And common sense. 
> http://p.sf.net/sfu/splunk-novd2d > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > |
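[Editor's note] A minimal sketch of the kind of decodable wrapper Arnab describes. The class itself is hypothetical; the method names mirror the interface as described in the thread, and the exact signatures should be checked against itf/decodable-itf.h in your checkout.

```cpp
#include <cassert>
#include <vector>

// Hypothetical decodable object serving precomputed NN scores to the
// decoder, in the spirit of the DecodableInterface described above.
class DecodableNnetScores {
 public:
  // scores: one row per frame, one column per state index; assumed to be
  // log-scaled and already divided by the state priors.
  explicit DecodableNnetScores(const std::vector<std::vector<float> > &scores)
      : scores_(scores) {}

  float LogLikelihood(int frame, int state_index) const {
    return scores_[frame][state_index];
  }
  bool IsLastFrame(int frame) const {
    return frame == static_cast<int>(scores_.size()) - 1;
  }
  int NumIndices() const {
    return scores_.empty() ? 0 : static_cast<int>(scores_[0].size());
  }

 private:
  std::vector<std::vector<float> > scores_;  // everything precomputed here
};
```

Because all scores are precomputed, the caching Arnab mentions is trivial in this sketch; a real implementation would call the acoustic model's LogLikelihood lazily and memoize results per frame.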
From: Troy L. <tro...@gm...> - 2011-12-01 06:55:11
|
Hi, I'm new to the Kaldi package, and just saw there is a module in the source code called "nnet", which probably deals with Neural Network (NN) stuff. I'm thus wondering whether there is a direct support for decoding with likelihoods generated by neural network acoustic models in the Kaldi decoder? Otherwise, what would be the easiest way to do so? Thanks! Regards, Troy |
From: Joey W. <joe...@gm...> - 2011-11-02 06:46:08
|
Hi, there, Firstly thanks very much for the great work of Kaldi. I have gone through the WSJ recipe using the WSJCAM0 corpus which is British accent version of WSJ. Everything went well except I had a problem of the rescoring lattices generated by the bigram with trigram, *scripts/decode.sh exp/decode_tri2a_bg_${set} exp/graph_tri2a_bg/HCLG.fst steps/decode_tri2a.sh data/si_${set}.scp* *scripts/decode.sh exp/decode_tri2a_bg_latgen_${set} exp/graph_tri2a_bg/HCLG.fst steps/decode_tri2a_latgen.sh data/si_${set}.scp* The above two steps have no problem for the bigram decoding and lattice generation. However, after lattice generation, the rescoring script: *scripts/latrescore.sh exp/decode_tri2a_bg_latgen_${set} data/G_bg.fst data/G_tg.fst data/si_${set}.kaldi exp/decode_tri2a_bg_rescore_tg_${set}* has no output. Examining the log file *remove_old_lm.log*, I have *lattice-lmrescore --lm-scale=-1.0 'ark:gunzip -c exp/decode_tri2a_bg_latgen_dt5a/*.lats.gz|' 'fstproject --project_output=true data/G_bg.fst |' ark:-* *WARNING (lattice-lmrescore:CheckMemoryUsage():fstext/determinize-lattice-inl.h:449) Failure in determinize-lattice: size exceeds maximum -1 bytes; (repo,arcs,elems) = (3040,32,72), after rebuilding, repo size was 3040* *WARNING (lattice-lmrescore:main():lattice-lmrescore.cc:131) Empty lattice for key c31c0201 (incompatible LM?)* * * The log file seems to tell that the generated lattices are too large. In fact, in the original steps/decode_tri3a_latgen.sh script, --max_arcs option is used in gmm-latgen-simple which does not have this option, therefore, I removed this option and did the lattice generation. Do you think this causes the above described problem? What about the incompatible LM? Thanks in advance. -- Best regards, Wang Guangsen ******************************* Wang Guangsen, Joe School of Computing National University of Singapore email: joe...@gm... |
From: Arnab G. <ar...@gm...> - 2011-10-06 21:29:14
|
Implementing MCE sounds like a good idea. On Thu, Oct 6, 2011 at 7:53 PM, Chao Weng <cw...@gm...> wrote: > Hi Arnab, > > Thanks for your information and suggestions. I really appreciate it. > > Currently I already finished running the receipts for RM and WSJ. And > now I'm diving into the codes and trying to understanding some > essential implementation. As I mentioned earlier, if no one is working > on MCE part, I could fill this hole. Or if you have some work want me > to do, please feel free to let me know. I will try my best to help. > > Bests, > Chao > > On Thu, Oct 6, 2011 at 12:53 PM, Arnab Ghoshal <ar...@gm...> wrote: >> On Thu, Sep 29, 2011 at 8:11 PM, Chao Weng <cw...@gm...> wrote: >>> >>> Now I'm looking into the code of Kaldi, and trying to figure out the >>> discriminative training extension for Kaldi. Is there any possibility >>> I can cooperate with the team and make some contributions. >> >> Hi Chao, it will be nice to have you collaborate on the discriminative >> training code. Maybe you already mentioned this in earlier email, >> which I missed, but can you explain what particular things you plan to >> do. If you don't have very clear plans that is OK as well. Our goal is >> to implement most state of the art techniques, and so pretty much what >> you do will fit in the general plan. >> >> The discriminative training code in sandbox/discrim is right now >> fairly messy and it will be cleared up in the next few weeks. >> Currently there is code for running FB on lattices, some bits of MPE >> code, EBW estimation code, and code for fMPE/fMMI type discriminative >> features. >> >> I think the best place to start will be to familiarize yourself with >> the toolkit. You can go through the tutorial >> http://kaldi.sourceforge.net/tutorial.html and run the recipes under >> trunk/egs. >> >> Let us know if you have any questions. >> >> Best, -Arnab >> > |
From: Chao W. <cw...@gm...> - 2011-10-06 17:54:04
|
Hi Arnab, Thanks for your information and suggestions. I really appreciate it. Currently I already finished running the receipts for RM and WSJ. And now I'm diving into the codes and trying to understanding some essential implementation. As I mentioned earlier, if no one is working on MCE part, I could fill this hole. Or if you have some work want me to do, please feel free to let me know. I will try my best to help. Bests, Chao On Thu, Oct 6, 2011 at 12:53 PM, Arnab Ghoshal <ar...@gm...> wrote: > On Thu, Sep 29, 2011 at 8:11 PM, Chao Weng <cw...@gm...> wrote: >> >> Now I'm looking into the code of Kaldi, and trying to figure out the >> discriminative training extension for Kaldi. Is there any possibility >> I can cooperate with the team and make some contributions. > > Hi Chao, it will be nice to have you collaborate on the discriminative > training code. Maybe you already mentioned this in earlier email, > which I missed, but can you explain what particular things you plan to > do. If you don't have very clear plans that is OK as well. Our goal is > to implement most state of the art techniques, and so pretty much what > you do will fit in the general plan. > > The discriminative training code in sandbox/discrim is right now > fairly messy and it will be cleared up in the next few weeks. > Currently there is code for running FB on lattices, some bits of MPE > code, EBW estimation code, and code for fMPE/fMMI type discriminative > features. > > I think the best place to start will be to familiarize yourself with > the toolkit. You can go through the tutorial > http://kaldi.sourceforge.net/tutorial.html and run the recipes under > trunk/egs. > > Let us know if you have any questions. > > Best, -Arnab > |
From: Arnab G. <ar...@gm...> - 2011-10-06 16:54:07
|
On Thu, Sep 29, 2011 at 8:11 PM, Chao Weng <cw...@gm...> wrote: > > Now I'm looking into the code of Kaldi, and trying to figure out the > discriminative training extension for Kaldi. Is there any possibility > I can cooperate with the team and make some contributions. Hi Chao, it will be nice to have you collaborate on the discriminative training code. Maybe you already mentioned this in earlier email, which I missed, but can you explain what particular things you plan to do. If you don't have very clear plans that is OK as well. Our goal is to implement most state of the art techniques, and so pretty much what you do will fit in the general plan. The discriminative training code in sandbox/discrim is right now fairly messy and it will be cleared up in the next few weeks. Currently there is code for running FB on lattices, some bits of MPE code, EBW estimation code, and code for fMPE/fMMI type discriminative features. I think the best place to start will be to familiarize yourself with the toolkit. You can go through the tutorial http://kaldi.sourceforge.net/tutorial.html and run the recipes under trunk/egs. Let us know if you have any questions. Best, -Arnab |
From: Daniel P. <dp...@gm...> - 2011-10-02 18:21:07
|
Hi, I see that this mail was sent a few days ago. I also sent an invite for you to the but10 list, which gets most of the traffic and sent an email to introduce you to the list [but you may not have got it if you did not accept the link immediately]. Let me know if you have any problems with the recipe. I myself have not gone deeply into the discriminative training code (i.e. further than MMI)-- this is on my to-do list but if you can sort everything out and get it working this would be a great help-- assuming, of course, that what you do is in line with the way we generally do things [Arnab or I could check]. Dan On Thu, Sep 29, 2011 at 11:11 AM, Chao Weng <cw...@gm...> wrote: > To whom it may concern, > > Please add me to the Kaldi's mail list, thanks. > > Now I'm looking into the code of Kaldi, and trying to figure out the > discriminative training extension for Kaldi. Is there any possibility > I can cooperate with the team and make some contributions. > > Bests, > Chao > > > ------------------------------------------------------------------------------ > All of the data generated in your IT infrastructure is seriously valuable. > Why? It contains a definitive record of application performance, security > threats, fraudulent activity, and more. Splunk takes this data and makes > sense of it. IT sense. And common sense. > http://p.sf.net/sfu/splunk-d2dcopy2 > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers > |
From: Chao W. <cw...@gm...> - 2011-09-29 18:11:37
|
To whom it may concern, Please add me to the Kaldi's mail list, thanks. Now I'm looking into the code of Kaldi, and trying to figure out the discriminative training extension for Kaldi. Is there any possibility I can cooperate with the team and make some contributions. Bests, Chao |
From: Karel V. <ve...@gm...> - 2011-06-09 11:45:46
|
Karel Vesely On 06/09/11 12:17, Mirko Hannemann wrote: > Mirko Hannemann > > ------------------------------------------------------------------------------ > EditLive Enterprise is the world's most technically advanced content > authoring tool. Experience the power of Track Changes, Inline Image > Editing and ensure content is compliant with Accessibility Checking. > http://p.sf.net/sfu/ephox-dev2dev > _______________________________________________ > Kaldi-developers mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-developers |
From: Mirko H. <mir...@go...> - 2011-06-09 10:17:32
|
Mirko Hannemann |
From: Daniel P. <dp...@gm...> - 2011-06-08 22:48:21
|
Replying-all with another test message. [ if anyone on this list (currently the Kaldi admins) doesn't want to be on it, ask Nagendra or work out how to remove yourself. ] Dan On Wed, Jun 8, 2011 at 3:16 PM, Nagendra Kumar Goel < nag...@go...> wrote: > > -- > Nagendra Kumar Goel |
From: Nagendra K. G. <nag...@go...> - 2011-06-08 22:41:55
|
-- Nagendra Kumar Goel |