From: Mailing list used for User Communication and Updates <kal...@li...> - 2013-04-16 12:14:45
I think you can get it in several ways in Kaldi; one of them is:

  lattice-align-words $lang/phones/word_boundary.int $model ark:- ark:- \| \
    nbest-to-ctm ark:- - \| \
    utils/int2sym.pl -f 5 $lang/words.txt \| \
    $filter_cmd '>' $dir/ctm

Note that the file $lang/phones/word_boundary.int maps phone ids to
word-position information; you should have it if you followed the
standard training procedure.

Please check the following file:
egs/tidigits/s5/steps/get_train_ctm.sh

Haihua

On Tue, Apr 16, 2013 at 6:14 PM, Mailing list used for User Communication
and Updates <kal...@li...> wrote:
> Hi,
>
> you can have a look at the commands toward the end of
> egs/rm/s1/steps/decode_tri1.sh (ali-to-phones, phones-to-prons,
> prons-to-wordali). This AFAIK is the "old method" to obtain this
> information. Dan mentioned some time ago that there are new tools
> that can be used to achieve the same, using the identity of the
> word-position-dependent phones, but I never had the need to use this
> new method.
>
> Vassil
>
> On Tue, Apr 16, 2013 at 1:00 PM, Mailing list used for User
> Communication and Updates <kal...@li...> wrote:
> > Hi,
> >
> > we recently started trying out Kaldi here. Right now we have a
> > system up and running based on the Switchboard recipes, using our
> > own (German) data.
> >
> > The problem we currently have is: how can you get timing information
> > for every recognized word from the decoding step? To illustrate,
> > what we have are these transcriptions, output by the decoder:
> >
> > [...]
> > SD0041_06 die letzten drei Spiele
> > [...]
> >
> > What we want is actually, for that specific example, something like
> > 003 025 die
> > 034 063 letzten
> > 070 082 drei
> > 088 104 Spiele
> > so that we have a start and end frame number or time stamp for every
> > word that was recognized (numbers here are made up).
> >
> > How can we get that?
> >
> > Thanks in advance,
> > Thomas
> >
> > _______________________________________________
> > Kaldi-users mailing list
> > Kal...@li...
> > https://lists.sourceforge.net/lists/listinfo/kaldi-users
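
For reference, the CTM file written by the pipeline in the reply above can be turned into the start/end-frame layout asked for in the original question. The following is only a minimal sketch, not part of Kaldi: the function name ctm_to_frames is hypothetical, it assumes each CTM line has the five fields "utterance channel start duration word" with times in seconds, and it assumes a frame shift of 0.01 s, which should be checked against your own feature configuration. It also keeps the utterance id as a first column, which the question's example did not show.

```shell
#!/bin/sh
# ctm_to_frames (hypothetical helper, not a Kaldi tool):
# reads CTM lines "utt channel start duration word" on stdin and prints
# "utt start_frame end_frame word".
# Optional argument: frame shift in seconds (assumed default: 0.01).
ctm_to_frames() {
  awk -v frame_shift="${1:-0.01}" '
    # Skip comment lines (starting with ";;") and lines with too few fields.
    NF >= 5 && $1 !~ /^;;/ {
      # Round to the nearest frame to absorb floating-point noise.
      start_frame = int($3 / frame_shift + 0.5)
      end_frame   = int(($3 + $4) / frame_shift + 0.5)
      printf "%s %03d %03d %s\n", $1, start_frame, end_frame, $5
    }'
}
```

Assuming the CTM ended up in $dir/ctm as in the reply, it could be used as `ctm_to_frames < $dir/ctm`, or as `ctm_to_frames 0.02 < $dir/ctm` for a different frame shift.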