|
From: Mailing l. u. f. U. C. a. U. <kal...@li...> - 2013-04-16 12:14:45
|
I think you can get it by numerous ways in KALDI, one of them is :
lattice-align-words $lang/phones/word_boundary.int $model ark:- ark:- \| \
nbest-to-ctm ark:- - \| \
utils/int2sym.pl -f 5 $lang/words.txt \| \
$filter_cmd '>' $dir/ctm
Note that file $lang/phones/word_boundary.int is a phone id to
position-information
map file, you should have it if you follow the standard training procedure
Please check the following file:
egs/tidigits/s5/steps/get_train_ctm.sh
Haihua
On Tue, Apr 16, 2013 at 6:14 PM, Mailing list used for User Communication
and Updates <kal...@li...> wrote:
> Hi,
>
> you can have a look at the commands toward the end of
> egs/rm/s1/steps/decode_tri1.sh (ali-to-phones, phones-to-prons,
> prons-to-wordali). This AFAIK is the "old method" to obtain this
> information. Dan mentioned some time ago, that there are new tools
> that can be used to achieve the same, using the identity of the word
> position dependent phones, but I never had the need to use this new
> method.
>
> Vassil
>
> On Tue, Apr 16, 2013 at 1:00 PM, Mailing list used for User
> Communication and Updates <kal...@li...> wrote:
> > Hi,
> >
> > we recently started trying out kaldi here. Right now we do have a system
> up
> > and running based on the switchboard recipes, using our own (German)
> data.
> >
> > The problem we currently have is: How can you get timing information for
> > every recognized word from the decoding step? To illustrate, what we have
> > are those transcriptions, output by the decoder:
> >
> > [...]
> > SD0041_06 die letzten drei Spiele
> > [...]
> >
> > what we want is actually, for that specific example, something like
> > 003 025 die
> > 034 063 letzten
> > 070 082 drei
> > 088 104 Spiele
> > so that we have start and end frame number or time stamp for every word
> that
> > was recognized (numbers here are made up)
> >
> > How can we get that?
> >
> > Thanks in advance,
> > Thomas
> >
> >
> ------------------------------------------------------------------------------
> > Precog is a next-generation analytics platform capable of advanced
> > analytics on semi-structured data. The platform includes APIs for
> building
> > apps and a phenomenal toolset for data science. Developers can use
> > our toolset for easy data analysis & visualization. Get a free account!
> > http://www2.precog.com/precogplatform/slashdotnewsletter
> > _______________________________________________
> > Kaldi-users mailing list
> > Kal...@li...
> > https://lists.sourceforge.net/lists/listinfo/kaldi-users
> >
>
>
> ------------------------------------------------------------------------------
> Precog is a next-generation analytics platform capable of advanced
> analytics on semi-structured data. The platform includes APIs for building
> apps and a phenomenal toolset for data science. Developers can use
> our toolset for easy data analysis & visualization. Get a free account!
> http://www2.precog.com/precogplatform/slashdotnewsletter
> _______________________________________________
> Kaldi-users mailing list
> Kal...@li...
> https://lists.sourceforge.net/lists/listinfo/kaldi-users
>
|