From: Mailing l. u. f. U. C. a. U. <kal...@li...> - 2013-04-16 11:42:25
|
Thomas- take a look at egs/swbd/s5/local/score_sclite.sh within the "if [ $stage -le 0 ]; " statement where CTMs are generated. What you need is basically the CTM format used in NIST scoring. -Arnab On Tue, Apr 16, 2013 at 11:00 AM, Mailing list used for User Communication and Updates <kal...@li...> wrote: > Hi, > > we recently started trying out kaldi here. Right now we do have a system up > and running based on the switchboard recipes, using our own (German) data. > > The problem we currently have is: How can you get timing information for > every recognized word from the decoding step? To illustrate, what we have > are those transcriptions, output by the decoder: > > [...] > SD0041_06 die letzten drei Spiele > [...] > > what we want is actually, for that specific example, something like > 003 025 die > 034 063 letzten > 070 082 drei > 088 104 Spiele > so that we have start and end frame number or time stamp for every word that > was recognized (numbers here are made up) > > How can we get that? > > Thanks in advance, > Thomas > > ------------------------------------------------------------------------------ > Precog is a next-generation analytics platform capable of advanced > analytics on semi-structured data. The platform includes APIs for building > apps and a phenomenal toolset for data science. Developers can use > our toolset for easy data analysis & visualization. Get a free account! > http://www2.precog.com/precogplatform/slashdotnewsletter > _______________________________________________ > Kaldi-users mailing list > Kal...@li... > https://lists.sourceforge.net/lists/listinfo/kaldi-users > |