From: Kartik A. <aud...@us...> - 2013-03-11 16:02:44
|
Thanks. I understand the OpenFST format but was under the impression that lattice-align-words will replace FST state IDs with the corresponding frame numbers. On Mon, Mar 11, 2013 at 8:59 AM, Daniel Povey <dp...@gm...> wrote: > cc-ing kaldi-developers so it's archived, in case anyone has similar > questions. > > You need to understand the OpenFst format. This is the "acceptor" version > of the format, which is (start-state end-state symbol weight), and the > weight is (graph cost, acoustic cost, sequence of transition-ids). [the > "normal", non-acceptor format is start-state end-state input-symbol > output-symbol weight]. The time can be obtained by summing up the number > of transition-ids starting from the beginning of the lattice; in the code > it's LatticeStateTimes. > > Note that the state numbers are arbitrary in a sense, they contain no real > information. See www.openfst.org for more info on WFSTs. > > > On Mon, Mar 11, 2013 at 11:55 AM, Kartik Audhkhasi <aud...@us...>wrote: > >> Thanks Dan. I have started using the new scripts. However the timing >> issue still remains. I used lattice-align-words to get times on the lattice >> nodes in the same way as is demonstrated in the run.sh script. I think I am >> not interpreting the times correctly. Do the IDs on both start and end >> nodes represent frame numbers? E.g. the first line of my lattice is: >> >> 0 2337 44870 >> 17.0063,2553.48,9468_9482_9492_9491_9491_9491_9491_9910_9909_9909_9916_9924_9194_9210_9242_9241_9241_9241 >> >> Does this says that word 44870 goes from frame 0 to frame 2237? The >> transition ID sequence however shows only 18 frames. >> >> >> On Sun, Mar 10, 2013 at 9:26 AM, Daniel Povey <dp...@gm...> wrote: >> >>> And RE how to debug it-- before and after lattice-align-words, you could >>> run something like lattice-to-post; this program will crash if there are >>> inconsistent times in the lattice, i.e. the lengths of input-symbol >>> sequences are not all the same. I suspect you actually mixed something up. >>> Dan >>> >>> >>> On Sun, Mar 10, 2013 at 12:24 PM, Daniel Povey <dp...@gm...> wrote: >>> >>>> It's a shame that you're using the older versions of the script. >>>> Currently the "s5" scripts are the canonical ones. Your issue with times >>>> greater than the length of the file is very unexpected. This is not the >>>> kind of error I would expect to ever arise. >>>> RE getting the N-best or 1-best sequences-- the programs lattice-nbest >>>> and lattice-1best are relevant here; they output stuff in the regular >>>> lattice format, and you can then put them through lattice-word-align (old >>>> scripts) or lattice-align-words (new scripts), and convert the output to, >>>> say, ctm format-- you can check the scripts for how to convert to ctm >>>> format, it's something like lattice-to-ctm, but there are scripts such as >>>> get_ctm.sh and get_train_ctm.sh in s5. >>>> Dan >>>> >>>> >>>> >>>> On Sat, Mar 9, 2013 at 8:41 PM, Kartik Audhkhasi <aud...@us...>wrote: >>>> >>>>> Hello, >>>>> >>>>> I am working with Kaldi lattices and used the walign_lats.sh script to >>>>> get times on nodes. I am using position dependent phones and believe that >>>>> all required files are in place. However, I see that some frame numbers (in >>>>> units of 10ms) exceed the total length of the file. Do you have any >>>>> suggestions as to what could be going wrong? >>>>> >>>>> Also: Is there an easy way to get the N-best or 1-best sequences with >>>>> word boundaries? >>>>> >>>>> Thanks, >>>>> Kartik >>>>> >>>>> -- >>>>> Kartik Audhkhasi >>>>> >>>>> >>>>> ------------------------------------------------------------------------------ >>>>> Symantec Endpoint Protection 12 positioned as A LEADER in The Forrester >>>>> Wave(TM): Endpoint Security, Q1 2013 and "remains a good choice" in the >>>>> endpoint security space. For insight on selecting the right partner to >>>>> tackle endpoint security challenges, access the full report. >>>>> http://p.sf.net/sfu/symantec-dev2dev >>>>> _______________________________________________ >>>>> Kaldi-developers mailing list >>>>> Kal...@li... >>>>> https://lists.sourceforge.net/lists/listinfo/kaldi-developers >>>>> >>>>> >>>> >>> >> >> >> -- >> Kartik Audhkhasi >> > > -- Kartik Audhkhasi |