From: Kartik A. <aud...@us...> - 2013-03-11 16:02:44
Thanks. I understand the OpenFST format but was under the impression that
lattice-align-words will replace FST state IDs with the corresponding frame
numbers.

-- Kartik Audhkhasi
From: Daniel P. <dp...@gm...> - 2013-03-11 15:59:17
cc-ing kaldi-developers so it's archived, in case anyone has similar questions.

You need to understand the OpenFst format. This is the "acceptor" version of
the format, which is (start-state end-state symbol weight), and the weight is
(graph cost, acoustic cost, sequence of transition-ids). [the "normal",
non-acceptor format is start-state end-state input-symbol output-symbol
weight]. The time can be obtained by summing up the number of transition-ids
starting from the beginning of the lattice; in the code it's LatticeStateTimes.

Note that the state numbers are arbitrary in a sense, they contain no real
information. See www.openfst.org for more info on WFSTs.

On Mon, Mar 11, 2013 at 11:55 AM, Kartik Audhkhasi <aud...@us...> wrote:
> Thanks Dan. I have started using the new scripts. However the timing issue
> still remains. I used lattice-align-words to get times on the lattice nodes
> in the same way as is demonstrated in the run.sh script. I think I am not
> interpreting the times correctly. Do the IDs on both start and end nodes
> represent frame numbers? E.g. the first line of my lattice is:
>
> 0 2337 44870 17.0063,2553.48,9468_9482_9492_9491_9491_9491_9491_9910_9909_9909_9916_9924_9194_9210_9242_9241_9241_9241
>
> Does this say that word 44870 goes from frame 0 to frame 2337? The
> transition ID sequence however shows only 18 frames.
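To make the arithmetic concrete, the following is a minimal sketch (not from
the original thread; the archive name "lat.1.gz" and the use of lattice-copy
on an already word-aligned lattice are assumptions) of dumping a lattice in
the text acceptor format Dan describes and counting the transition-ids on each
arc. Every transition-id accounts for one frame, so the arc Kartik quotes
spans 18 frames, and the state numbers 0 and 2337 carry no timing information
by themselves.

    # Sketch only: dump word-aligned lattices in text form and count the
    # transition-ids per arc (field 4 is graph-cost,acoustic-cost,tid_tid_...).
    lattice-copy "ark:gunzip -c lat.1.gz|" ark,t:- \
      | awk 'NF==4 { split($4, w, ","); print "word-id", $3, "spans", split(w[3], t, "_"), "frames" }' \
      | head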
From: Daniel P. <dp...@gm...> - 2013-03-10 16:26:13
And RE how to debug it-- before and after lattice-align-words, you could run
something like lattice-to-post; this program will crash if there are
inconsistent times in the lattice, i.e. the lengths of input-symbol sequences
are not all the same. I suspect you actually mixed something up.
Dan
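As a concrete version of that check, the sketch below (paths and the acoustic
scale are placeholders, not taken from the thread) pipes the lattices through
lattice-to-post and throws the posteriors away; it aborts with an error if any
lattice has inconsistent path lengths.

    # Hypothetical sanity check; run it on the lattices before and after
    # lattice-align-words, and adjust --acoustic-scale to your setup.
    lattice-to-post --acoustic-scale=0.083333 \
      "ark:gunzip -c exp/tri2b/decode/lat.1.gz|" ark:/dev/null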
From: Daniel P. <dp...@gm...> - 2013-03-10 16:24:18
It's a shame that you're using the older versions of the script. Currently the
"s5" scripts are the canonical ones. Your issue with times greater than the
length of the file is very unexpected. This is not the kind of error I would
expect to ever arise.

RE getting the N-best or 1-best sequences-- the programs lattice-nbest and
lattice-1best are relevant here; they output stuff in the regular lattice
format, and you can then put them through lattice-word-align (old scripts) or
lattice-align-words (new scripts), and convert the output to, say, ctm
format-- you can check the scripts for how to convert to ctm format, it's
something like lattice-to-ctm, but there are scripts such as get_ctm.sh and
get_train_ctm.sh in s5.
Dan
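Put together, the pipeline Dan outlines looks roughly like the sketch below
(the directory names and acoustic scale are assumptions, and nbest-to-ctm is
used in place of the "something like lattice-to-ctm" he mentions; in the s5
scripts this is what get_ctm.sh wraps).

    # Hypothetical paths: 1-best path -> word-aligned lattice -> CTM with word names.
    lattice-1best --acoustic-scale=0.083333 \
        "ark:gunzip -c exp/tri2b/decode/lat.1.gz|" ark:- \
      | lattice-align-words data/lang/phones/word_boundary.int exp/tri2b/final.mdl ark:- ark:- \
      | nbest-to-ctm ark:- - \
      | utils/int2sym.pl -f 5 data/lang/words.txt > 1best.ctm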
From: Kartik A. <aud...@us...> - 2013-03-10 02:36:39
Hello,

I am working with Kaldi lattices and used the walign_lats.sh script to get
times on nodes. I am using position dependent phones and believe that all
required files are in place. However, I see that some frame numbers (in units
of 10ms) exceed the total length of the file. Do you have any suggestions as
to what could be going wrong?

Also: Is there an easy way to get the N-best or 1-best sequences with word
boundaries?

Thanks,
Kartik

-- Kartik Audhkhasi
From: Nathan D. <nd...@ca...> - 2013-02-26 21:32:44
Great. I will start there then. I really appreciate your help.

Nathan
From: Daniel P. <dp...@gm...> - 2013-02-26 21:31:25
I don't think you can ever do without a language model, but you could start
off with a simple phone bigram trained on phone sequences extracted from
words.
Dan
From: Nathan D. <nd...@ca...> - 2013-02-26 21:30:19
I'll definitely need children's corpuses regardless.

However, I'm wondering if I can skip the decoding step such that I will not
need a language / word model as I am only trying to match phonemes — not
actual words. Is there a way to do that with this system? My intuition is "no"
and if there was I would probably be losing valuable statistics data, but I
would be more happy to be wrong.

Thanks,

Nathan
From: Daniel P. <dp...@gm...> - 2013-02-26 21:18:56
It seems to me that what you need to do is to create a suitable language model
for sequences of phones. E.g. get examples of the kind of phone sequences that
children doing these exercises will typically produce, and build a language
model on those the same way you would for word sequences. You could accomplish
this using a lexicon that was trivial, with one word for each phone.

It will be difficult to get good results without matched training data, as
children's speech is quite different from adults'.
Dan
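A minimal sketch of that setup follows (the file names and the use of SRILM's
ngram-count are assumptions, not something specified in the thread): the
lexicon maps a one-phone "word" to itself, and the phone-sequence transcripts
are treated as ordinary LM training text.

    # Hypothetical file names: one word per phone, pronounced as that phone.
    awk '{print $1, $1}' phone_list.txt > data/local/dict/lexicon.txt
    # Train a phone bigram on phone-sequence "sentences" (assumes SRILM is installed).
    ngram-count -order 2 -text phone_sequences.txt -lm phone_bigram.arpa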
From: Nathan D. <nd...@ca...> - 2013-02-26 21:16:56
I'm trying to create a tool to recognize spoken phonemes for children's
reading comprehension, i.e., children speaking phonemes only, not the words
and of course not a sentence.

After looking a bit more, it looks like there are a couple of good options:

1 - (thanks Dan) Create a lexicon consisting of just phones, that you can use
at test time - removing the word-position-dependency
2 - Extract phones directly from transitions prior to word alignment (i.e.,
directly from the acoustic model).

For #2 - I would worry that the lack of information might be problematic. The
advantage is that I only need enough data for the acoustic model. Anyway, I
would be very happy to share whatever I do come up with.

Any thoughts on this would be helpful.

Thanks,

Nathan Dunn, PhD.
541-221-2418
CAS Scientific Programmer
http://blogs.uoregon.edu/casspr/
nd...@ca...
From: Arnab G. <ar...@gm...> - 2013-02-21 13:11:30
Yes, I agree that keeping things simple when we first started was a good idea.
And it hasn't been an issue since the executables are generally quite atomic.
But as more people use Kaldi, they may want to put things together in
different ways to suit their need, which may be different from what the
executables are currently providing. In fact, that is how I discovered that
these two configs are registering the same switch.

Things don't crash when this happens. The option that is registered first is
used, and all others are ignored with a warning message.

As I said, it's not a big deal currently. It is easy to change, and I am not
even using that particular switch. But I see this as a potential source of
pain later on.

-Arnab
From: Daniel P. <dp...@gm...> - 2013-02-20 18:26:17
Arnab-- there was a bit of a discussion about this in the early days of Kaldi.
Some (e.g. Karel) felt that we should have some kind of name-qualification for
options. But my feeling was it was more in the spirit of what I had in mind
with Kaldi, to keep things simple and try to ensure that a single program
never had more than one option with the same name. I kind of still think this
is the right way to go. In the particular case you mention, it seems like the
option should always have the same value anyway, so it might be possible to
make it so it sets both to the same value. Is there a situation where you
envisage having both those configs used?

I think there may already be code there to detect that you registered two
configs with the same name (it will crash in this situation).

Dan
From: Arnab G. <ar...@gm...> - 2013-02-20 16:36:18
Hi all,

I just realized that it is possible for options of multiple modules to clash,
since there is no way to uniquely identify an option from a particular class.
For example, TrainingGraphCompilerOptions and WordBoundaryInfoNewOpts both
have a 'reorder' option, which means only one of the 'reorder' options will be
used in an executable that uses both.

While this particular problem is easy to fix, there is a deeper issue with
option parsing that I am mentioning here in case someone has time to fix this.
I couldn't figure out how to use the bug-tracker on SourceForge.

-Arnab
From: Daniel P. <dp...@gm...> - 2013-01-20 17:05:32
What OS are you running on? And what output is it producing?

BTW, there is another way to make the tools, by typing "make".
Dan
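For reference, the alternative Dan mentions amounts to something like the
sketch below (a sketch only; check tools/INSTALL for the exact targets and
options your checkout supports).

    cd kaldi-trunk/tools
    make    # builds the third-party tools; see tools/INSTALL before adding -j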
From: Arnab G. <ar...@gm...> - 2013-01-20 09:10:13
Yes, something is definitely wrong. It should not take more than some tens of
minutes. You have to check the log messages to see where it got stuck and why
it got stuck.
From: 蘇仲銘 <chu...@gm...> - 2013-01-20 06:19:19
Excuse me, how long does the command "./install.sh" take to run? It has been
running for 4 days and has not finished. I am wondering if there is something
wrong.
From: Ho Y. C. <ric...@gm...> - 2012-12-22 17:44:47
Hi, some findings from the kaldi-trunk compilation.

In the file kaldi-trunk/tools/INSTALL, line 150:

  If you get a bug about strncasecmp, you have to modify the file 'sctk-4.0/src/rfilter1/makefile'.

should be:

  If you get a bug about strncasecmp, you have to modify the file 'sctk-2.4.0/src/rfilter1/makefile'.

Compilation of sctk on CentOS is fine without modification, while compilation
with Cygwin requires the above modification.

Compilation of the current version fails in my Cygwin for the file
cudamatrix/cu-math-inl.h with the following error:

  g++ -msse -msse2 -Wall -I.. -DKALDI_DOUBLEPRECISION=0 -DHAVE_POSIX_MEMALIGN -DHAVE_CLAPACK -I ../../tools/CLAPACK_include/ -Wno-sign-compare -Winit-self -I ../../tools/CLAPACK_include/ -I /home/work/kaldi-trunk/tools/openfst/include -O3 -DNDEBUG -c -o nnet-nnet.o nnet-nnet.cc
  In file included from ../cudamatrix/cu-math.h:101:0,
                   from ../nnet/nnet-activation.h:23,
                   from nnet-nnet.cc:20:
  ../cudamatrix/cu-math-inl.h: In function 'void kaldi::cu::Tanh(const kaldi::CuMatrix<Real>&, kaldi::CuMatrix<Real>*) [with Real = float]':
  ../nnet/nnet-activation.h:98:21: instantiated from here
  ../cudamatrix/cu-math-inl.h:164:9: error: 'isinf' was not declared in this scope
  ../base/kaldi-math.h: At global scope:
  ../base/kaldi-math.h:84:17: warning: 'kaldi::kBaseLogZero' defined but not used
  ../base/kaldi-math.h:87:17: warning: 'kaldi::kBaseFloatMax' defined but not used
  <builtin>: recipe for target `nnet-nnet.o' failed
  make[1]: *** [nnet-nnet.o] Error 1
  make[1]: Leaving directory `/home/work/kaldi-trunk/src/nnet'
  Makefile:52: recipe for target `nnet' failed
  make: *** [nnet] Error 2

The modification below works:

  $ svn diff
  Index: cudamatrix/cu-math-inl.h
  ===================================================================
  --- cudamatrix/cu-math-inl.h   (revision 1701)
  +++ cudamatrix/cu-math-inl.h   (working copy)
  @@ -161,7 +161,7 @@
     for(MatrixIndexT r=0; r<x.NumRows(); r++) {
       for(MatrixIndexT c=0; c<x.NumCols(); c++) {
         Real exp_2x = exp(2.0*x(r, c));
  -      if(isinf(exp_2x)) {
  +      if(KALDI_ISINF(exp_2x)) {
           y(r, c) = 1.0;
         } else {
           y(r, c) = (exp_2x - 1.0) / (exp_2x + 1.0);
From: Arnab G. <ar...@gm...> - 2012-11-12 10:28:06
Hi Michael, according to http://www.ldc.upenn.edu/Catalog/docs/LDC93S3B/readme
the rm1_audio1/rm1_audio2 layout is what is expected. In this case, you will
have to write a custom version of local/rm_data_prep.sh. Maybe this list will
help you: http://www.ldc.upenn.edu/Catalog/docs/LDC93S3B/file.tbl

If not, I can send you the training and test utterance IDs.

-Arnab
From: Deisher, M. <mic...@in...> - 2012-11-09 18:30:20
Hi. The web site says:

  Contacting the Kaldi team
  If you would like to ask a question or make a comment about Kaldi or would
  like to be added to a Kaldi-users email list, please send an email to
  kal...@li...

Could you please add me to kaldi-users? Thanks!

--Mike
From: Deisher, M. <mic...@in...> - 2012-11-09 18:24:47
Hi. I'm working through the KALDI tutorial and am trying to get the RM system
up and running. Unfortunately, my RM CDs from LDC are quite old. When I run
the KALDI setup script data_prep/run.sh and point it at my RM directory
(copied from the CDs), it tells me that the directory I specify must have the
folders rm1_audio1 and rm1_audio2 in it. I see no such folders on the CDs.
Perhaps the corpus was re-organized at some point and I didn't get the memo?
(From http://www.ldc.upenn.edu/Catalog/docs/LDC93S3B/disc_1/readme_ind.txt it
seems this is not the case). Could someone please do an ls -laR on it and send
me the output? Thanks!

--Mike
From: Daniel P. <dp...@gm...> - 2012-11-06 15:59:32
Also, you can just add a self-loop at each state (or just at the unigram
backoff state) in the LM, for each noise type, with an appropriate cost. In
IBM I think we did it just at the unigram backoff state (this was for silence
modeling).
Dan
From: Arnab G. <ar...@gm...> - 2012-11-06 11:39:41
Of course you need those words in your LM if you need them in your output. You
can use a single <NOISE> tag like in the WSJ recipe, or use more fine-grained
noise tags like in the Switchboard recipe (or the particular tags you
suggest). Under normal circumstances, you are supposed to remove the noises
from the transcript and output before scoring.
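One way to act on both suggestions (a sketch under assumptions: the tags,
probabilities, and paths are placeholders and are not taken from the thread)
is to give each noise token a pronunciation in the lexicon and a unigram entry
in the ARPA LM before compiling G.fst, so the decoder can hypothesize it
anywhere via the backoff state.

    # Hypothetical lexicon entries, mapping noise tokens to noise phones:
    #   <NOISE>     NSN
    #   [LAUGHTER]  LAU
    # Hypothetical additions to the \1-grams: section of the ARPA file
    # (log10 probabilities are placeholders; update the "ngram 1=" count too):
    #   -3.5  <NOISE>     -0.5
    #   -4.0  [LAUGHTER]  -0.5
    # Recompile the grammar FST (option names vary across Kaldi versions):
    arpa2fst --disambig-symbol=#0 --read-symbol-table=data/lang/words.txt \
      lm_with_noise.arpa data/lang/G.fst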
From: mark li <mar...@gm...> - 2012-11-05 15:39:08
Hi, guys:

I am using Kaldi to decode my data, which includes 5 percent noise words (e.g.
UM, UHMM, Laughter, ++BRTH++, ++Music++, etc.). Those words are not in the LM.
I found that Kaldi is really great except for those noise words. Whenever such
words occur, the decoder produces erroneous output. Is there a way to deal
with these noise words during decoding even if they are not in the LM? It is
almost not possible to model those noise words in the LM because they may
appear anywhere. I mean, is there a way to handle them like Sphinx deals with
filler words?

thanks

Mark
From: Vassil P. <vas...@gm...> - 2012-11-05 10:30:34
I don't know what you mean exactly by dictation application, but if you just
want online decoding you may want to have a look at src/online, src/onlinebin
and egs/voxforge/online_demo. As Arnab said, if you need something more
sophisticated this is probably doable too, but you will need to write it by
yourself.

Vassil

On Mon, Nov 5, 2012 at 11:56 AM, Arnab Ghoshal <ar...@gm...> wrote:
> In principle, yes. Most of the required components are there, but you have
> to write your own solution.
>
> On Fri, Nov 2, 2012 at 9:33 PM, Talat Tüfekçi <tal...@gm...> wrote:
>> Could I use kaldi for a dictation application ?
>>
>> Thanks in advance.
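For a quick start with online decoding, the usual route is something like the
sketch below (a sketch only: the script name and build steps are assumptions
based on the directories Vassil lists; the demo's README describes the exact
prerequisites, such as PortAudio for microphone input).

    # Build the online binaries, then run the VoxForge online demo,
    # which downloads pre-built models on first use.
    cd kaldi-trunk/src/onlinebin && make
    cd ../../egs/voxforge/online_demo && ./run.sh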
From: Shi Hu <fin...@gm...> - 2012-11-05 10:23:24
I got past that part thanks to Dan's help. Basically, gmm-latgen-fast is the
one that eats quite a bit of memory! For me, many decode loops spawn 6
processes (with six iterations), each having around 8 - 10 jobs just doing
gmm-latgen-fast, so the memory runs short quickly. So I made the code run
serially, and each iteration takes about 15 - 20 GB of RAM, but that is okay.

Thanks!

On Mon, Nov 5, 2012 at 2:02 AM, Arnab Ghoshal <ar...@gm...> wrote:
> That's odd! The only memory intensive part of UBM training is the clustering
> step (init-ubm) but to run out of 35G you will need a very very big model
> (not possible for WSJ). Gaussian selection uses fairly low resources. Could
> you send the exact error messages? -Arnab
>
> On Sun, Nov 4, 2012 at 7:57 AM, Shi Hu <fin...@gm...> wrote:
>> Hello
>>
>> I run local/run_mmi_tri2b.sh (this is a step in run.sh for WSJ) on a single
>> machine at Stanford clusters which has 35GB RAM, but I still run out of RAM
>> and swap memory when steps/train_ubm.sh at line 99 is called (doing
>> Gaussian selection).
>>
>> How do I solve this problem?
>>
>> Thanks!
>> Shi
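If you hit the same problem, a less drastic alternative to running everything
serially (a sketch; the paths are placeholders and option names vary across
script versions) is to lower the number of parallel decoding jobs and tighten
the beams so that each gmm-latgen-fast process needs less memory.

    # Hypothetical invocation: fewer parallel jobs and narrower beams.
    steps/decode.sh --nj 2 --beam 11.0 --lattice-beam 4.0 \
      exp/tri2b/graph data/test exp/tri2b/decode_test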