Menu

uncertainty margins of kws segment times

Help
2016-11-23
2016-11-27
  • Yuval Karon

    Yuval Karon - 2016-11-23

    Hello,

    I am extracting keywords from an audio file.
    The audio is a sentence containing the words "w1 w2" in sequence.

    The decoder identifies these words (as well as some variants)
    but returns the following segment start and end times:

    w1: a - x
    w2: y - b
    

    with y > x by three or four frames.

    Is this a normal behaviour? I can assume a certain tolerance when processing the segments
    further.
    - which values are expected with the default arguments?
    - Is it possible to control this margin?

    Many thanks,
    Yuval

     
    • Nickolay V. Shmyrev

      Is this a normal behaviour? I can assume a certain tolerance when processing the segments

      Yes it should be ok. There is no direct restart of the next word after one ends since it's keyword spotting. So it can decide word started a bit later.

      • Is it possible to control this margin?

      I don't think so.

       
  • Yuval Karon

    Yuval Karon - 2016-11-27

    I see, thank you.

     

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.