uncertainty margins of kws segment times

Speech Recognition Toolkit

Brought to you by: air, arthchan2003, awb, bhiksha, and 5 others

This project can now be found here.

uncertainty margins of kws segment times

Forum: Help

Creator: Yuval Karon

Created: 2016-11-23

Updated: 2016-11-27

Yuval Karon - 2016-11-23

Hello,

I am extracting keywords from an audio file.
The audio is a sentence containing the words "w1 w2" in sequence.

The decoder identifies these words (as well as some variants)
but returns the following segment start and end times:

w1: a - x w2: y - b

with y > x by three or four frames.

Is this a normal behaviour? I can assume a certain tolerance when processing the segments
further.
- which values are expected with the default arguments?
- Is it possible to control this margin?

Many thanks,
Yuval
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Nickolay V. Shmyrev - 2016-11-23
  
  Is this a normal behaviour? I can assume a certain tolerance when processing the segments
  
  Yes it should be ok. There is no direct restart of the next word after one ends since it's keyword spotting. So it can decide word started a bit later.
  
  Is it possible to control this margin?
  
  I don't think so.
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Yuval Karon - 2016-11-27

I see, thank you.

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Log in to post a comment.