2) How many different speakers should record the keyphrase?
3) Is keyphrase spotting becoming less accurate when the number of keyphrases to be analysed increase?
4) And is it becoming slower? With 100 keyphrases? 1000?
5) I am thinking about automating the choice of the threshold. For example, it will try different threshold values for a given keyphrase, and I will chose the level of acceptance I want. Is it a good idea? (I just don't want to lose time by having to do it manually)
6) Is the idea in 5) already coded by someone?
Thank you for your consideration,
Otherend
Last edit: Otherend 2016-05-22
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
2) How many different speakers should record the keyphrase?
Testing allows you to figure out how system will behave in a production. So you need to try to reproduce the production environment as close as possible. If many speakers will use the system, you need many test speakers as well. However, single speaker also gives a reasonable approximation for threshold.
3) Is keyphrase spotting becoming less accurate when the number of keyphrases to be analysed increase?
Yes
4) And is it becoming slower? With 100 keyphrases? 1000?
Yes
6) Is the idea in 5) already coded by someone?
No
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Hello everyone,
I have several questions about spotting keyphrases with pocketsphinx:
1) What is the best way to chose the threshold value for a keyphrase? (there will be many keyphrases)
2) How many different speakers should record the keyphrase?
3) Is keyphrase spotting becoming less accurate when the number of keyphrases to be analysed increase?
4) And is it becoming slower? With 100 keyphrases? 1000?
5) I am thinking about automating the choice of the threshold. For example, it will try different threshold values for a given keyphrase, and I will chose the level of acceptance I want. Is it a good idea? (I just don't want to lose time by having to do it manually)
6) Is the idea in 5) already coded by someone?
Thank you for your consideration,
Otherend
Last edit: Otherend 2016-05-22
This issue is covered in http://cmusphinx.sourceforge.net/wiki/tutoriallm#keyword_lists
Testing allows you to figure out how system will behave in a production. So you need to try to reproduce the production environment as close as possible. If many speakers will use the system, you need many test speakers as well. However, single speaker also gives a reasonable approximation for threshold.
Yes
Yes
No
Thank you !