Is their any way to neglect any background noises like breathing or cough in pocketsphinx?
If their is any method, how can I do the noise cancellation?
Can I do it using .filler file?
So far I'm building the acoustic model by recording sentences and phrases by pronuncing them word by word and save the audio file for each and every sentence. Is that the better way or should I have to record them word by word and save the audio files seperatly?
Thank You
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
yes. Stationary noise cancellation is done durion feature extraction step.
.filler file contains fillers (breath, cough) that are included into your acoustic model
You can record whole thing and than separate audio automatically. But it's easier to record corpus sentence by sentence. You can fix audio or transcription once something goes wrong. To use fillers in your acoustic model you should add them into transcription if they occur in audio.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Thank you for the reply bic-user. Still I'm having some confusions,
1. How should I do it?
3. How can I add new fillers to my acoustic model. (For an example Bus Horn Sound)
4. Is their a specific speed of word utterance when recording? Should I have to keep small breaks between two words or just record continuously as we are speaking?
Thank You
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Thank you bic-user.
Still no idea how to retrain the acoustic model to recognize such noise utterances. What are steps should I follow? Is their any tutorials to follow?
thank you
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
Thank You
Thank you for the reply bic-user. Still I'm having some confusions,
1. How should I do it?
3. How can I add new fillers to my acoustic model. (For an example Bus Horn Sound)
4. Is their a specific speed of word utterance when recording? Should I have to keep small breaks between two words or just record continuously as we are speaking?
Thank You
Thank you bic-user.
Still no idea how to retrain the acoustic model to recognize such noise utterances. What are steps should I follow? Is their any tutorials to follow?
thank you