CMU Sphinx / Forums / Help: pocketsphinx noise cancellation and best method to build an acoustic model?

pocketsphinx noise cancellation and best method to build an acoustic model?

Forum: Help

Creator: ab1984

Created: 2015-08-14

Updated: 2015-08-17

ab1984 - 2015-08-14

Is their any way to neglect any background noises like breathing or cough in pocketsphinx?

If their is any method, how can I do the noise cancellation?

Can I do it using .filler file?

So far I'm building the acoustic model by recording sentences and phrases by pronuncing them word by word and save the audio file for each and every sentence. Is that the better way or should I have to record them word by word and save the audio files seperatly?

Thank You
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

bic-user - 2015-08-14

yes. you can recognize them and discard

yes. Stationary noise cancellation is done durion feature extraction step.

.filler file contains fillers (breath, cough) that are included into your acoustic model

You can record whole thing and than separate audio automatically. But it's easier to record corpus sentence by sentence. You can fix audio or transcription once something goes wrong. To use fillers in your acoustic model you should add them into transcription if they occur in audio.
If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

ab1984 - 2015-08-16

Thank you for the reply bic-user. Still I'm having some confusions,
1. How should I do it?
3. How can I add new fillers to my acoustic model. (For an example Bus Horn Sound)
4. Is their a specific speed of word utterance when recording? Should I have to keep small breaks between two words or just record continuously as we are speaking?

Thank You

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- bic-user - 2015-08-16
  
  In cmusphinx decoders it is done automatically. You can play with -fillprob parameter to control fillers insertion intensivity.
  
  You need to retrain acoustic model adding training utterances with this filler
  
  Just record in the same way, that is expected during recognition.
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

ab1984 - 2015-08-17

Thank you bic-user.
Still no idea how to retrain the acoustic model to recognize such noise utterances. What are steps should I follow? Is their any tutorials to follow?

thank you

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Log in to post a comment.