Menu

Generate a list of OOV words

Help
Aanand P
2020-06-15
2020-06-16
  • Aanand P

    Aanand P - 2020-06-15

    Hello. I am trying to adapt an acoustic model but unfortunately, the transcripts of my audio files contains words that are not in the existing CMU dictionary. I would like to extend the exitsting dictionary but for that I will have to create a text file containing all the words that I would like to include in the dictionary. My transcription file is huge and there are lot of words that is not in the default dictionary. How can I compare my transcription file with the existing dictionary? Is there a tool by cmusphinx to create a text file when the word wasn't found in the existing dictionary? Thanks

     
    • Nickolay V. Shmyrev

      You can write a script in Python

       
  • Aanand P

    Aanand P - 2020-06-16

    I have no idea how to acheive this with python. Is there any reference script where I can take a look?

     

    Last edit: Aanand P 2020-06-16

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.