How is the file en-us-phone.lm.bin built?

Help
2018-10-23
2018-10-25
  • Esteban Lorantino

    The file ./model/en-us/en-us-phone.lm.bin is a binary file. Where does it come from?
    Also, where is the list of phonemes stored in the source code?
    I mean the place where the recogniser loops through the list of phonemes.

     

    Last edit: Esteban Lorantino 2018-10-23
  • Esteban Lorantino

    What will happen if my dictionary file contains phonemes that are not in the CMUBet list, such as AI and OE?

     

    Last edit: Esteban Lorantino 2018-10-23
    • Nickolay V. Shmyrev

      You will see an error about a phone missing in the acoustic model.
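
      To find such phones before running the recogniser, the dictionary can be checked against the model's phone set. The following is a minimal sketch, assuming the usual one-entry-per-line dictionary layout (a word followed by its phones, separated by whitespace). The function name and the sample entries are hypothetical and only serve as illustration.

```python
from typing import Dict, Iterable, Set


def find_unknown_phones(dict_lines: Iterable[str],
                        model_phones: Set[str]) -> Dict[str, Set[str]]:
    """Map each dictionary word to the phones it uses that the model lacks."""
    unknown: Dict[str, Set[str]] = {}
    for line in dict_lines:
        fields = line.split()
        if len(fields) < 2:
            # Blank line or a word without a pronunciation: nothing to check.
            continue
        word, phones = fields[0], fields[1:]
        missing = {p for p in phones if p not in model_phones}
        if missing:
            unknown.setdefault(word, set()).update(missing)
    return unknown


if __name__ == "__main__":
    # Hypothetical data: a small subset of phones and three made-up entries.
    model_phones = {"AE", "K", "M", "R", "T"}
    entries = ["cat K AE T", "mais M AI", "coeur K OE R"]
    for word, missing in find_unknown_phones(entries, model_phones).items():
        print(word, "uses phones not in the model:", sorted(missing))
```

      In practice the phone set would be read from the acoustic model rather than typed in by hand.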

       
  • Esteban Lorantino

    Can I extract the list of phone names from the acoustic model?

     
    • Nickolay V. Shmyrev

      You can look inside the mdef file. If the mdef file is binary, you can convert it to text and look inside.
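
      Once the mdef file is in text form, the base phones can be collected with a few lines of code. This is only a sketch: it assumes the text layout in which each phone line starts with the columns base, lft, rt and p, and in which context-independent phones carry "-" in the lft and rt columns. The function name and the sample excerpt are hypothetical.

```python
from typing import Iterable, List


def base_phones_from_text_mdef(lines: Iterable[str]) -> List[str]:
    """Collect context-independent phone names from a text-format mdef."""
    phones: List[str] = []
    for line in lines:
        fields = line.split()
        # Skip comments and short header lines such as the version number
        # or the "<count> n_base" style counters.
        if len(fields) < 4 or fields[0].startswith("#"):
            continue
        base, left, right = fields[0], fields[1], fields[2]
        # Assumed convention: "-" in both context columns marks a base phone.
        if left == "-" and right == "-" and base not in phones:
            phones.append(base)
    return phones


# Hypothetical excerpt, written for illustration only.
SAMPLE = """\
# hypothetical excerpt
0.3
2 n_base
#base lft  rt p attrib tmat      ... state id's ...
  SIL   -   - - filler    0    0    1    2 N
   AA   -   - - n/a    1    3    4    5 N
   AA  AA  AA s n/a    1    6    7    8 N
"""

if __name__ == "__main__":
    print(base_phones_from_text_mdef(SAMPLE.splitlines()))
```

      For a real model, pass the lines of the converted mdef file instead of SAMPLE.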

       
  • Esteban Lorantino

    Thanks, you're right. I see them at the top of the file: AA^@AE^@AH^@AO^@AW^@AY^@B^@CH^@D^@DH^@EH^@ER^@EY^@F^@G^@HH^@IH^@IY^@JH^@K^@L^@M^@N^@NG^@OW^@OY^@P^@R^@S^@SH^@SIL^@T^@TH^@UH^@UW^@V^@W^@Y^@Z^@ZH.

    I found that the invalid ones (AI, OE, etc.) that I have come from a French dictionary from "LIUM", but I have no idea where this "LIUM dictionary" comes from.
    Do you have any idea about it, and where its source (URL) is?
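
    The ^@ marks in the phone list quoted above are how many editors display NUL bytes, so the names are NUL-separated strings. A minimal sketch of turning such a run of bytes into a list of names follows. It works on an excerpt that has already been cut out of the file and does not parse the binary mdef format itself; the function name is hypothetical.

```python
from typing import List


def split_nul_separated(blob: bytes) -> List[str]:
    """Split a run of NUL-separated ASCII names into a list of strings."""
    return [name.decode("ascii") for name in blob.split(b"\x00") if name]


if __name__ == "__main__":
    # First few names from the list quoted in the post, with real NUL bytes.
    excerpt = b"AA\x00AE\x00AH\x00AO\x00AW\x00AY"
    print(split_nul_separated(excerpt))
```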

     
