Menu

Building a language model - word restriction

Thanasis
2020-09-07
2020-09-07
  • Thanasis

    Thanasis - 2020-09-07

    Hello, im new to speech recognition and trying to understand some of the multiple variables.
    So here is my question:

    does the language model restrict the recognition to its contents (words used to build the language model)
    or can words outside of the LM (but included in the dictionary) still be recognized?

    If the LM does not restrict the recognition and my goal is to
    "recognize natural language but with medical terms",
    would it be better to build my own LM (myLM) and to use it with the en-us dictionary,
    or build a dictionary (myDict) from the same corpus and use the compination myLM and myDict ?

    Thanks in advance

     
  • Nickolay V. Shmyrev

    does the language model restrict the recognition to its contents (words used to build the language model)

    Yes

    or can words outside of the LM (but included in the dictionary) still be recognized?

    No

    would it be better to build my own LM (myLM) and to use it with the en-us dictionary,
    or build a dictionary (myDict) from the same corpus and use the compination myLM and myDict ?

    You have to update both LM and the dictionary

     
  • Thanasis

    Thanasis - 2020-09-07

    Ok, thank you for clearing it out.
    I worked a bit with
    cmuclmtk and the web toolkit and understood in practice what you said ;)

     
  • Thanasis

    Thanasis - 2020-09-07

    edit: post was uploaded twice. sorry

     

    Last edit: Thanasis 2020-09-08

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.