Fillers and Language model

2016-10-11
2016-10-13
  • Habib  Baluwala

    Habib Baluwala - 2016-10-11

    I am trying to build my own language model from some transcripts, and I am wondering whether I should keep the fillers or remove them when building it. For example, which of these is right:
    "Um I have a point to make um that I am the pilot"
    or
    "I have a point to make that I am the pilot"
    Any help is appreciated, and I look forward to your reply.

     
  • Habib  Baluwala

    Habib Baluwala - 2016-10-13

    Another question along the same lines: when building language models, does it matter if I have very long sentences (like 200 words on one line), or is it better to break them up into smaller sentences before training the language model? I look forward to your reply, and thanks for the earlier answer.

     
    • Nickolay V. Shmyrev

      It is better to break them up, and it is important that those breaks match the actual breaks people make when speaking. It is not a trivial task, though.
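
      As a rough illustration of that preprocessing step, here is a minimal Python sketch that writes one utterance per line. It assumes the transcripts carry sentence-final punctuation and uses it as a stand-in for spoken pauses, which is only an approximation. The names `split_utterances`, `strip_fillers`, `FILLERS`, and the `max_words` cap are hypothetical and not part of any toolkit. Filler removal is included only so that both variants of the corpus can be produced and compared.

```python
import re

# Hypothetical filler list (an assumption); adjust it to the transcripts at hand.
FILLERS = {"um", "uh", "er", "hmm"}


def split_utterances(text, max_words=30):
    """Split text at sentence-final punctuation, then cap the chunk length.

    Punctuation is only a proxy for the pauses a speaker actually makes.
    The max_words cap is a crude fallback for unpunctuated runs.
    """
    utterances = []
    for sentence in re.split(r"[.?!]+", text):
        words = sentence.split()
        # Empty fragments produce no chunks because the range is empty.
        for start in range(0, len(words), max_words):
            utterances.append(" ".join(words[start:start + max_words]))
    return utterances


def strip_fillers(utterance, fillers=FILLERS):
    """Return the utterance with filler words removed (case-insensitive)."""
    return " ".join(w for w in utterance.split() if w.lower() not in fillers)
```

      Calling `split_utterances` on a long transcript line and writing each result on its own line gives a corpus of short utterances. Applying `strip_fillers` to each one gives the filler-free variant, so a model can be trained on each and the recognition results compared.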

       
  • Habib  Baluwala

    Habib Baluwala - 2016-10-13

    Thanks, Nickolay. I will do that and check the results. Thanks for the help :)

     
