Menu

update language model as per different domain content

Addie
2015-03-17
2015-03-17
  • Addie

    Addie - 2015-03-17

    Hello,

    can i update language model with different domain content without recreating graph. as sometimes, it required on client side if they have different content for transcription as per their domain. if possible, then do explain the way

     
    • Daniel Povey

      Daniel Povey - 2015-03-17

      The language model is encoded in the graph so ideally you should recompile
      the graph if the LM changes, but there are other ways to do this, e.g.
      lattice rescoring. Some of the recipes show how to do this, search for
      "rescore" in the run.sh scripts. There are two ways: using a graph-based
      LM and using an ARPA LM; the latter is more efficient if you have an
      ARPA-style language model, search for "carpa" in the scripts.

      Dan

      On Tue, Mar 17, 2015 at 12:23 PM, Ashish Dave badboys4life007@users.sf.net
      wrote:

      Hello,

      can i update language model with different domain content without
      recreating graph. as sometimes, it required on client side if they have
      different content for transcription as per their domain. if possible, then
      do explain the way


      update language model as per different domain content
      https://sourceforge.net/p/kaldi/discussion/1355347/thread/a16ca4fe/?limit=25#2823


      Sent from sourceforge.net because you indicated interest in
      https://sourceforge.net/p/kaldi/discussion/1355347/

      To unsubscribe from further messages, please visit
      https://sourceforge.net/auth/subscriptions/

       
  • Addie

    Addie - 2015-03-17

    yes i found it using an ARPA LM but does it affect on performance with real time decoding and on WER with nnet2 in compare to graph-based LM