Hello,
Can I update the language model with different domain content without recreating the graph? This is sometimes required on the client side, when clients have different content to transcribe for their domain. If it is possible, please explain how.
The language model is encoded in the graph, so ideally you should recompile
the graph if the LM changes, but there are other ways to do this, e.g.
lattice rescoring. Some of the recipes show how to do this; search for
"rescore" in the run.sh scripts. There are two ways: using an FST-based
LM and using an ARPA LM; the latter is more efficient if you have an
ARPA-style language model. Search for "carpa" in the scripts.
Dan
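For reference, the "carpa" (const-arpa) rescoring Dan mentions looks roughly like this in the Librispeech-style recipes. The directory and LM names below are illustrative; adapt them to your own setup:

```shell
# Build a ConstArpaLm from a gzipped ARPA-format LM.
# Usage: utils/build_const_arpa_lm.sh <arpa-gz> <old-lang-dir> <new-lang-dir>
utils/build_const_arpa_lm.sh data/local/lm/lm_fglarge.arpa.gz \
  data/lang data/lang_test_fglarge

# Rescore lattices that were decoded with the small LM compiled into
# HCLG.fst, replacing its LM scores with the larger const-arpa LM:
steps/lmrescore_const_arpa.sh data/lang_test_tgsmall data/lang_test_fglarge \
  data/dev_clean exp/tri4b/decode_tgsmall_dev_clean \
  exp/tri4b/decode_fglarge_dev_clean
```

This leaves the decoding graph untouched: only the lattices produced by the first pass are rescored with the new LM.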
On Tue, Mar 17, 2015 at 12:23 PM, Ashish Dave badboys4life007@users.sf.net
wrote:
Yes, I found it, using an ARPA LM. But does it affect performance in real-time decoding, and the WER with nnet2, compared to an FST-based LM?
It hasn't been implemented yet in real-time decoding, but it shouldn't be super
hard to do. The WER will depend on how different the LMs are. If they are extremely
different, then it would be better to rebuild the graph.
Dan
On Tue, Mar 17, 2015 at 3:53 PM, Ashish Dave badboys4life007@users.sf.net
wrote:
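When the domain LM differs a lot from the one the graph was built with, rebuilding the graph as Dan suggests is a single call to mkgraph. The lang and model directories below are illustrative:

```shell
# Recompile HCLG.fst using the new domain LM (the G.fst inside the lang dir).
# Usage: utils/mkgraph.sh <lang-dir> <model-dir> <graph-dir>
utils/mkgraph.sh data/lang_test_domain exp/nnet2_online/nnet_a \
  exp/nnet2_online/nnet_a/graph_domain
```

Decoding then points at the new graph directory instead of the old one; the acoustic model itself is unchanged.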