Menu

GET CTM FILE

Delion
2015-05-05
2015-05-05
  • Delion

    Delion - 2015-05-05

    I'm using /egs/wsj to get the ctm file for NIST scoring tools.
    With the /steps/get_train_ctm.sh script, I can get a final ctm file for the whole training data; while using /steps/get_ctm.sh script, I got several ctm files under $decode_dir/score_( is from minlmwt to maxlmwt). In each ctm file, the content is just one single line like:
    "440c0401 1 0.87 0.41"
    without any words behind the during time, and these ctm files only include the same file(like 440c0401), totally different from the get_train_ctm.sh performed ctm file.
    Each file in my eval set includes only one utterance, and the eval set is directly extracted from the training data.
    I wonder how to get the "real" ctm file I need for the eval set, as get_train_ctm.sh does for the training dataset. Did I miss someting or did it wrong?

     
  • Guoguo Chen

    Guoguo Chen - 2015-05-05

    The multiple directories from steps/get_ctm.sh are fine -- they are just decodings with different LM scale. Likely you have some file mismatch. It will be helpful if you check the logs at $your_decoding_dir/scoring/log/get_ctm.*.log, and see if there are warnings or errors.

    Guoguo

     
    • Daniel Povey

      Daniel Povey - 2015-05-05

      That output is consistent with what Guoguo said, i.e. a lexicon
      mismatch that caused int2sym.pl to die. I see now that get_ctm.sh and
      get_train_ctm.sh are not properly detecting that error due to not
      setting pipefail for the pipe that had a problem, and I just committed
      a fix for this that should make this problem cause a failure earlier
      on.
      Dan

      On Tue, May 5, 2015 at 7:40 AM, Guoguo Chen chenguoguo@users.sf.net wrote:

      The multiple directories from steps/get_ctm.sh are fine -- they are just
      decodings with different LM scale. Likely you have some file mismatch. It
      will be helpful if you check the logs at
      $your_decoding_dir/scoring/log/get_ctm.*.log, and see if there are warnings
      or errors.

      Guoguo


      GET CTM FILE


      Sent from sourceforge.net because you indicated interest in
      https://sourceforge.net/p/kaldi/discussion/1355347/

      To unsubscribe from further messages, please visit
      https://sourceforge.net/auth/subscriptions/