Menu

faligner stail predlist failed

Help
Tony
2011-08-01
2012-09-22
  • Tony

    Tony - 2011-08-01

    I had this problem while trying to run falignment>>>sphinx3_align:
    s3_align.c:927: align_build_sent_hmm: Assertion `stail.predlist' failed.
    I read a similar post on this problem dated in may last year, you suggested to
    check for utf-8 bom coding in transcripts, but i think my problem might be
    different cuz training was completed. Could you please suggest what to do?
    Thank you.

    Tony

     
  • Nickolay V. Shmyrev

    Could you please suggest what to do?

    Check utf-8 bom symbol in the beginning of the transcript. sphinx3 should
    break on it. Trainer is more robust to such issues now and could work fine
    despite your file has problems.

     
  • Tony

    Tony - 2011-08-01

    Then while doing the 20 training ci models, this error ERROR: "corpus.c", line
    355: Must be at least one line in the control file occured. What could be the
    cause of this error?

    I'm gonna say all my thanks here for all my posts and your replies so you
    don't have to open the thread just to read thanks from people. Thank you

    Tony

     
  • Nickolay V. Shmyrev

    Then while doing the 20 training ci models, this error ERROR: "corpus.c",
    line 355: Must be at least one line in the control file occured. What could be
    the cause of this error?

    Your previous stage (forced alignment) didn't align any files properly, all
    failed and the control file is empty because of that. You need to check forced
    alignment stage logs to find out what is wrong

    I'm gonna say all my thanks here for all my posts and your replies so you
    don't have to open the thread just to read thanks from people.

    You are welcome

     
  • Tony

    Tony - 2011-08-02

    I used Kate to detect Unicode coding, nothing. I also saved the transcript
    again in kate (I was told it automaticly removes BOM). But I still have

    sphinx3_align: s3_align.c:927: align_build_sent_hmm: Assertion
    `stail.predlist' failed. (What should I do now)
    WARNING tmat.c line 192: Normalization failed failed for tmat 1 from state
    0, 1, 2 (what does this mean)

    Tony

     
  • Nickolay V. Shmyrev

    Share your model folder

     
  • Nickolay V. Shmyrev

    WARNING: "tmat.c", line 192: Normalization failed for tmat 31 from state 0
    WARNING: "tmat.c", line 192: Normalization failed for tmat 31 from state 1
    WARNING: "tmat.c", line 192: Normalization failed for tmat 31 from state 2

    Those warning from the log should make you think. You can find int the
    model_architecture/name.falign.ci_mdef that tmat 31 corresponds to the phone
    SIL.

    The SIL was trained incorrectly because you don't have and in your
    transcription file. Each line in the transcription file must start with
    and must end with
    .

    You should run verification stage before the training. Verification state also
    issued a warning for you about SIL. You should take care about the warnings on
    verification stage before you proceed to training.

     
  • Tony

    Tony - 2011-08-02

    that's right, I removed the and because in the training tutorial it
    said to do so
    http://www.speech.cs.cmu.edu/sphinxman/scriptman1.html#04
    So I should put them back to the transcript and retrain the model?

    Tony

     
  • Tony

    Tony - 2011-08-02

    What could cause the ERROR:"main_align.c" line 765: Final state not reached:
    no alignment for audiofile? How should I solve this? Due to this problem a few
    senones were also not observed.

    failed to open ~falignout/model.alignfiles at 20. ci_hmm at Baum_welch.pl line
    134. Is it due to the above problems?

    Tony

     
  • Nickolay V. Shmyrev

    How should I solve this? Due to this problem a few senones were also not
    observed.

    See

    http://cmusphinx.sourceforge.net/wiki/tutorialam#troubleshooting

    failed to open ~falignout/model.alignfiles at 20. ci_hmm at Baum_welch.pl
    line 134. Is it due to the above problems?

    The file model.alignfiles wasn't created for some reason. For more information
    you need to check training logs in logdir folder.

     
  • Tony

    Tony - 2011-08-03

    The senones are coded with numbers in the log. How can I find out which number
    corresponds to which senone?

    Should I just remove the corresponding phones and words from my files with
    these senones?

    Tony

     
  • Nickolay V. Shmyrev

    How can I find out which number corresponds to which senone?

    Number is senone. If you are looking for corresponding context or for
    corresponding central phoneme you can find this mapping inside mdef file.

     
  • Nickolay V. Shmyrev

    Should I just remove the corresponding phones and words from my files with
    these senones?

    You should better find out why the senones were not represented with enough
    occurences in the training database. Either you need to train with less
    senones or you need to drop rare phones.

     
  • Tony

    Tony - 2011-08-03

    I had fatal error during cd traing before with rare phones, so what I did was
    find all the words containing these phones and turned them into ++words++
    +rare_phone+ in all places and move them to filler dic, and cd training
    completed. Now for falignment I moved these ++word++ from the filler dict to
    dict according to the tutorial.The error log then says these modified
    +rare_phone+ don't exit in the input data. I guess I should put them back into
    the filler dic? If this is the case then what about the instruction for
    falignment asking me to remove the fillers cuz faligner is not good at
    inserting them?

    Tony

     
  • Tony

    Tony - 2011-08-03

    p.s. shouldn't these ++word++ and +rare_phone+ be ignored during decision tree
    building? Other filler phones in dict don't have the above problems.

     
  • Tony

    Tony - 2011-08-04

    "failed to open ~falignout/model.alignfiles at 20. ci_hmm at Baum_welch.pl
    line 134. Is it due to the above problems? "
    The file model.alignfiles wasn't created for some reason. For more information
    you need to check training logs in logdir folder.

    haha, your reply is exactly what was said in the sphinx log but in 20.
    ci_hmm log there was no information regarding that :s

     
  • Nickolay V. Shmyrev

    Now for falignment I moved these ++word++ from the filler dict to dict
    according to the tutorial.

    You are using some obsolete tutorial it seems. You can find all recent and up-
    to-date information on our website

    http://cmusphinx.sourceforge.net/wiki/tutorialam

    p.s. shouldn't these ++word++ and +rare_phone+ be ignored during decision
    tree building?

    They are ignored

     
  • Tony

    Tony - 2011-08-05

    After enabling everything to do with force alignment , I trained and the
    training completed, does it mean falignement is also complete? Which acoustic
    model should I use? What about the model.falign_ci_gaussian models?

    It's strange that when I tried to train another model, it failed because the
    libblas.so.3 doesn't exist. But how come I could train other models no
    problems?

    They are ignored
    Does it correspond to the warning that all the +phones+ senones are not
    observed in the input data?

    Tony

     
  • Nickolay V. Shmyrev

    I trained and the training completed, does it mean falignement is also
    complete?

    It's not related. You need to check logs and faligner folder contents for
    details.

    Which acoustic model should I use?

    This question is covered in tutorial

    What about the model.falign_ci_gaussian models?

    Those are intermediate models used during the training

    It's strange that when I tried to train another model, it failed because the
    libblas.so.3 doesn't exist. But how come I could train other models no
    problems?

    Maybe you installed libblas.so.3 already

    Does it correspond to the warning that all the +phones+ senones are not
    observed in the input data?

    No, it's unrelated.

     
  • Tony

    Tony - 2011-08-08
    1. They way I dropped rare phones are change the word containing the rare phones into a ++word++ filler with filler phone +rare_phone+, but this caused the senones were never observed in the input data warning? would this affect the acoustic model quality? How should I solve this?

    2. After I added more training data, it caused more of the above warnings for phones that were observed before addition of data. Why?

     
  • Tony

    Tony - 2011-08-08
    1. And in 30. cd model logdir norm.log I have couple million (T.T) "gauden.c" line 1554 (mgau=#, feat......) never observed warnings. What would be the cause? It seems training still completed. What Implications do these warnings have? (model quality?)
     
  • Tony

    Tony - 2011-08-08
    1. "Over 500 senones never occur in the input data. This is normal for context-dependent untied senone training or for adaptation, but could indicate a serious problem otherwise"

    How do I find out if it is a serious problem?

     
  • Tony

    Tony - 2011-08-09

    Haven't heard back from you. What do you think about the above problems?

     
  • Nickolay V. Shmyrev

    How do I find out if it is a serious problem?

    Read the documentation about training stages and the meaning of this message
    will become clear.

     

Log in to post a comment.