Menu

sphinx3_align phone duration

Help
Jigar
2012-08-31
2012-09-22
  • Jigar

    Jigar - 2012-08-31

    Hello i am doing Confidence measure in speech recognition. I read a paper in
    which i came up with a method in which phone durations are compared..

    Basically in each utterance i want to extract the phone boundaries.

    I ran sphinx3_align as

    sphinx3_align 
    -hmm model_1/    (dir in which the model is present)
     -dict marathiAgmark1500.dic  (dictionary)
     -fdict 850spkr.filler (filler dict)
     -ctl docs/fileid.txt   (contains the fileIds)
     -insent phone.insent (input transcription file)
     -cepdir features/    (mfcc features)
     -phsegdir phonesegdir/ 
     -phlabdir phonelabdir/
     -stsegdir statesegdir/
     -wdsegdir aligndir/
     -outsent phone.outsent
    

    I ran this and got a error

    INFO: main_align.c(1009): M13MH01A0001I501: 96 input frames
    not in dictionary", line 889: (M13MH01A0001I501)
    ERROR: "main_align.c", line 826: No sentence HMM; no alignment for
    M13MH01A0001I501

    Please help me out.

     
  • Nickolay V. Shmyrev

    In those three words:

    not in dictionary

    which word do you have trouble to understand?

     
  • Jigar

    Jigar - 2012-08-31

    But the word in the transcription is already there in the dictionary..
    This is my phone.insent file

    SOMETHING HERE

    kaapuusa (M13MH01A0001I501)

    and this is the corresponding word in the dict

    SOMETHING HERE

    kaapuusa k aa p u s

    I am not getting this error..

     
  • Nickolay V. Shmyrev

    and this is the corresponding word in the dict "SOMETHING HERE"kaapuusa k aa
    p u s

    It doesn't mean that the word is in dictionary. The word transcription might
    have incorrect phones or might be stripped for some other reason for example
    because file format is wrong. If the word is in the last line you might miss a
    proper newline after it. You should have earlier message about it in the log.
    You need to read the whole log, not just the last line of it. To get faster
    answer share the files you are using. You can do it through dropbox.

     
  • Jigar

    Jigar - 2012-08-31

    Hey thanks a lot.. i got the error.
    Actually My transcription file was from Windows and i read it in linux so an
    extra carriage return was added at the end..

     

Log in to post a comment.