Menu

How to time a transcript

Help
2009-07-16
2012-09-22
  • Terry Luedtke

    Terry Luedtke - 2009-07-16

    Hello,

    I'm trying to use an audio file and a text transcript to create a caption file by using Spninx4 to determine the time each word is said. I see that I can get frame numbers from the nodes in the resulting lattice. How do I convert frames into milliseconds? Is there a frame length property somewhere that I am missing?

    Also, is it possible to have the recognizer return a single result set for a two minute audio clip? Right now I get a half dozen results or so, which I can work with, but a single lattice would be easier.

    Thanks,
    Terry Luedtke

     
    • Terry Luedtke

      Terry Luedtke - 2009-07-20

      Thank you Nickolay for the information.

      • Terry Luedtke
       
    • Nickolay V. Shmyrev

      > I see that I can get frame numbers from the nodes in the resulting lattice. How do I convert frames into milliseconds?

      1 frame = 10 milliseconds

      >Also, is it possible to have the recognizer return a single result set for a two minute audio clip? Right now I get a half dozen results or so, which I can work with, but a single lattice would be easier.

      It's bad because of memory usage reasons.

       

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.