Menu

Forced alignment with Sphinx?

Help
2014-02-19
2014-02-19
  • Jamie Bullock

    Jamie Bullock - 2014-02-19

    Hi,

    The problem I am trying to solve is this:

    • I have a collection of audio files containing dialog for an arbitrary number of speakers all in American English
    • The audio files are clean and only contain dialog, nothing else
    • For each audio file I have a text file that is a word-for-word transcript of the dialog
    • I need to label each word in the transcript with its temporal location in the audio file
    • Ideally I also need a temporal location for each phoneme within a word

    My application is written in C, so I need to solve the problem with C/C++ libraries and / or custom code.

    Is this something that Sphinx can be used for?

    I have read online that "Sphinx 4" can be used for this, but it appears to be a Java library.

    Is it possible to solve this problem using Sphinxbase with additional custom C code?

    Sorry if these questions are obvious, I am new to the Sphinx ecosystem!

    Thanks,

    Jamie

     
  • Nickolay V. Shmyrev

    How long each file is?

     

Log in to post a comment.