Model file formats

Help
Wout
2007-06-06
2012-09-22
<< < 1 2 (Page 2 of 2)
  • cerisara

    cerisara - 2007-06-14

    Sorry for my last unfinished post - I write too fast and
    hit the wrong key from time to time :-)

    I just wanted to say that I don't want to set up a competition
    between HTK and S4, but I think this kind of interoperability
    is really important, especially as HTK is a kind of standard now.
    If S4 supports the HTK format, then I think many more people will
    use it and benefit from the advantages of both systems (adaptation
    for HTK, large vocabulary for S4, ...)

    Thanks again !
    Best regards,
    Christophe

     
    • Nagendra Kumar Goel

      I fully support your argument and will be happy to help if I can.
      I feel that the Sphinx family of decoders (2, 3, 4) is great
      and has good optimizations, though it needs more work, like everything else.

      An HTK interface can only help by easing the effort required for training,
      so one can focus better on the decoder.

       
      • cerisara

        cerisara - 2007-06-19

        Hello,

        This is a follow-up about loading HTK models...

        Some news: it looks like my HTKLoader is working, but...
        (there's always a but!) I had to make several "assumptions",
        especially with regard to 1ph and 2ph, which are not used in the same way
        in HTK and S4. Actually, it is not very clear to me what the best
        way to handle these 1ph and 2ph is, but at least I have a baseline framework that works.

        There are certainly lots of possible improvements to this framework, but I think that if I try to do them alone, I might miss lots of details.

        Also, I have not yet done a real accuracy comparison, because:
        1- it requires a lot of work: weight tuning, output normalisation, ...
        2- I'm pretty sure S4 results will be lower than Julius' results,
        because of these "assumptions"
        I have just checked that I get reasonable results on a few sentences
        of a large vocabulary task.

        In my code, I tried to respect the S4 philosophy, and it can be applied as a patch, but it would be better to have it in SVN, otherwise
        it will inevitably "diverge" from the S4 code and sooner or later
        no longer be compatible.
        But of course, this first requires at least some unit testing...
        I might think about that if you believe it can be integrated in SVN.

        I don't know how or where to upload/post this patch so that you can have
        a look if you're interested, so please just let me know.

        Thank you !
        Christophe

         
        • Nickolay V. Shmyrev

          Amazing work. But what stops you from publishing the patch? For example, this page has a Tracker/Patches link in the header, where patches to Sphinx could, in theory, be uploaded.

           
          • cerisara

            cerisara - 2007-06-21

            OK, I uploaded it in the "tracker-patch" area.
            I was reluctant to do it because the software is not
            satisfactory yet: accuracy is not as good as expected,
            but I need the help of others to find out why anyway.

            You can look at the beginning of the file HTKLoader.java,
            where I tried to write down all the "assumptions" I had to make.

            Regards,
            Christophe

             