Menu

How to analyse textfiles with Sphinx

Help
dreaduriel
2009-07-20
2012-09-22
  • dreaduriel

    dreaduriel - 2009-07-20

    Dear Sir or Madam,

    I study maths and informatics at the University in Erlangen, Germany.

    We work on a project to analyse learning processes of pupils.
    The pupils had to solve different tasks with a learning software called “Eval Kara”. In "Eval KAra" they have a ladybeetle which can be controlled over view commands like "go ahead", "go left", "go right", to lay or to put something. The individual techniques they used to solve these tasks are recorded into log-files in form of text-files with separators between the single actions they have executed. Our intention now is to analyse these log-files with Hidden Markov Models, which implies that we have to feed the collected data into the Sphinx.

    Here is an excerpt of one log-file:


    11:45:38 Bild "invertieren" (mittel) 1 5 STATE_ADDED Start 1 0 ""
    11:45:42 Bild "invertieren" (mittel) 1 8 TRANSITION_ADDED Start Start "" ""
    11:45:43 Bild "invertieren" (mittel) 1 12 TRANSITION_INPUT_CHANGED Start onLeaf 1 ""
    11:45:44 Bild "invertieren" (mittel) 1 12 TRANSITION_INPUT_CHANGED Start onLeaf 2 ""
    11:45:45 Bild "invertieren" (mittel) 1 12 TRANSITION_INPUT_CHANGED Start onLeaf 0 ""
    11:45:48 Bild "invertieren" (mittel) 1 12 TRANSITION_INPUT_CHANGED Start onLeaf 1 ""
    11:45:52 Bild "invertieren" (mittel) 1 9 TRANSITION_COMMAND_ADDED Start Start 0 removeLeaf
    11:45:54 Bild "invertieren" (mittel) 1 9 TRANSITION_COMMAND_ADDED Start Start 1 move
    11:46:07 Bild "invertieren" (mittel) 1 8 TRANSITION_ADDED Start Start "" ""
    11:46:07 Bild "invertieren" (mittel) 1 12 TRANSITION_INPUT_CHANGED Start onLeaf 1 ""
    11:46:08 Bild "invertieren" (mittel) 1 12 TRANSITION_INPUT_CHANGED Start onLeaf 2 ""
    11:46:10 Bild "invertieren" (mittel) 1 9 TRANSITION_COMMAND_ADDED Start Start 0 putLeaf
    11:46:12 Bild "invertieren" (mittel) 1 9 TRANSITION_COMMAND_ADDED Start Start 1 move
    11:46:18 Bild "invertieren" (mittel) 3 2 NO_START_STATE_EXCEPTION "" "" "" ""
    11:46:18 Bild "invertieren" (mittel) 0 3 ERROR_STOP "" "" "" ""
    11:46:27 Bild "invertieren" (mittel) 1 4 START_STATE_SET Start "" "" ""
    11:46:30 Bild "invertieren" (mittel) 0 4 GOT_READY "" "" "" ""
    11:46:30 Bild "invertieren" (mittel) 0 6 PLAYING "" "" "" ""
    11:46:30 Bild "invertieren" (mittel) 2 6 WORLD_OBJECT_REMOVED L0 1 1 ""
    11:46:30 Bild "invertieren" (mittel) 0 7 STEPPED "" "" "" ""
    11:46:31 Bild "invertieren" (mittel) 0 7 STEPPED "" "" "" ""
    11:46:32 Bild "invertieren" (mittel) 2 6 WORLD_OBJECT_REMOVED L0 2 1 ""

    The strukture is:

    Timestamp | Taskname | action-id | action-name they executed | different other parameters...

    In a nutshell:
    We don't need to analyse Speech-data. We already have simple recorded textfiles from recorded behaviour of every pupil.
    Now we want to design Hidden Markov Modells to find spezial sequences ( similar to DNA-Sequencing).
    Is this possible with Sphinx? And how should we do this?

    It would be really very very nice hearing from you soon.

    kind regards,

    Sebastian Sossalla,
    University Erlangen

     
    • dreaduriel

      dreaduriel - 2009-07-22

      Thanks for your fast reply.

      Ok. Do you have a suggestion concerning this problem, what HMM implementation is suited for this.

       
    • dreaduriel

      dreaduriel - 2009-07-22

      Thanks for your fast reply.

      Ok. Do you have a suggestion concerning this problem, what HMM implementation is suited for this.

       
    • Nickolay V. Shmyrev

      I don't think CMU sphinx decoders are suitable for this. I suggest you to find another HMM implementation, there are many of them.

       

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.