Menu

Should we repeat each command 60 times?

Help
rezaee
2016-10-18
2016-10-26
  • rezaee

    rezaee - 2016-10-18

    In the tutorial says:

    1 hour of recording for command and control for single speaker
    

    So, let's consider we can record 40 commands in 1 minute, and we need only these 40 commands. And our system is command and control system for only one user! so we should repeat each of our commands 60times to get one hour of recorded file!

    for example I should record 60times my sound and say "go forward" or "how are you", etc. And do it for each command!

    Am I right?

     
    • Nickolay V. Shmyrev

      You are correct.

       
  • rezaee

    rezaee - 2016-10-22

    Very thank you Nikolay!
    What should be the structure of the "train.transcription" file?

    For example, the wav files from 1 to 60 are the "hello world" recorded sound, so the structure of train.transcription file will looks like as bellow?

    <s>hello worlds</s> (1)
    <s>hello worlds</s> (2)
    <s>hello worlds</s> (3)
    .
    .
    .
    <s>hello worlds</s> (60)
    
     

    Last edit: rezaee 2016-10-22
    • Nickolay V. Shmyrev

      You need space after <s> and before </s>

       <s> hello worlds </s> (1)
      
       
  • rezaee

    rezaee - 2016-10-23

    What should be the structure of train.fileids file?

    struct1:

    1
    2
    3
    

    or
    struct2:

    1
    1
    1
    .
    (60 times)
    .
    1
    2
    2
    (60 times)
    .
    2
    3
    (60 times)
    3
    

    I think I read somewhere in tutorial, it said the number of lines in transcription and fileids file should be equal! So, shall I repaet each number(the name of recorded wav file) for 60 times like structure 2?

     

    Last edit: rezaee 2016-10-23
  • rezaee

    rezaee - 2016-10-24

    May you answer the last question pls

     
    • Arseniy Gorin

      Arseniy Gorin - 2016-10-24

      It would be useful if you reviewed the tutorial once again: http://cmusphinx.sourceforge.net/wiki/tutorialam#data_preparation

      Each line in ".fileids" should have the associated line in ".transcription"

      The transcription file should have a text line and the file name inside braces. The exact structure depends on how you name your files

       
      • rezaee

        rezaee - 2016-10-26

        Thank you!

        Let's consider I have 2 commands and 10 recorded file for each command (1 to 10 for command 1 and 11 to 20 for command 2), ok?
        Should be the structrurs of .fileids and .transcription as bellow?

        .fileids:

        train/1
        train/2
        train/3
        train/4
        train/5
        train/6
        train/7
        train/8
        train/9
        train/10
        train/11
        train/12
        train/13
        train/14
        train/15
        train/16
        train/17
        train/18
        train/19
        train/20
        

        .transcroption:

        <s> command1 </s> (1)
        <s> command1 </s> (2)
        <s> command1 </s> (3)
        <s> command1 </s> (4)
        <s> command1 </s> (5)
        <s> command1 </s> (6)
        <s> command1 </s> (7)
        <s> command1 </s> (8)
        <s> command1 </s> (9)
        <s> command1 </s> (10)
        <s> command2 </s> (11)
        <s> command2 </s> (12)
        <s> command2 </s> (13)
        <s> command2 </s> (14)
        <s> command2 </s> (15)
        <s> command2 </s> (16)
        <s> command2 </s> (17)
        <s> command2 </s> (18)
        <s> command2 </s> (19)
        <s> command2 </s> (20)
        
         
        • Arseniy Gorin

          Arseniy Gorin - 2016-10-26

          yes, the example you provided seems OK

           

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.