Menu

Extra white space

Help
Rossi Luo
2010-02-20
2012-09-22
  • Rossi Luo

    Rossi Luo - 2010-02-20

    I was stucked here when I was training my own simple acoustic models:

    MODULE: 00 verify training files (2010-02-20 10:07)
    
    O.S. is case sensitive ("A" != "a").
    
    Phones will be treated as case sensitive.
    
    Phase 1: DICT - Checking to see if the dict and filler dict agrees with the phonelist file.
    
    Found 13 words using 16 phones
    passed
    
    Phase 2: DICT - Checking to make sure there are not duplicate entries in the dictionary
    passed
    
    Phase 3: CTL - Check general format; utterance length (must be positive); files exist
    passed
    
    Phase 4: CTL - Checking number of lines in the transcript should match lines in control file
    passed
    
    Phase 5: CTL - Determine amount of training data, see if n_tied_states seems reasonable.
    
    Total Hours Training: 0.00611410256410256
    
    This is a small amount of data, no comment at this time
    WARNING
    
    Phase 6: TRANSCRIPT - Checking that all the words in the transcript are in the dictionary
    
    Words in dictionary: 10
    
    Words in filler dictionary: 3
    
    WARNING: Extra whitespace at end of line 1
    
    WARNING: Extra whitespace at end of line 2
    
    WARNING: Extra whitespace at end of line 3
    
    WARNING: Extra whitespace at end of line 4
    
    WARNING: Extra whitespace at end of line 5
    
    WARNING: Extra whitespace at end of line 6
    
    WARNING: Extra whitespace at end of line 7
    
    WARNING: Extra whitespace at end of line 8
    
    WARNING: Extra whitespace at end of line 9
    
    WARNING: Extra whitespace at end of line 10
    
    WARNING: Extra whitespace at end of line 11
    
    WARNING: Extra whitespace at end of line 12
    
    WARNING: Extra whitespace at end of line 13
    
    WARNING: Extra whitespace at end of line 14
    
    WARNING: Extra whitespace at end of line 15
    
    WARNING: Extra whitespace at end of line 16
    
    WARNING: Extra whitespace at end of line 17
    
    WARNING: Extra whitespace at end of line 18
    
    WARNING: Extra whitespace at end of line 19
    
    WARNING: Extra whitespace at end of line 20
    
    WARNING: Extra whitespace at end of line 21
    
    WARNING: Extra whitespace at end of line 22
    
    WARNING: Extra whitespace at end of line 23
    
    WARNING: Extra whitespace at end of line 24
    
    WARNING: Extra whitespace at end of line 25
    
    WARNING: Extra whitespace at end of line 26
    
    WARNING: Extra whitespace at end of line 27
    
    WARNING: Extra whitespace at end of line 28
    
    WARNING: Extra whitespace at end of line 29
    
    WARNING: Extra whitespace at end of line 30
    
    WARNING: Extra whitespace at end of line 31
    
    WARNING: Extra whitespace at end of line 32
    
    WARNING: Extra whitespace at end of line 33
    
    WARNING: Extra whitespace at end of line 34
    
    WARNING: Extra whitespace at end of line 35
    
    WARNING: Extra whitespace at end of line 36
    
    WARNING: Extra whitespace at end of line 37
    
    WARNING: Extra whitespace at end of line 38
    
    WARNING: Extra whitespace at end of line 39
    FAILED
    
    Phase 7: TRANSCRIPT - Checking that all the phones in the transcript are in the phonelist, and all phones in the phonelist appear at least once
    passed
    

    This is my fileids file:

    1/1_1

    1/1_2

    1/1_3

    1/1_4

    1/1_5

    1/1_6

    1/1_7

    1/1_8

    1/1_9

    1/1_10

    2/2_1

    2/2_2

    2/2_3

    2/2_4

    2/2_5

    2/2_6

    2/2_7

    2/2_8

    2/2_9

    2/2_10

    3/3_1

    3/3_2

    3/3_3

    3/3_4

    3/3_5

    3/3_6

    3/3_7

    3/3_8

    3/3_9

    3/3_10

    4/4_1

    4/4_2

    4/4_3

    4/4_4

    4/4_5

    4/4_6

    4/4_7

    4/4_8

    4/4_9

    4/4_10

    and this is my transcription file:

    1 (1_1)

    2 (1_2)

    3 (1_3)

    4 (1_4)

    5 (1_5)

    6 (1_6)

    7 (1_7)

    8 (1_8)

    9 (1_9)

    10 (1_10)

    1 (2_1)

    2 (2_2)

    3 (2_3)

    4 (2_4)

    5 (2_5)

    6 (2_6)

    7 (2_7)

    8 (2_8)

    9 (2_9)

    10 (2_10)

    1 (3_1)

    2 (3_2)

    3 (3_3)

    4 (3_4)

    5 (3_5)

    6 (3_6)

    7 (3_7)

    8 (3_8)

    9 (3_9)

    10 (3_10)

    1 (4_1)

    2 (4_2)

    3 (4_3)

    4 (4_4)

    5 (4_5)

    6 (4_6)

    7 (4_7)

    8 (4_8)

    9 (4_9)

    10 (4_10)

    I am sure there is no any extra white space at the end of transcription file,
    what's wrong with this?
    Many thanks.

     
  • Nickolay V. Shmyrev

    Your files have Windows-style line endings (CR+LF), you need to remove CR and
    convert to Unix style line endings (LF only). This can be done with proper
    text editor for example.

     

Log in to post a comment.

Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.