Thread: [Pyparsing] whitespace related question

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

Hello list,

I am stumped by some unexpected behaviour.

I want to parse tables of the following form:
     table = """
         # NAME   # col1 # col2 # col3 ## cola # colb #
         # Test1  # 1    # 2    # 3    ## a    # b    #
         # Test_2 # 4    # 5    # 6    ## c    # d    #
     """

For this, I have specified a TableParser (code follows after this mail).
At first sight, the TableParser does exactly what I want. But I found 
out that parsing stops if one of the table rows contains a space after 
the last "#", and I do not understand why. I expected the p.restOfLine 
to take care of this. This is with pyparsing 1.4.12.

Any ideas?

Best regards,
Stefaan.

import pyparsing as p

identifier = p.Word(p.alphas + "_", p.alphas + p.nums + "_")

col = p.Literal("#").suppress()
list_of_cols = p.delimitedList(p.CharsNotIn("#\n\r"), "#")

left_table_header = col + 
p.ZeroOrMore(identifier).setResultsName("TestColumnName") + col + \ 
list_of_cols.setResultsName("HeaderSetupDataColumns")

right_table_header = 
list_of_cols.setResultsName("HeaderCheckDataColumns") + \
                      p.restOfLine.suppress()

table_header = left_table_header.setResultsName("LeftTableHeader") + \
                p.Literal("##").suppress() + \
                right_table_header.setResultsName("RightTableHeader") + \
                p.lineEnd.suppress()

left_table_row = col + \
                  identifier.setResultsName("TestName") + \
                  col + \
                  list_of_cols.setResultsName("RowSetupDataColumns")

right_table_row = list_of_cols.setResultsName("RowCheckDataColumns") + \
                   p.restOfLine.suppress()

table_row = left_table_row.setResultsName("LeftTableRow") + \
             p.Literal("##").suppress() + \
             right_table_row.setResultsName("RightTableRow") + \
             p.lineEnd.suppress()

TableParser = table_header + \
               p.OneOrMore(p.Group(table_row)).setResultsName("Rows")

Thread: [Pyparsing] whitespace related question

pyparsing-users