pyparsing-users Mailing List for Python parsing module (Page 6)

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

You might look at one of the variations on parsing that pyparsing
expressions can do.  

The typical parser case is one which the parser handles all the input text.
It requires the most work because it has to handle everything in the input.

You can also write a pyparsing parser that only matches part of the input
file, and then scan or search for just those parts. I think this may be
suitable for your case. Look over the following code and see how
searchString and scanString return the matching lines, and how with
scanString (which returns a Python generator - if you're not familiar with
these, look it up), you can pull out the text between parses, since
scanString returns not only the matching text, but also the start and end
locations.

-- Paul

from pyparsing import *

line_of_words = OneOrMore(Word(alphas))

inputText = """\
sldjf lskjflsja lasdfljsdf owiuerowue ndf
122
1203 080182 0123 1023021 013802
02108

aslkjweoiur olsuaperu lsfiwuer  kfdsldf
293749237
029 927397 2979 29793732974
9237
82739

sjfdhhwl oewr lwkejrlj wlehrnmb
34982 9392
"""

# find all groups of words using searchString
for line in line_of_words.searchString(inputText):
    print line

# prints:
# ['sldjf', 'lskjflsja', 'lasdfljsdf', 'owiuerowue', 'ndf']
# ['aslkjweoiur', 'olsuaperu', 'lsfiwuer', 'kfdsldf']
# ['o']
# ['sjfdhhwl', 'oewr', 'lwkejrlj', 'wlehrnmb']

# find all groups and their start/end locations using scanString
for line,start,end in line_of_words.scanString(inputText):
    print line

# prints:
# ['sldjf', 'lskjflsja', 'lasdfljsdf', 'owiuerowue', 'ndf']
# ['aslkjweoiur', 'olsuaperu', 'lsfiwuer', 'kfdsldf']
# ['o']
# ['sjfdhhwl', 'oewr', 'lwkejrlj', 'wlehrnmb']

# use scanString to associate intervening text with matched line
parsedData = []
scanner = line_of_words.scanString(inputText)
lastLine,lastStart,lastEnd = next(scanner)
for line, start, end in scanner:
    parsedData.append((lastLine, inputText[lastEnd:start].splitlines()))
    lastLine,lastEnd = line,end

# add final group after last parsed line
parsedData.append((lastLine, inputText[lastEnd:].splitlines()))

for line,data in parsedData:
    print '-', ' '.join(line)
    for d in data:
        print ' ', d

# prints
#- sldjf lskjflsja lasdfljsdf owiuerowue ndf
#  
#  122
#  1203 080182 0123 1023021 013802
#  02108
#  
#- aslkjweoiur olsuaperu lsfiwuer kfdsldf
#  
#  293749237
#  029 927397 2979 29793732974
#  9237
#- o
#  82739
#  
#- sjfdhhwl oewr lwkejrlj wlehrnmb
#  
#  34982 9392
#

-----Original Message-----
From: Hanchel Cheng [mailto:han...@br...] 
Sent: Tuesday, November 05, 2013 7:15 PM
To: pyp...@li...
Subject: [Pyparsing] Using grammar as a condition for loop

Hello!

I have a text file in a structure like this:
######start#######
[line1 matching grammar]
#[text]
#[text]
                [text]

[line2 matching grammar]
                #[text]
                [etc.]
#######end#######
There can be N amounts of lines with or without the # under each indent with
a line that matches the grammar.

I'm checking for the grammar, then I would like to check all the lines until
the next line that follows the grammar.

Something like...
for line in text_file:
                if not(line matches grammar):
                                do something

Can pyparsing do this? If not, any suggestions? I can give more info if
necessary.

I really appreciate the help!

Kind regards,
Hanchel
----------------------------------------------------------------------------
--
November Webinars for C, C++, Fortran Developers Accelerate application
performance with scalable programming models. Explore techniques for
threading, error checking, porting, and tuning. Get the most from the latest
Intel processors and coprocessors. See abstracts and register
http://pubads.g.doubleclick.net/gampad/clk?id=60136231&iu=/4140/ostg.clktrk
_______________________________________________
Pyparsing-users mailing list
Pyp...@li...
https://lists.sourceforge.net/lists/listinfo/pyparsing-users

---
This email is free from viruses and malware because avast! Antivirus protection is active.
http://www.avast.com

2004	Jan	Feb	Mar (1)	Apr	May (1)	Jun	Jul	Aug (2)	Sep	Oct	Nov (2)	Dec
2005	Jan (2)	Feb	Mar (2)	Apr (12)	May (2)	Jun	Jul	Aug (12)	Sep	Oct (1)	Nov	Dec
2006	Jan (5)	Feb (1)	Mar (10)	Apr (3)	May (7)	Jun (2)	Jul (2)	Aug (7)	Sep (8)	Oct (17)	Nov	Dec (3)
2007	Jan (4)	Feb	Mar (10)	Apr	May (6)	Jun (11)	Jul (1)	Aug	Sep (19)	Oct (8)	Nov (32)	Dec (8)
2008	Jan (12)	Feb (6)	Mar (42)	Apr (47)	May (17)	Jun (15)	Jul (7)	Aug (2)	Sep (13)	Oct (6)	Nov (11)	Dec (3)
2009	Jan (2)	Feb (3)	Mar	Apr	May (11)	Jun (13)	Jul (19)	Aug (17)	Sep (8)	Oct (3)	Nov (7)	Dec (1)
2010	Jan (2)	Feb	Mar (19)	Apr (6)	May	Jun (2)	Jul	Aug (1)	Sep	Oct (4)	Nov (3)	Dec (2)
2011	Jan (4)	Feb	Mar (5)	Apr (1)	May (3)	Jun (8)	Jul (6)	Aug (8)	Sep (35)	Oct (1)	Nov (1)	Dec (2)
2012	Jan (2)	Feb	Mar (3)	Apr (4)	May	Jun (1)	Jul	Aug (6)	Sep (18)	Oct	Nov (1)	Dec
2013	Jan (7)	Feb (7)	Mar (1)	Apr (4)	May	Jun	Jul (1)	Aug (5)	Sep (3)	Oct (11)	Nov (3)	Dec
2014	Jan (3)	Feb (1)	Mar	Apr (6)	May (10)	Jun (4)	Jul	Aug (5)	Sep (2)	Oct (4)	Nov (1)	Dec
2015	Jan	Feb	Mar	Apr (13)	May (1)	Jun	Jul (2)	Aug	Sep (9)	Oct (2)	Nov (11)	Dec (2)
2016	Jan	Feb (3)	Mar (2)	Apr	May	Jun	Jul (3)	Aug	Sep	Oct (1)	Nov (1)	Dec (4)
2017	Jan (2)	Feb (2)	Mar (2)	Apr	May	Jun	Jul (4)	Aug	Sep	Oct (4)	Nov (3)	Dec
2018	Jan (10)	Feb	Mar (1)	Apr	May	Jun (1)	Jul	Aug	Sep	Oct (2)	Nov	Dec
2019	Jan	Feb	Mar	Apr	May	Jun (2)	Jul	Aug	Sep	Oct	Nov	Dec
2020	Jan	Feb (1)	Mar	Apr	May (1)	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2022	Jan	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec (1)
2023	Jan	Feb	Mar	Apr (1)	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2024	Jan	Feb (1)	Mar	Apr (1)	May	Jun	Jul (1)	Aug (3)	Sep (1)	Oct (1)	Nov	Dec

pyparsing-users Mailing List for Python parsing module (Page 6)

pyparsing-users — User notes and help on the pyparsing module