pyparsing-users Mailing List for Python parsing module (Page 30)
From: Tom W. <tom...@gm...> - 2006-03-07 01:06:42
|
Cool, I was afraid that I was missing something simple, some sort of 'splitOn('?')' function. Glad to see that I'm not quite that dense. That example works well, and urlparse looks rather interesting too. I may go with it for now; it seems a little less wordy in this case (Python seems to be well oriented to support my innate desire to type less).

BTW, the examples are the best part about pyparsing. While the docs are fairly clear as well, the examples made it a real breeze to dig in. Now I guess I just need to spend some serious time going through the Python library to see what other nifty gems I'm missing.

Thx,
Tom
|
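For reference, here is a minimal sketch of the stdlib urlparse route that both posters mention (not code from the thread; the module name assumes Python 2 of that era, and the same function lives in urllib.parse on Python 3):

    from urlparse import urlparse

    scheme, netloc, path, params, query, fragment = urlparse(
        "http://11.11.111.11/adframe.php?n=ad1f311a&what=zone:56")
    print(path)    # -> /adframe.php
    print(query)   # -> n=ad1f311a&what=zone:56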
From: Paul M. <pa...@al...> - 2006-03-06 21:19:51
|
Tom -

Thanks for the glowing compliments on pyparsing! For your immediate question, standard Python includes a module called urlparse that may be sufficient for you.

On the other hand, if you are set on using a pure-pyparsing solution, I looked at the source for urlparse a while ago, and came up with this (patterned after urlparse's strange logic):

    scheme_chars = alphanums + "+-."
    urlscheme = Word( scheme_chars )
    netloc_chars = "".join( [ c for c in printables if c not in "/." ] )
    netloc = delimitedList( Word( netloc_chars ), ".", combine=True )
    path_chars = "".join( [ c for c in printables if c not in "?" ] )
    path = Word( path_chars )
    query_chars = "".join( [ c for c in printables if c not in "#" ] )
    query = Word( query_chars )
    fragment = Word( printables+" " )
    _urlBNF = Combine( Optional(urlscheme.setResultsName("scheme") + ":" ) +
                       Optional(Literal("//").suppress() + netloc,
                                default="").setResultsName("netloc") +
                       Optional(path.setResultsName("path"), default="") +
                       Optional(Literal("?").suppress() + query,
                                default="").setResultsName("query") +
                       Optional(Literal("#").suppress() + fragment,
                                default="").setResultsName("fragment") )

Using your test string, I wrote the following test code:

    testurl = "http://11.11.111.11/adframe.php?n=ad1f311a&what=zone:56"
    urlParts = _urlBNF.parseString(testurl)
    print testurl
    for k in urlParts.keys():
        print "urlParts.%s = %s" % (k,urlParts[k])

Giving:

    http://11.11.111.11/adframe.php?n=ad1f311a&what=zone:56
    urlParts.fragment =
    urlParts.path = /adframe.php
    urlParts.scheme = http
    urlParts.netloc = 11.11.111.11
    urlParts.query = n=ad1f311a&what=zone:56

I hope this gets you going - let us know!

Regards,
-- Paul
|
From: Tom W. <tom...@gm...> - 2006-03-06 17:58:37
|
Hi all,

First off, thanks for this wonderful module. I was able to extend the httpserverlogparser.py example to do 90% of what I need in a matter of minutes, with a bare minimum of Python experience. I can see using PyParsing a lot moving forward.

Stuck on one last little bit though. Given a fairly standard combined log from apache with the form:

    www.domain.com 11.111.11.111 - - [16/Feb/2004:10:35:12 -0800] "GET /ads/redirectads/468x60redirect.htm?foo=bar&bar=foo HTTP/1.1" 200 541 "http://11.11.111.11/adframe.php?n=ad1f311a&what=zone:56" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1) Opera 7.20 [ru\"]"

I've added support for the virtualhost at the start of the log, and have split the http action, request and http version into separate entities. What I want to do now is split the query off from the url in the request.

    request = Word( printables )

works to grab the whole request, URL and query combined, but everything I've tried thus far to split on the (optional) ? that starts a get query has failed. Basically, I think what I'm trying to get is "everything up to the question mark, if it's there, otherwise everything til the next field".

For this case, I'm actually going to be just throwing the query away, so doing anything of note with it really doesn't matter right now.

I know I'll feel dumb for asking this as soon as I see the answer, but a gentle nudge in the right direction would be greatly appreciated.

Tom
|
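One simple way to get that "everything up to the question mark" behavior (an illustrative sketch, not the answer given in the thread; the variable names are just for illustration) is to build the request word from every printable character except '?', then add an optional, suppressed '?' plus the query (results shown as printed by a recent pyparsing):

    from pyparsing import Word, printables, Optional, Suppress

    url_chars = "".join( [ c for c in printables if c != "?" ] )
    url = Word(url_chars).setResultsName("url")
    query = Word(printables).setResultsName("query")
    request = url + Optional(Suppress("?") + query)

    print(request.parseString("/adframe.php?n=ad1f311a&what=zone:56"))
    # -> ['/adframe.php', 'n=ad1f311a&what=zone:56']
    print(request.parseString("/index.html"))
    # -> ['/index.html']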
From: Tim C. <ti...@ea...> - 2006-02-24 00:55:14
|
Hello,

I am starting to use pyparsing as a way to input delimited data into my scripts. I want to get the next integer and haven't figured out how to do it. Here is what I have been able to come up with for the next float:

----
p_point = Literal('.')
p_plusorminus = Optional(Literal('+') | Literal('-'))
p_number = Word(nums)
NextFloat = SkipTo(Group(p_plusorminus + Optional(p_number) + p_point + p_number)) + Word(nums + 'eE-+.')
----

The next integer has perplexed me though. Here is what I want:

    '34' would parse as (['', '34'], {})
    '45.3 23' would parse as (['45.3 ', '23'], {})
    '4.5 4.7e11 10/12/2006' would parse as (['4.5 4.7e11 ', '10'], {})

As an example of how I hope to use NextInteger...

    p_string = NextInteger + NextInteger
    p_string.parseString('4.5 4.7e11 10/12/2006')
    (['4.5 4.7e11 ', '10', '/', '12'], {})

Thank you for pyparsing. I have just scratched the surface and I am really impressed.

Kindest regards,
Tim Cera
|
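One possible way to define that "next integer" (purely a suggestion, not from the thread) is to let a regular expression reject digit runs that are part of a float or exponent, and pair it with SkipTo just as in the NextFloat definition above (results shown as printed by a recent pyparsing):

    from pyparsing import Regex, SkipTo

    # a run of digits not preceded by a digit, '.', 'e' or 'E',
    # and not followed by a digit or '.'
    p_integer = Regex(r"(?<![\d.eE])\d+(?![.\d])")
    NextInteger = SkipTo(p_integer) + p_integer

    print(NextInteger.parseString("45.3 23"))
    # -> ['45.3 ', '23']
    print(NextInteger.parseString("4.5 4.7e11 10/12/2006"))
    # -> ['4.5 4.7e11 ', '10']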
From: Paul M. <pa...@al...> - 2006-01-30 19:45:50
|
This is a known problem with pyparsing 1.4: I accidentally introduced a generator expression in the new QuotedString class. I have a corrected version that I can release as 1.4.1; I was hoping to bundle in any other code fixes too. Since you are already the second person to have this problem, I'll go ahead and release 1.4.1 as it currently stands.

If you need an immediate fix in the next 10 minutes, make this change to your pyparsing code. Lines 1220-1, replace:

-------------
    '|(' + ')|('.join("%s[^%s]" % (re.escape(self.quoteChar[:i]),
                                   _escapeRegexRangeChars(self.quoteChar[i]))
                      for i in range(len(self.quoteChar)-1,0,-1)) + ')'
-------------

with:

-------------
    '|(' + ')|('.join(["%s[^%s]" % (re.escape(self.quoteChar[:i]),
                                    _escapeRegexRangeChars(self.quoteChar[i]))
                       for i in range(len(self.quoteChar)-1,0,-1)]) + ')'
-------------

(Note the inserted []'s that change the generator expression to a list comprehension, supported under Py2.3.)

Sorry for being so sloppy!
- Paul
|
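For context, the incompatibility is simply that generator expressions were added in Python 2.4, so the unbracketed form does not even compile on 2.3 (a quick illustration, not from the thread):

    print(", ".join(str(i) for i in range(3)))    # generator expression: SyntaxError on Python 2.3
    print(", ".join([str(i) for i in range(3)]))  # list comprehension: works on 2.3 and later -> 0, 1, 2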
From: Stephen W. <go...@co...> - 2006-01-30 18:35:19
|
I get the following error:

    waterbug@bigboote:/usr/local/src/Python/pyparsing/pyparsing-1.4$ python setup.py install
    Traceback (most recent call last):
      File "setup.py", line 6, in ?
        from pyparsing import __version__
      File "/usr/local/src/Python/pyparsing/pyparsing-1.4/pyparsing.py", line 1221
        _escapeRegexRangeChars(self.quoteChar[i])) for i in range(len(self.quoteChar)-1,0,-1)) + ')'
        ^
    SyntaxError: invalid syntax

--------------------------------------------

I'm on Debian testing, running the standard debian python package:

    Python 2.3.5 (#2, Aug 30 2005, 15:50:26)
    [GCC 4.0.2 20050821 (prerelease) (Debian 4.0.1-6)] on linux2

Steve
|
From: Paul M. <pa...@al...> - 2006-01-02 05:56:27
|
Here's a word-to-value converter, just as a sample. This is the kind of problem that is most suitable for pyparsing. I'll include it in the examples directory for the next release.

-- Paul

    # wordsToNum.py
    # Copyright 2006, Paul McGuire
    #
    # Sample parser grammar to read a number given in words, and return the numeric value.
    #
    from pyparsing import *

    def makeNumericParseAction(val):
        return lambda s,l,t: val

    def makeLit(s,val):
        return CaselessLiteral(s).setName(s).setParseAction( makeNumericParseAction(val) )

    unitDefinitions = [
        ("zero", 0), ("one", 1), ("two", 2), ("three", 3), ("four", 4),
        ("five", 5), ("six", 6), ("seven", 7), ("eight", 8), ("nine", 9),
        ("ten", 10), ("eleven", 11), ("twelve", 12), ("thirteen", 13),
        ("fourteen", 14), ("fifteen", 15), ("sixteen", 16), ("seventeen", 17),
        ("eighteen", 18), ("nineteen", 19),
        ]
    units = Or( [ makeLit(s,v) for s,v in unitDefinitions ] )

    tensDefinitions = [
        ("ten", 10), ("twenty", 20), ("thirty", 30), ("forty", 40), ("fifty", 50),
        ("sixty", 60), ("seventy", 70), ("eighty", 80), ("ninety", 90),
        ]
    tens = Or( [ makeLit(s,v) for s,v in tensDefinitions ] )

    hundreds = makeLit("hundred", 100)

    majorDefinitions = [
        ("thousand", int(1e3)), ("million", int(1e6)), ("billion", int(1e9)),
        ("trillion", int(1e12)), ("quadrillion", int(1e15)), ("quintillion", int(1e18)),
        ]
    mag = Or( [ makeLit(s,v) for s,v in majorDefinitions ] )

    def wordprod(s,l,t):
        ret = 1
        for v in t:
            ret *= v
        return ret

    def wordsum(s,l,t):
        return sum(t)

    and_ = Suppress(makeLit("and",0))

    numPart = (((( units + Optional(hundreds) ).setParseAction(wordprod) +
                  Optional(and_) +
                  Optional(tens)).setParseAction(wordsum) ^ tens ) +
               Optional(units) ).setParseAction(wordsum)
    numWords = OneOrMore( (numPart + Optional(mag)).setParseAction(wordprod) +
                          Optional(and_) ).setParseAction(wordsum)
    numWords.ignore("-")

    print numWords.parseString("one hundred twenty")
    print numWords.parseString("one hundred and twenty")
    print numWords.parseString("one hundred and three")
    print numWords.parseString("one hundred twenty-three")
    print numWords.parseString("one hundred and twenty three")
    print numWords.parseString("one hundred twenty three million")
    print numWords.parseString("one hundred and twenty three million")
    print numWords.parseString("one hundred twenty three million and three")
    print numWords.parseString("fifteen hundred and sixty five")
    print numWords.parseString("zero")
|
From: Paul M. <pa...@al...> - 2006-01-02 04:17:30
|
Timmy -

I don't think pyparsing will address this task any better than just straight Python. Now if you want to parse a word expression like "one hundred and twenty three dollars and sixty seven cents" and get back the value 123.67, then I think pyparsing might be of interest.

I just googled for such a number-to-words function in Python, and I didn't readily find one, but here is a URL for a Java version that looks fairly convertible to Python: http://www.sourcecodesworld.com/howto/java/java-0426.asp.

-- Paul
|
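A minimal sketch of the "straight Python" route (my own illustration, not code from the thread), handling amounts below one thousand dollars:

    UNITS = ["zero", "one", "two", "three", "four", "five", "six", "seven",
             "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
             "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
    TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
            "eighty", "ninety"]

    def two_digits(n):
        # 0..99 in words
        if n < 20:
            return UNITS[n]
        tens, units = divmod(n, 10)
        if units:
            return TENS[tens] + " " + UNITS[units]
        return TENS[tens]

    def below_thousand(n):
        # 0..999 in words
        if n < 100:
            return two_digits(n)
        hundreds, rest = divmod(n, 100)
        words = UNITS[hundreds] + " hundred"
        if rest:
            words += " and " + two_digits(rest)
        return words

    def dollars_to_words(amount):
        dollars = int(amount)
        cents = int(round((amount - dollars) * 100))
        return "%s dollars and %s cents" % (below_thousand(dollars), two_digits(cents))

    print(dollars_to_words(123.67))
    # -> one hundred and twenty three dollars and sixty seven cents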
From: Timmy <ti...@ne...> - 2006-01-02 01:51:49
|
Hello,

I'm wondering whether pyparsing can do the job of a dollar-value-to-words converter. I'm writing a program to convert a dollar value such as $123.67 to a word description. That means converting it to "one hundred and twenty three dollars and sixty seven cents". Do you think it is an easy or hard task? Can anyone give me hints or a pointer on how to do so?
|
From: Paolo L. <p....@hy...> - 2005-10-04 11:22:56
|
Hi Paul,

first of all... thank you very much for your work. Excellent!

I'm using pyparsing for a project with ambiguous grammars. I realized that I need a "GLR" solution.

Problem Statement
-----------------
A very simple example of the shortcoming of the current pyparsing implementation is the following:

    AB = Literal('AB')
    node = AB ^ 'A' ^ 'BC'
    g = OneOrMore(node)
    x = g.parseString("ABC")

If you relax the requirement for Or to return the longest match you could get an overall better match: ['A', 'BC'] is better than ['AB']. The idea is that a local optimization (longest match for Or) does not imply a global optimization.

Solution
--------
As a proof of concept I modified pyparsing to get the result. The hack is really ugly (10 minutes work) but I think it shows the concept: http://www.enuan.com/glr.tgz

Some more ideas:

1. Parse element classes should not return a single ParseResults instance chosen with local optimization but, optionally, return the whole set of possible ParseResults (see ResultSet).
2. Expressions should evaluate recursively ALL solutions; the final result is a ResultSet.

I think it could be viable to modify parse and parseImpl in order to always return a ResultSet. In the example above the result set would be:

    ['A', 'BC']
    ['AB']
    ['A']

TO BE VERIFIED
--------------
- Actions: without big changes it would be possible to support actions that don't clobber globals... do you think this is a strong limitation?
- It would be nice to rate the solutions. The default criterion could be that of weighting all matches in this way: 1 (default char weight for a parse element) * num_chars_matched. That could be done in parseString and should be customizable...

CONCLUSION :-)
--------------
I'm going to invest a lot in pyparsing, but I definitely need this feature. I'd be more than happy to support you in any way... Do you think this is something that we can expect to be available in pyparsing?

Ciao!!
Paolo
|
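Running the posted example against stock pyparsing shows the behavior Paolo describes: Or ('^') keeps only the longest local match, so the parse commits to 'AB' and never reaches the ['A', 'BC'] split (output shown as printed by a recent pyparsing; the parseAll keyword is available in later releases):

    from pyparsing import Literal, OneOrMore

    AB = Literal('AB')
    node = AB ^ 'A' ^ 'BC'
    g = OneOrMore(node)

    print(g.parseString("ABC"))   # -> ['AB']; the trailing 'C' is left unconsumed
    # g.parseString("ABC", parseAll=True) would raise a ParseException instead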
From: Michel P. <mi...@di...> - 2005-08-31 05:01:17
|
On Tue, 2005-08-30 at 20:54 -0500, Paul McGuire wrote: > Do this right at the end of your initGrammar method, after assigning > self.block: > > self.block = ZeroOrMore( ...etc, etc. ) > self.block.leaveWhitespace() > > This will recursively set whitespace handling through the whole bnf, not > just for the root node, and your whitespace handling should be more > predictable. After adding this line, I reran your test, and I think I got > all the between-tag whitespace you were looking for. > > I'm also glad asXML() seems to be working adequately for you. As I > mentioned before, this method is a bit iffy, so it is fortunate that you are > getting such good results. Cool! I'll check this out. > > Congratulations on such a sophisticated parsing application! You should see rdflib sparql support: http://svn.rdflib.net/trunk/rdflib/sparql/grammar.py it parses 28 of the 64 standard sparql queries. -Michel > > -- Paul > |
From: Paul M. <pa...@al...> - 2005-08-31 01:54:53
|
Michel -

This part of pyparsing is not well-documented at all, since I typically discourage people from writing whitespace-sensitive parsers. Very often, people come from writing regexp's and try to figure out how to explicitly handle whitespace between tokens, and I have to explain that pyparsing doesn't require explicit whitespace handling, that whitespace is assumed to be a token delimiter, but that the whitespace itself is skipped/ignored by default.

However, your grammar is *by its nature* whitespace-sensitive. So you probably need to call the leaveWhitespace() method on your root parse object, self.block, as in:

    self.block.leaveWhitespace()

Do this right at the end of your initGrammar method, after assigning self.block:

    self.block = ZeroOrMore( ...etc, etc. )
    self.block.leaveWhitespace()

This will recursively set whitespace handling through the whole bnf, not just for the root node, and your whitespace handling should be more predictable. After adding this line, I reran your test, and I think I got all the between-tag whitespace you were looking for.

I'm also glad asXML() seems to be working adequately for you. As I mentioned before, this method is a bit iffy, so it is fortunate that you are getting such good results.

Congratulations on such a sophisticated parsing application!

-- Paul
|
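To see what leaveWhitespace() changes, here is a tiny standalone illustration (my own example, not from the thread): by default pyparsing skips whitespace between tokens, and leaveWhitespace() turns that off recursively for the expression it is called on:

    from pyparsing import Literal

    ab = Literal("a") + Literal("b")
    print(ab.parseString("a b"))        # whitespace between tokens is skipped -> ['a', 'b']

    ab_strict = (Literal("a") + Literal("b")).leaveWhitespace()
    print(ab_strict.parseString("ab"))  # still fine -> ['a', 'b']
    # ab_strict.parseString("a b") now raises a ParseException at the space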
From: Michel P. <mi...@di...> - 2005-08-31 00:38:17
|
On Thu, 2005-08-18 at 13:57 -0500, Paul McGuire wrote: > I have made a few attempts at indentation-based parsing in the past, but I > looked at them last night, and they are really not so good. I think the key > will be in a) using a parse action with col() to detect the indentation > level of the current line, and b) keeping a global stack of indentations > levels seen thus far, so that you can tell if your current line is part of > the current indent level, a deeper level or a higher level. Well I have made a bit more progress on this, as well as some great progress on the sparql parser with pyparsing. On the indentation problem, I have the following module. The relavent pyparsing code is down near the end: https://svn.cignex.com/public/slipr/slipr/slipr.py It's pretty self contained. When this module is run, it tries to parse the test file: https://svn.cignex.com/public/slipr/data/pyinrdf.slpr and I've got everything matching fine, except the whitespace. ;) For some reason I can't get the whitespace action to work right, it only matches about every other whitespace in the doc. Here's some of the output. Notice at the end how some of the whitespace is not matched between tags: <tag> <name> <identifier>RDF</identifier> </name> <attrs> <name> <identifier>python</identifier> </name> <string>"http://namespaces.zemantic.org/python#"</string> </attrs> </tag> [' '] [' '] <tag> <name> <identifier>Ontology</identifier> </name> <attrs> <name> <identifier>python</identifier> </name> <value> <identifier>bob</identifier> </value> </attrs> </tag> [' '] [' '] <tag> <name> <identifier>Class</identifier> </name> <attrs> <name> <identifier>Object</identifier> </name> </attrs> </tag> [' '] <tag> <name> <identifier>issubclass</identifier> </name> <attrs> <name> <identifier>Object</identifier> </name> </attrs> </tag> <tag> <name> <identifier>isinstance</identifier> </name> <attrs> <name> <identifier>Object</identifier> </name> </attrs> </tag> I'm not sure what's wrong, can anyone spot a simple error or suggest another way to handle the indentation issue? Thanks, -Michel |
From: Michel P. <mi...@di...> - 2005-08-18 20:58:24
|
On Thu, 2005-08-18 at 13:57 -0500, Paul McGuire wrote: > Michel - > > Not so much global data, as it is parsing state preserved inside the > pyparsing class instances (namely the cacheing of exception instances). I > am fairly certain that calling parseString is not thread-safe, and you > should interlock calls to it if you have multiple threads calling it. Oh I'm sorry, what I meant to say was different threads will be calling different instances, not the same instance. IE, every thread will have its own SPARQLGrammar.Query instance. sliplib used module vars and declared global vars and thus the _whole module_, and all of its features, cannot be used from different threads, but different instances of pyparsing classes should be fine. I think. ;) > I have made a few attempts at indentation-based parsing in the past, but I > looked at them last night, and they are really not so good. I think the key > will be in a) using a parse action with col() to detect the indentation > level of the current line, and b) keeping a global stack of indentations > levels seen thus far, so that you can tell if your current line is part of > the current indent level, a deeper level or a higher level. Sounds good. Something to think about would be encapsulating the indentation level in something other than a global var so that it is thread safe. Maybe the parse action can be a callable instance that keeps this level internal? class IndentationAction(object): level = 0 def __call__(self, *args): # ... indentation tracking logic indent = White().parseAction(IndentationAction()) or something like that. > When creating your test cases, be sure to add unfriendly tests, such as > nested levels that unwind to a higher nesting than just the immediate > parent. That is: > > A > A1 > A2 > A2a > A2aa > A2ab > A2b > A2ba > A3 > > Since there is no A2c entry (to be a peer of A2a and A2b), your parser will > end up doing a double pop from the indentation stack. > > Also, what would this data signify? > > A > A1 > A2 > A2a > A2aa > A2ab > A2b > A2ba > A2.5 > A3 > > Note that A2.5 is more indented than A2 and A3, but less indented than A2a > and A2b. I'm guessing this case should probably be an error (and if you > detect it in a parse action, you should raise ParseFatalException instead of > simple ParseException, to halt parsing immediately). Right, obviously we went good structured representation but not necessarily the exact semantics of Python, unless desired. I'll work some more on this over the weekend and let you know what my results are. -Michel |
From: Michel P. <mi...@ci...> - 2005-08-18 19:11:38
|
Thanks for the tips, Paul, on the sparql grammar. It can successfully parse about a dozen or so test queries. I've handed of that portion for the moment to another team member who will be wiring the grammar up to the query logic (which is already done, thanks to Ivan Herman from the W3C) and soon rdflib will have top to bottom sparql support! C ouple of things related to that before I get onto my question: 1) have you considered distributing pyparsing as an egg? 2) pypi seems to have a slightly out of data version, 3) any new pyparsing releases in the near future? Ok, onto a question. I've been working on a new syntax flavor of RDF/XML. It's basically SLiP (Something Like Python http://www.scottsweeney.com/projects/slip/) but a re-implementation for a couple of reasons, the current SLiP implementation: - uses a lot of module globals, we need concurrent parsing/generation - is based on an older expat style parsing - does nice xml->slip, but the slip->xml is garbage - uses an inefficient backtracking parse algorithm The new implementation will: - be class based using no module globals - use xml.sax for parsing xml->slip (done) - use pyparsing for parsing slip->xml (this email) In addition to reimplementing SLiP I plan to extend it to have some RDF eye candy, basically just some syntax short hands that save on typing lots of "rdf:ID=" and "rdf:resource=" and some other nice stuff. For example, here is some RDF/XML <?xml version="1.0" ?> <!DOCTYPE rdf:RDF > <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://namespaces.zemantic.org/python#" xml:base="http://namespaces.zemantic.org/python#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:python="http://namespaces.zemantic.org/python#"> <owl:Ontology rdf:about="http://namespaces.zemantic.org/python#"> <rdfs:comment> A Python RDF ontology. </rdfs:comment> </owl:Ontology> <!-- declaration properties--> <owl:ObjectProperty rdf:ID="of"> <rdfs:subPropertyOf rdf:resource="rdfs:domain"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="value"> <rdfs:subPropertyOf rdf:resource="rdfs:range"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="attribute"> <rdfs:subPropertyOf rdf:resource="owl:ObjectProperty"/> becomes the SLiP: rdf:RDF(xmlns:rdf=http://www.w3.org/1999/02/22-rdf-syntax-ns#, xmlns:xmlns=http://namespaces.zemantic.org/python#, xmlns:rdfs=http://www.w3.org/2000/01/rdf-schema#, xmlns:owl=http://www.w3.org/2002/07/owl#, xmlns:python=http://namespaces.zemantic.org/python#, xml:base="http://namespaces.zemantic.org/python#"): owl:Ontology(rdf:about="http://namespaces.zemantic.org/python#"): rdfs:comment(): "A python RDF ontology." owl:ObjectProperty(rdf:ID="of"): rdfs:subPropertyOf(rdf:resource="rdfs:domain"): owl:ObjectProperty(rdf:ID="value"): rdfs:subPropertyOf(rdf:resource="rdfs:range"): owl:ObjectProperty(rdf:ID="attribute"): rdfs:subPropertyOf(rdf:resource="owl:ObjectProperty"): which in turn becomes the SLiPR: RDF(xmlns:python="http://namespaces.zemantic.org/python#"): Ontology() """ A Python RDF ontology. """ # declaration properties ObjectProperty(of): subPropertyOf(domain) ObjectProperty(value): subPropertyOf(range) ObjectProperty(attribute): subPropertyOf(ObjectProperty) value(Object) I want to write a pyparsing grammar that parses SLiP and SLiPR. The tag and attribute matching parts are easy and I've made good progress on that, the hard part has been parsing and understanding the python-style indentation based syntax. 
So far the only idea I've had is to keep track of the current indentation level somewhere and register a parser action on indentation tokens that detects the indentation syntax changes. But I was hoping for a "pure" pyparsing solution that required no actions or state variables. Any pointers? Thanks! -Michel |
From: Paul M. <pa...@al...> - 2005-08-18 18:57:18
|
Michel -

Not so much global data, as it is parsing state preserved inside the pyparsing class instances (namely the cacheing of exception instances). I am fairly certain that calling parseString is not thread-safe, and you should interlock calls to it if you have multiple threads calling it.

I have made a few attempts at indentation-based parsing in the past, but I looked at them last night, and they are really not so good. I think the key will be in a) using a parse action with col() to detect the indentation level of the current line, and b) keeping a global stack of indentation levels seen thus far, so that you can tell if your current line is part of the current indent level, a deeper level or a higher level.

When creating your test cases, be sure to add unfriendly tests, such as nested levels that unwind to a higher nesting than just the immediate parent. That is:

    A
      A1
      A2
        A2a
          A2aa
          A2ab
        A2b
          A2ba
      A3

Since there is no A2c entry (to be a peer of A2a and A2b), your parser will end up doing a double pop from the indentation stack.

Also, what would this data signify?

    A
      A1
      A2
        A2a
          A2aa
          A2ab
        A2b
          A2ba
       A2.5
      A3

Note that A2.5 is more indented than A2 and A3, but less indented than A2a and A2b. I'm guessing this case should probably be an error (and if you detect it in a parse action, you should raise ParseFatalException instead of simple ParseException, to halt parsing immediately).

-- Paul
|
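A rough sketch of the approach described above (my own illustration, not from the thread): an Empty() element carrying a parse action that reads col() and maintains a stack of indent levels. It assumes the surrounding grammar still skips whitespace, so the action fires at the first non-blank character of each line:

    from pyparsing import Empty, ParseException, col

    indent_stack = [1]

    def check_indent(s, loc, toks):
        cur = col(loc, s)
        if cur > indent_stack[-1]:
            indent_stack.append(cur)          # deeper nesting level
            return
        while indent_stack and cur < indent_stack[-1]:
            indent_stack.pop()                # unwind one or more levels
        if not indent_stack or cur != indent_stack[-1]:
            # e.g. the "A2.5" case above: no previously seen level matches
            raise ParseException(s, loc, "inconsistent indentation")

    indentation = Empty().setParseAction(check_indent)
    # place 'indentation' at the start of each line-level rule in the grammar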
From: Michel P. <mi...@di...> - 2005-08-18 17:30:42
|
On Thu, 2005-08-18 at 06:35 -0500, Paul McGuire wrote: > Michel - > > Glad the sparql parsing is proceeding well. > > I'm not sure pyparsing is going to go much better than your current parser, > given the warts that you cite: > - pyparsing is not very good in multi-thread code, for the same reasons you > mention, mostly use of globals. I don't see any vars declared global in pyparsing unless you've added them recently. I don't see any of the other usual thread-killing warts either, like mutable default arguments or module level vars. I've no experience with any other kind of global state in python, by my eyes pyparsing should be pretty threadsafe, but hey, you're the author. ;) I'd be more than willing to try and fix and/or verify pyparsing with multiple threads. Really the thread issue is just a minor concern, I don't think they're be much concurrent parsing as much as generation, so I can get a way with locks for now if it's totally necessary. > - pyparsing's asXML() output for parsed results is somewhat hit-or-miss. I > really should remove that code for now, or at least label it as "shaky". I'm just using it for visual verification for now, so it's shakyness is ok for me. > > To do indentation-based parsing, you will need a parse action to do the > indentation work, and a stack to keep track of the current indentation > levels, so that you can unwind to previous indent levels. Here's one > suggestion if you haven't thought of it already: use pyparsing's > col(loc,strg) built-in inside the parse action, to compute the column of the > starting text. Great, that's what I imagined, but the col() trick will be usefull, thanks Paul! -Michel |
From: Michel P. <mi...@di...> - 2005-08-18 16:51:20
|
Thanks for the tips, Paul, on the sparql grammar. It can successfully parse about a dozen or so test queries. I've handed of that portion for the moment to another team member who will be wiring the grammar up to the query logic (which is already done, thanks to Ivan Herman from the W3C) and soon rdflib will have top to bottom sparql support! C ouple of things related to that before I get onto my question: 1) have you considered distributing pyparsing as an egg? 2) pypi seems to have a slightly out of data version, 3) any new pyparsing releases in the near future? Ok, onto a question. I've been working on a new syntax flavor of RDF/XML. It's basically SLiP (Something Like Python http://www.scottsweeney.com/projects/slip/) but a re-implementation for a couple of reasons, the current SLiP implementation: - uses a lot of module globals, we need concurrent parsing/generation - is based on an older expat style parsing - does nice xml->slip, but the slip->xml is garbage - uses an inefficient backtracking parse algorithm The new implementation will: - be class based using no module globals - use xml.sax for parsing xml->slip (done) - use pyparsing for parsing slip->xml (this email) In addition to reimplementing SLiP I plan to extend it to have some RDF eye candy, basically just some syntax short hands that save on typing lots of "rdf:ID=" and "rdf:resource=" and some other nice stuff. For example, here is some RDF/XML <?xml version="1.0" ?> <!DOCTYPE rdf:RDF > <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://namespaces.zemantic.org/python#" xml:base="http://namespaces.zemantic.org/python#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:python="http://namespaces.zemantic.org/python#"> <owl:Ontology rdf:about="http://namespaces.zemantic.org/python#"> <rdfs:comment> A Python RDF ontology. </rdfs:comment> </owl:Ontology> <!-- declaration properties--> <owl:ObjectProperty rdf:ID="of"> <rdfs:subPropertyOf rdf:resource="rdfs:domain"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="value"> <rdfs:subPropertyOf rdf:resource="rdfs:range"/> </owl:ObjectProperty> <owl:ObjectProperty rdf:ID="attribute"> <rdfs:subPropertyOf rdf:resource="owl:ObjectProperty"/> becomes the SLiP: rdf:RDF(xmlns:rdf=http://www.w3.org/1999/02/22-rdf-syntax-ns#, xmlns:xmlns=http://namespaces.zemantic.org/python#, xmlns:rdfs=http://www.w3.org/2000/01/rdf-schema#, xmlns:owl=http://www.w3.org/2002/07/owl#, xmlns:python=http://namespaces.zemantic.org/python#, xml:base="http://namespaces.zemantic.org/python#"): owl:Ontology(rdf:about="http://namespaces.zemantic.org/python#"): rdfs:comment(): "A python RDF ontology." owl:ObjectProperty(rdf:ID="of"): rdfs:subPropertyOf(rdf:resource="rdfs:domain"): owl:ObjectProperty(rdf:ID="value"): rdfs:subPropertyOf(rdf:resource="rdfs:range"): owl:ObjectProperty(rdf:ID="attribute"): rdfs:subPropertyOf(rdf:resource="owl:ObjectProperty"): which in turn becomes the SLiPR: RDF(xmlns:python="http://namespaces.zemantic.org/python#"): Ontology() """ A Python RDF ontology. """ # declaration properties ObjectProperty(of): subPropertyOf(domain) ObjectProperty(value): subPropertyOf(range) ObjectProperty(attribute): subPropertyOf(ObjectProperty) value(Object) for a more complete example of the SLiPR language see: https://svn.cignex.com/public/slipr/pyinrdf.slpr I want to write a pyparsing grammar that parses SLiP and SLiPR. 
The tag and attribute matching parts are easy and I've made good progress on that, the hard part has been parsing and understanding the python-style indentation based syntax. So far the only idea I've had is to keep track of the current indentation level somewhere and register a parser action on indentation tokens that detects the indentation syntax changes. But I was hoping for a "pure" pyparsing solution that required no actions or state variables. Any pointers? Thanks! -Michel |
From: Michel P. <mi...@di...> - 2005-08-15 03:33:23
|
On Sun, 2005-08-14 at 17:29 -0500, Paul McGuire wrote: > Michel - > > Wow! This is quite an ambitious "first project" for using pyparsing, but > you seem to have gotten pretty far. Thanks! And thanks for your note, as soon as I get a chance tonight/tommorow I'll go over it. I originally started with a reverse of your spec, but then I went the other way with all the forwards. I think you're right that they should be reversed. Thanks for pyparsing it works really great! > > There is a very simple bug in your parser. Line 372 is: > > _VAR_ = Word("?", alphanums+'_.-', min=2) > > It should be > > _VAR_ << Word("?", alphanums+'_.-', min=2) doh! I'm guessing this is probably the #1 first mistake with pyparsing? ;) -Michel |
From: Michel P. <mi...@ci...> - 2005-08-14 22:33:41
|
On Sun, 2005-08-14 at 12:43 -0700, Michel Pelletier wrote: > Hi, > > Recently Ivan Herman implemented the SPARQL query language logic for > rdflib but there was no parser. None of us are parser geeks but I > decided to try out a few different ones and I'm experimenting with > pyparsing. You an see my first draft at > > http://svn.rdflib.net/trunk/rdflib/sparql/grammar.py > > I'm having trouble with a part of the grammar and I was hoping someone > else was running into this. Here are my test queries so far: Hmm.. it apears to be my use of forward declared terminals. When I replace a terminal with the same expression it's declared with it works. Oh well, now I can make progress. ;) -Michel |
From: Paul M. <pa...@al...> - 2005-08-14 22:29:24
|
Michel - Wow! This is quite an ambitious "first project" for using pyparsing, but you seem to have gotten pretty far. There is a very simple bug in your parser. Line 372 is: _VAR_ = Word("?", alphanums+'_.-', min=2) It should be _VAR_ << Word("?", alphanums+'_.-', min=2) Since you predefine a Forward for _VAR_ and then used that Forward as part of the definition of a select command, reassigning a different expression to _VAR_ loses the previous object entirely. After making this change, all of your tests appear to pass. (You have a similar bug on line 440, in the definition of _NCNAME_.) Here are some other stylistic comments: 1. GraphPattern << PatternElement + ZeroOrMore(PatternElement) Can be more cleanly defined as: GraphPattern << OneOrMore(PatternElement) And similarly for: TriplePatternList << TriplePattern + ZeroOrMore(TriplePattern) 2. String << _STRING_LITERAL1_ | _STRING_LITERAL2_ Since _STRING_LITERAL1_ is just a sglQuotedString (and used nowhere else), and _STRING_LITERAL2_ is a dblQuotedString (and also used nowhere else), you might try using String << quotedString Since pyparsing defined quotedString as sglQuotedString | dblQuotedString. 3. _INTEGER_LITERAL_ << (Optional(oneOf("+ -")) + _DECIMAL_LITERAL_ + Optional(oneOf("l L")) | _HEX_LITERAL_ + Optional(oneOf("l L"))) _HEX_LITERAL_ << zero + oneOf("x X") + Word(nums + srange('[a-f]') + srange('[a-f]')) _FLOATING_POINT_LITERAL_ << (Optional(oneOf("+ -")) + Word(nums) + dot + Word(nums) + Optional(_EXPONENT_) | dot | OneOrMore(nums) + Optional(_EXPONENT_) | OneOrMore(nums) + _EXPONENT_) Don't forget that pyparsing accepts whitespace between elements of an And expression. I don't think you want to accept "0 x 001Ab8" as a hex literal. This expression would also return the results as a list of tokens, so "0x001Ab8" would be returned as ['0','x','001Ab8']. Fix this by wrapping these literal expressions in a Combine class, which does 2 things: requires that all elements be adjacent (although this can be overridden if desired); and concatenates all of the matched tokens into a single string. Also, a point about srange (I assume that one of your '[a-f]'s should be '[A-F]'). Srange can accept multiple ranges or single characters within its range argument, so you could just as easily define HEX_LITERAL as (adding in the Combine): _HEX_LITERAL_ << Combine( zero + oneOf("x X") + Word(nums + srange('[a-fA-F]')) ) Or even: _HEX_LITERAL_ << Combine( zero + oneOf("x X") + Word(srange('[0-9a-fA-F]')) ) (Hmmm, you are about the umpteenth person I've seen have to define the valid set of hex number characters, maybe I should add this as a pyparsing helper...). Lastly: You have chosen a very common EBNF->pyparsing technique of defining *many* Forward elements for your constructions, and the populating them later using the '<<' operator. This is needed since most EBNF's are top-down definitions, but Python requires variables to be defined before they can be referenced. I would propose reading your EBNF in reverse, so that you can simply define constructions in advance of when they are referenced, so there is no need to create empty Forward's to be defined later. 
I mean, these statements just don't seem to merit being Forwards (as well as probably requiring a Combine here and there): _LANG_ << at + _A2Z_ + Optional(dash + _A2Z_) _A2Z_ << Word(alphas) _DECIMAL_LITERAL_ << _DIGITS_ _EXPONENT_ << oneOf("e E") + Optional(oneOf("+ -")) + Word(nums) String << quotedString _QNAME_ << Optional(_NCNAME_ + colon) + _NCNAME_ _VAR_ << Word("?", alphanums+'_.-', min=2) _DIGITS_ << Word(nums) _NCNAME_ << Word(alphas+'_', alphanums+'_.-') Reserve Forward's for those expressions that are truly recursive, such that they must be referenced before they are fully defined (that is, they must be referenced within their own definition). For instance, arithmetic expressions usually fall into this category, plus your SELECT statements do when a select can be embedded within another select statement's WHERE clause. Forward's incur a fair bit of extra call overhead, and this always translates into poor performance. Instead, just start with a basic definition of _NCNAME_ and work backwards, and all should work much more cleanly (and don't have to remember to use '<<' instead of '=' :) ). _NCNAME_ = Word(alphas+'_', alphanums+'_.-') _DIGITS_ = Word(nums) _VAR_ = Word("?", alphanums+'_.-', min=2) _QNAME_ = Optional(_NCNAME_ + colon) + _NCNAME_ String = quotedString _EXPONENT_ = oneOf("e E") + Optional(oneOf("+ -")) + Word(nums) _DECIMAL_LITERAL_ = _DIGITS_ _FLOATING_POINT_LITERAL_ = Combine(Optional(oneOf("+ -")) + Word(nums) + dot + Word(nums) + Optional(_EXPONENT_) | dot | OneOrMore(nums) + Optional(_EXPONENT_) | OneOrMore(nums) + _EXPONENT_) _A2Z_ = Word(alphas) _LANG_ = Combine(at + _A2Z_ + Optional(dash + _A2Z_)) Of course, I would be the last to argue with demonstrated progress, and you have really gotten quite far without any of my help, and things do seem to be working so far. So take as much/little/none of this advice as you like, and best of luck using pyparsing! (and keep us all posted on how things are going) -- Paul McGuire |
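For reference, the compact hex-literal form Paul describes (with the Combine and the single srange) would look roughly like this; Literal("0") stands in for the grammar's own 'zero' token, and the output is shown as printed by a recent pyparsing:

    from pyparsing import Combine, Literal, Word, oneOf, srange

    hex_chars = srange("[0-9a-fA-F]")
    _HEX_LITERAL_ = Combine( Literal("0") + oneOf("x X") + Word(hex_chars) )

    print(_HEX_LITERAL_.parseString("0x001Ab8"))   # -> ['0x001Ab8']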
From: Michel P. <mi...@di...> - 2005-08-14 19:42:45
|
Hi, Recently Ivan Herman implemented the SPARQL query language logic for rdflib but there was no parser. None of us are parser geeks but I decided to try out a few different ones and I'm experimenting with pyparsing. You an see my first draft at http://svn.rdflib.net/trunk/rdflib/sparql/grammar.py I'm having trouble with a part of the grammar and I was hoping someone else was running into this. Here are my test queries so far: ts = ["SELECT *", "SELECT DISTINCT *", "SELECT ?title", "SELECT ?title, ?name", "SELECT * FROM <a> WHERE ( <book1> <title> ?title )", ] and this is the top level definition for "Query" and "ReportFormat" and their coresponding EBNF rules: # [1] Query ::= PrefixDecl* ReportFormat PrefixDecl* FromClause? WhereClause? Query << (ZeroOrMore(PrefixDecl) + ReportFormat + ZeroOrMore(PrefixDecl) + Optional(FromClause) + Optional(WhereClause)) # [2] ReportFormat ::= 'select' 'distinct'? <VAR> ( CommaOpt <VAR> )* # | 'select' 'distinct'? '*' # | 'construct' TriplePatternList # | 'construct' '*' # | 'describe' VarOrURI ( CommaOpt VarOrURI )* # | 'describe' '*' # | 'ask' ReportFormat << (select + Optional(distinct) + Group(delimitedList(_VAR_)) | select + Optional(distinct) + star | construct + TriplePatternList | construct + star | describe + delimitedList(VarOrURI) | describe + star | ask) My problem is with the third and fourth test queries. The first two and the last pass no problem. I can't seem to match Group(delimitedList(_VAR_)) properly in the grammar, but I can match it well enough from the command line: >>> s = SPARQLGrammar.select + Optional(SPARQLGrammar.distinct) + Group(delimitedList(SPARQLGrammar._VAR_)) >>> s.parseString('select ?title') (['select', (['?title'], {})], {}) >>> s.parseString('select ?title, ?bob') (['select', (['?title', '?bob'], {})], {}) but when I run the grammar on the same query above, ReportFormat doesn't match: >>> ## working on region in file /tmp/python-104181Id.py... tokens = ['select', '*'] tokens = ['select', 'distinct', '*'] SELECT ?title ^ (at char 7), (line:1, col:8) SELECT ?title, ?name ^ (at char 7), (line:1, col:8) tokens = ['select', '*', 'from', 'a'] >>> I'm not sure what's wrong here, the first expression in ReportFormat looks just like the one I did in the command line, but I can't get it to match. Anyone have any pointers? Thanks, -Michel |
From: <pt...@au...> - 2005-05-17 15:54:15
|
Sebastian -

First of all, welcome to the world of pyparsing! I hope it serves well for you.

Second, I must say that embarking on a grammar as complex as JavaScript is a pretty big job, and you may want to start with something less detailed until you are familiar with pyparsing.

But finally, to answer your question, yes, pyparsing has support for recursive grammars. The Forward() class is a placeholder that allows you to define a Python variable for a parse expression, but to define its parse sub-grammar at a later time. In general, the sequence is:

    var = Forward()             # define var using Forward()
    var2 = blah | blah2 | var   # define one or more other constructs using var
    var << '(' + var2 + ')'     # define var contents, load using '<<' operator

This last step is crucial - do NOT do:

    var = '(' + var2 + ')'      # WRONG!!!

This is a common error when defining Forward()'s, just be careful.

In setting up a grammar for JS, I expect you will have a number of Forward elements, not just for function definitions. You will have arithmetic functions to evaluate, grouped statements composed of simple or grouped statements, class definitions, etc. Look at the fourFn.py and idlParse.py examples that ship with pyparsing. They may give you some hints.

For a simple example, here is a list parser. In Python, if you print out a list to a string, this parser will reconstruct the list - it is a safe alternative to using eval(). Notice how the listDef is a Forward, is used as part of a listItem, then the listDef body is loaded using '<<'. This is similar to what you will need to do.

Good luck!
-- Paul

    # get pyparsing at http://pyparsing.sourceforge.net
    from pyparsing import quotedString, Forward, Literal, delimitedList, Group, removeQuotes

    quotedString.setParseAction(removeQuotes)

    testdata = """[["abc","def",["ghi","jkl"],"mno"],["pqr","stu", ["vwx","yz0"],["123",["456","789"]],"$$$"],"^^^"]"""

    lbrack = Literal("[").suppress()
    rbrack = Literal("]").suppress()
    listDef = Forward()
    # add more things to listItem, such as integers, etc. if your list has
    # other than quoted strings
    listItem = quotedString | listDef
    listDef << lbrack + Group( delimitedList(listItem) ) + rbrack

    results = listDef.parseString(testdata)

    def printList(l, ind=0):
        for item in l:
            if isinstance(item, list):
                print " "*ind, "+"
                printList(item, ind+2)
            else:
                print " "*ind, "-", item

    printList(results[0].asList())
|
From: Sebastian W. <seb...@1u...> - 2005-05-17 12:32:18
|
Hi!

I am trying to build parsers for CSS and JavaScript with pyparsing. Pyparsing seems to be really great for this job. I have a problem building recursive matches, for example a snippet that can parse the following:

    function foo() {
        function bar() {
        }
        somecode();
    }

The "function" could be used at any level. I tried something like the following, which doesn't work:

    js_function = Literal("function") + Word(alphanums) + Literal("(") + Literal(")") +
                  Literal("{") + OneOrMore( js_function | ... ) + Literal("}")

js_function is not yet defined at the point where the OneOrMore wants to use it. This is something like a recursion. Do you have any idea or trick to get this to work?

Thank you.

Best regards,
Sebastian

--
Sebastian Werner
Application-Development (Pustefix-Core)
1&1 Internet AG
seb...@1u...
Fon: ++49 721 91374 - 154
Mobile: ++49 179 4590730

The Web is a procrastination apparatus: It can absorb as much time as is required to ensure that you won't get any real work done. - Jakob Nielsen (Usability Guru)
There is no substitute for experience. - Darckness (Gentoo Kernel Hacker)
|
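Following the Forward() pattern from the reply above, a minimal sketch for exactly this nested-function shape might look like the following (my own illustration; 'js_statement' is a stand-in for a real statement rule, a real JavaScript grammar needs much more, and the output is shown as printed by a recent pyparsing):

    from pyparsing import Forward, Group, Suppress, Word, ZeroOrMore, alphanums

    js_function = Forward()
    js_statement = Word(alphanums + "_") + Suppress("(") + Suppress(")") + Suppress(";")
    js_function << Group(
        Suppress("function") + Word(alphanums + "_") + Suppress("(") + Suppress(")") +
        Suppress("{") + ZeroOrMore(js_function | js_statement) + Suppress("}")
    )

    sample = """
    function foo() {
        function bar() {
        }
        somecode();
    }
    """
    print(js_function.parseString(sample))   # -> [['foo', ['bar'], 'somecode']]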
From: Michele P. <mic...@un...> - 2005-04-28 14:39:14
|
pt...@au... wrote:

> Michele -
>
> First of all, are you sure this is a valid entry? The one example
> file I have would list this as:
>
>     controls {
>         inet 127.0.0.1 allow { any; }; keys { "key";};
>     };
>
> instead of your version:
>
>     controls {
>         inet 127.0.0.1 allow { any; } keys { "key";};
>     };

I racked my brain searching for an error in my parsing strings, and the problem was a ";" !!!

> Pyparsing includes some debugging capabilities so you can peek into
> the parsing logic process. Try changing the two lines:
>
>     simple = Group(value + ZeroOrMore(value) + ";")
>     statement = Group(value + ZeroOrMore(value) + "{" +
>                       Optional(toplevel) + "}" + ";")
>
> to
>
>     simple = Group(value + ZeroOrMore(value) + ";").setDebug()
>     statement = Group(value + ZeroOrMore(value) + "{" +
>                       Optional(toplevel) + "}" + ";").setDebug()
>
> You will now start to see messages during parsing when each of these
> expressions is tried, and either succeeds or throws an exception.
>
> After you get this running, let us know what you find.
>
> -- Paul

I saw this method and I find it very useful. I'll use it.

Thanks,
Michele
|