Re: [Ebiness-crawler] decent Html parser?
From: Allan R. G. L. <al...@li...> - 2001-05-28 14:04:21
Could it be a C++ template that recognizes a "regular expression" like '<a href="">'? Take a look at "regex++"...

--------------------------------------------
Allan Reffson Granja Lima (al...@li...)
ICQ: 34004301
Master's in Computer Science

On Sat, 26 May 2001, Mike Davis wrote:

> Hi guys,
>
> Well, I've already written a parser that simply extracts '<a href="">'
> tags from Html for the crawler, but was wondering if anyone knows of a
> good, stable parser that will generically parse Html? I think this would
> be far superior.
>
> I think the 'expat' library (hosted on SF - http://expat.sourceforge.net/)
> is the standard for Xml parsing, so we can use that for any Xml/xHtml we
> come across.
>
> Mike
>
>
> _______________________________________________
> Ebiness-crawler mailing list
> Ebi...@li...
> http://lists.sourceforge.net/lists/listinfo/ebiness-crawler
>
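
To make the regular-expression idea above concrete, here is a minimal sketch of pulling href values out of '<a href="">' tags. It is only an illustration under stated assumptions: it uses std::regex from the modern C++ standard library as a stand-in for regex++, and the function name extract_hrefs is hypothetical, not something from the crawler's code. A pattern like this is not a generic Html parser; it will miss unquoted attribute values and can be fooled by comments, scripts, or attributes whose names merely end in "href".

```cpp
#include <regex>
#include <string>
#include <vector>

// Hypothetical helper (not from the crawler): collect the href value of
// every <a ...> tag found in `html`. Assumes the value is quoted with
// either double or single quotes.
std::vector<std::string> extract_hrefs(const std::string& html) {
    // "<a", at least one whitespace character, any other attributes,
    // then href = "value" or href = 'value'. Matching is case-insensitive.
    static const std::regex anchor(
        R"re(<a\s+[^>]*?href\s*=\s*(?:"([^"]*)"|'([^']*)'))re",
        std::regex::icase);

    std::vector<std::string> hrefs;
    for (std::sregex_iterator it(html.begin(), html.end(), anchor), end;
         it != end; ++it) {
        const std::smatch& m = *it;
        // Group 1 holds a double-quoted value, group 2 a single-quoted one.
        hrefs.push_back(m[1].matched ? m[1].str() : m[2].str());
    }
    return hrefs;
}
```

Calling extract_hrefs on a page's text returns the link targets in document order, which is roughly what the hand-written extractor described in the quoted message does. For anything beyond link extraction, a real parser is still the safer route.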