Re: [Htmlparser-developer] removing navigation bar from parse output
Brought to you by:
derrickoswald
From: Derrick O. <der...@gm...> - 2009-12-16 05:33:46
|
Nav bars are usually identified by a DIV tag with a special Id which you could filter on with a TagNameFilter *and* a HasAttribute filter. A simplistic approach would just delete the node from it's parent, which may work in your case. On Tue, Dec 15, 2009 at 10:56 PM, Ted Yu <yuz...@gm...> wrote: > Hi, > Is it possible to remove navigation bar of web page (common on web portals) > from parse output ? > > The motivation is to provide more focused contents for page categorization. > > Thanks > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and > easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Htmlparser-developer mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-developer > > |