htmlparser-developer Mailing List for HTML Parser (Page 19)

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

Somik,

My instincts would say to use the simplest mechanism possible.
In this case it would be instanceof, since the getType() way involves 
extra fields and accessor methods.

But what problem are you trying to solve?

Is it the "if (node instanceof HTMLLinkTag)" that seems to be needed 
everywhere?
Perhaps HTMLNode should have a "getLink()" method that returns null but 
is overridden in HTMLLinkTag?
Similarly, rationalization of toString(), getPlainTextString(), 
getHTML() and any required new methods to return appropriate renditions 
of the text within the node could eliminate the instanceof operations in 
StringExtractor and elsewhere.
My $0.02 worth.

Derrick

Somik Raha wrote:

>Hi Derrick,
>    It was really nice to read your reply. I tried a more accurate test (no,
>I didnt include instanceof HTMLNode, as our matches are at most one level
>up). The results (attached graph) show that it is almost the same - there is
>no perceivable improvement in this case. I guess if one goes a couple of
>layers up, the benefits would start to show.
>
>    Which brings me to the next question - knowing that we have no
>perceptible improvement to gain, should we recommend the use of the
>object-oriented way ?
>
>Regards,
>Somik
>
>----- Original Message -----
>From: "Derrick Oswald" <Der...@ro...>
>To: <htm...@li...>
>Sent: Saturday, January 18, 2003 6:36 AM
>Subject: Re: [Htmlparser-developer] Java Performance question
>  
>

2001	Jan	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct (4)	Nov (1)	Dec (4)
2002	Jan (12)	Feb	Mar (7)	Apr (27)	May (14)	Jun (16)	Jul (27)	Aug (74)	Sep (1)	Oct (23)	Nov (12)	Dec (119)
2003	Jan (31)	Feb (23)	Mar (28)	Apr (59)	May (119)	Jun (10)	Jul (3)	Aug (17)	Sep (8)	Oct (38)	Nov (6)	Dec (1)
2004	Jan (4)	Feb (4)	Mar (1)	Apr (2)	May	Jun (7)	Jul (6)	Aug (1)	Sep	Oct	Nov	Dec
2005	Jan	Feb (1)	Mar	Apr (8)	May	Jun	Jul	Aug (2)	Sep (10)	Oct (4)	Nov (15)	Dec
2006	Jan	Feb (1)	Mar	Apr (4)	May (11)	Jun	Jul	Aug	Sep (2)	Oct	Nov	Dec
2007	Jan (3)	Feb (2)	Mar	Apr (2)	May	Jun	Jul (1)	Aug	Sep	Oct	Nov	Dec
2008	Jan	Feb (1)	Mar	Apr	May	Jun	Jul	Aug	Sep (5)	Oct (1)	Nov	Dec
2009	Jan	Feb (1)	Mar	Apr (2)	May	Jun (4)	Jul	Aug (1)	Sep	Oct	Nov	Dec (2)
2010	Jan (1)	Feb	Mar	Apr (8)	May	Jun	Jul	Aug	Sep (6)	Oct	Nov (1)	Dec
2011	Jan	Feb	Mar	Apr	May (3)	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2012	Jan	Feb	Mar	Apr	May (1)	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2014	Jan	Feb	Mar	Apr	May (1)	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2015	Jan	Feb	Mar	Apr (1)	May	Jun (1)	Jul	Aug	Sep	Oct	Nov (2)	Dec (1)
2016	Jan	Feb	Mar	Apr	May	Jun	Jul (2)	Aug	Sep	Oct	Nov (2)	Dec (2)

htmlparser-developer Mailing List for HTML Parser (Page 19)

htmlparser-developer — The developer mailing list of the htmlparser project