htmlparser-user Mailing List for HTML Parser (Page 19)

Brought to you by: derrickoswald

htmlparser-user — The user mailing list for users of the htmlparser library

You can subscribe to this list here.

2001	Jan	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov (1)	Dec
2002	Jan (7)	Feb	Mar (9)	Apr (50)	May (20)	Jun (47)	Jul (37)	Aug (32)	Sep (30)	Oct (11)	Nov (37)	Dec (47)
2003	Jan (31)	Feb (70)	Mar (67)	Apr (34)	May (66)	Jun (25)	Jul (48)	Aug (43)	Sep (58)	Oct (25)	Nov (10)	Dec (25)
2004	Jan (38)	Feb (17)	Mar (24)	Apr (25)	May (11)	Jun (6)	Jul (24)	Aug (42)	Sep (13)	Oct (17)	Nov (13)	Dec (44)
2005	Jan (10)	Feb (16)	Mar (16)	Apr (23)	May (6)	Jun (19)	Jul (39)	Aug (15)	Sep (40)	Oct (49)	Nov (29)	Dec (41)
2006	Jan (28)	Feb (24)	Mar (52)	Apr (41)	May (31)	Jun (34)	Jul (22)	Aug (12)	Sep (11)	Oct (11)	Nov (11)	Dec (4)
2007	Jan (39)	Feb (13)	Mar (16)	Apr (24)	May (13)	Jun (12)	Jul (21)	Aug (61)	Sep (31)	Oct (13)	Nov (32)	Dec (15)
2008	Jan (7)	Feb (8)	Mar (14)	Apr (12)	May (23)	Jun (20)	Jul (9)	Aug (6)	Sep (2)	Oct (7)	Nov (3)	Dec (2)
2009	Jan (5)	Feb (8)	Mar (10)	Apr (22)	May (85)	Jun (82)	Jul (45)	Aug (28)	Sep (26)	Oct (50)	Nov (8)	Dec (16)
2010	Jan (3)	Feb (11)	Mar (39)	Apr (56)	May (80)	Jun (64)	Jul (49)	Aug (48)	Sep (16)	Oct (3)	Nov (5)	Dec (5)
2011	Jan (13)	Feb	Mar (1)	Apr (7)	May (7)	Jun (7)	Jul (7)	Aug (8)	Sep	Oct (6)	Nov (2)	Dec
2012	Jan (5)	Feb	Mar (3)	Apr (3)	May (4)	Jun (8)	Jul (1)	Aug (5)	Sep (10)	Oct (3)	Nov (2)	Dec (4)
2013	Jan (4)	Feb (2)	Mar (7)	Apr (7)	May (6)	Jun (7)	Jul (3)	Aug	Sep (1)	Oct	Nov	Dec
2014	Jan	Feb (2)	Mar (1)	Apr	May (3)	Jun (1)	Jul	Aug	Sep (1)	Oct (4)	Nov (2)	Dec (4)
2015	Jan (4)	Feb (2)	Mar (8)	Apr (7)	May (6)	Jun (7)	Jul (3)	Aug (1)	Sep (1)	Oct (4)	Nov (3)	Dec (4)
2016	Jan (4)	Feb (6)	Mar (9)	Apr (9)	May (6)	Jun (1)	Jul (1)	Aug	Sep	Oct (1)	Nov (1)	Dec (1)
2017	Jan	Feb (1)	Mar (3)	Apr (1)	May	Jun (1)	Jul (2)	Aug (3)	Sep (6)	Oct (3)	Nov (2)	Dec (5)
2018	Jan (3)	Feb (13)	Mar (28)	Apr (5)	May (4)	Jun (2)	Jul (2)	Aug (8)	Sep (2)	Oct (1)	Nov (5)	Dec (1)
2019	Jan (8)	Feb (1)	Mar	Apr (1)	May (4)	Jun	Jul (1)	Aug	Sep	Oct	Nov (2)	Dec (2)
2020	Jan	Feb	Mar (1)	Apr (1)	May (1)	Jun (2)	Jul (1)	Aug (1)	Sep (1)	Oct	Nov (1)	Dec (1)
2021	Jan (3)	Feb (2)	Mar (1)	Apr (1)	May (2)	Jun (1)	Jul (2)	Aug (1)	Sep	Oct	Nov	Dec
2022	Jan	Feb	Mar	Apr (1)	May (1)	Jun (1)	Jul	Aug (1)	Sep	Oct	Nov	Dec
2023	Jan (2)	Feb	Mar	Apr	May	Jun	Jul	Aug (1)	Sep	Oct	Nov	Dec
2024	Jan (2)	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec
2025	Jan	Feb	Mar	Apr	May	Jun (1)	Jul	Aug	Sep	Oct (1)	Nov	Dec

Flat | Threaded

<< < 1 .. 17 18 19 20 21 .. 99 > >> (Page 19 of 99)

[Htmlparser-user] Htmlparser does not parse <div> tag

From: Henry T. <htr...@ya...> - 2008-06-16 11:07:47

Hi All,
I am having difficulty parsing the following table using htmlparser table data filter statements:
<table border="0" cellpadding="0" cellspacing="0" width="782" id="main-content">
<tr>
<td valign="top" class="top"> 
<table border="0" cellpadding="0" cellspacing="0"> 
<tr>
<td valign="top" class="top"> 
<!-- un-delay results 14/10/2004 .................................. --->
<div class="greyBorder">
<table border="0" cellspacing="0" cellpadding="2" width="100%">
<tr>
<td class="propType">&nbsp;</td>
<td class="propType"><b>Patient</b></td>
<td class="propType"><b>Firstname</b></td>
<td class="propType"><b>Surname</b></td>
<td class="propType" align="right"><b>Date of birth</b></td>
<td class="propType">Sex</td>
</tr>
<tr class="smallnarrow">
<td class="even" width="10" align="left"></td>
<td class="even" style="vertical-align: middle;">Clinic</td> 
<td class="even" style="vertical-align: middle;">John</td> 
<td class="even" style="vertical-align: middle;">Smith</td> 
<td class="even" align="right" style="vertical-align: middle;">10/02/1940</td>
<td class="even" width="10" style="vertical-align: middle;">M</td>
</tr>
</table>
</div>
<div style="margin-top:10px;"> 
<br> <br>
<br>
</div>
<div align="center" style="margin-bottom: 20px;">
.........
</td></tr></table></td></tr></table>
The table data filter statements below pick up every lines shown above which is more than what I wanted:
(1) new AndFilter ( new TagNameFilter ("table"), 
(2) new AndFilter ( new HasAttributeFilter ("border","0"),
(3) new AndFilter ( new HasAttributeFilter ("cellspacing","0"),
(4) new AndFilter ( new HasAttributeFilter ("cellpadding"),
(5) new AndFilter ( new HasAttributeFilter ("width","782"),
(6) new AndFilter ( new HasAttributeFilter ("id","main-content"),
(7) new HasChildFilter ( new AndFilter ( new TagNameFilter ("tr"),
(8) new HasChildFilter ( new AndFilter ( new TagNameFilter ("td"),
(9) new HasChildFilter ( new AndFilter ( new TagNameFilter ("table"),
(10) new HasChildFilter ( new AndFilter ( new TagNameFilter ("tr"),
(11) new HasChildFilter ( new TagNameFilter ("td"),true)),true)),true)),true)),true)))))));
However, I would like to narrow down the parsing by extracting only the Patient table data in bold aboved. Nevertheless, the additional parsing statements below have not proven to be successful:
(1) new AndFilter ( new TagNameFilter ("table"), 
(2) new AndFilter ( new HasAttributeFilter ("border","0"),
(3) new AndFilter ( new HasAttributeFilter ("cellspacing","0"),
(4) new AndFilter ( new HasAttributeFilter ("cellpadding"),
(5) new AndFilter ( new HasAttributeFilter ("width","782"),
(6) new AndFilter ( new HasAttributeFilter ("id","main-content"),
(7) new HasChildFilter ( new AndFilter ( new TagNameFilter ("tr"),
(8) new HasChildFilter ( new AndFilter ( new TagNameFilter ("td"),
(9) new HasChildFilter ( new AndFilter ( new TagNameFilter ("table"),
(10) new HasChildFilter ( new AndFilter ( new TagNameFilter ("tr"),
(11) new HasChildFilter ( new AndFilter ( new TagNameFilter ("td"),
(12) new HasChildFilter ( new AndFilter ( new TagNameFilter ("div"),
(13) new HasAttributeFilter "class","greyBorder")),true)),true)),true)),true)),true)),true))))))); 
Line 12-13 searches for the <div> with attribute class=greyBorder but it did not pick up the Patient table at all. Any idea on where the last parsing statement went wrong? It appears that the htmlparser does not treat <div> as a nested tag around the Patient table.
Many thanks,
Henry


      Get the name you always wanted with the new y7mail email address.
www.yahoo7.com.au/mail

[Htmlparser-user] Fw: Does htmlparser Support cellpadding?

From: Henry T. <htr...@ya...> - 2008-06-11 23:36:22

Hi All,
Could anyone help out with this possible issue?
I still could not parse the cellpadding attribute.
Thanks,
Henry

----- Forwarded Message ----
From: Henry Tran <htr...@ya...>
To: Htm...@li...
Sent: Monday, 9 June, 2008 8:40:39 PM
Subject: Does htmlparser Support cellpadding?

Hi forum members,

I am having difficulty parsing the content of the table below due to what appears to be the HasAttributeFilter() class which could not recognise the "cellpadding" attribute:

        <table border="0" cellspacing="0" cellpadding="2" width="100%">

Here are the table data filters that I have tried without much luck:

(i) new AndFilter ( new TagNameFilter ("table"), new HasAttributeFilter("cellpadding","2"));
(ii) new AndFilter ( new TagNameFilter ("table"), new HasAttributeFilter("cellspacing","0"));
(iii) new AndFilter ( new TagNameFilter ("table"), 
          new AndFilter ( new HasAttributeFilter("cellspacing","0"), 
              new HasAttributeFilter("width","100%")));
(iv)  new AndFilter ( new TagNameFilter ("table"), 
                          new AndFilter ( new HasAttributeFilter("cellspacing","0"), 
                              new AndFilter ( new HasAttributeFilter("cellpadding","2"), 
                                  new HasAttributeFilter("width","100%"))));
Table data filters (i) & (iv) did not pick up anything while (ii) and (iii) worked but also include other tables that were not needed. Filter (iv) is perfect if only it would work. As a result, I would like to make the following queries on this issue:

(a) Does HasAttributeFilter() support cellpadding?
(b) Is there a limit on how many attribute HasAttributeFilter() could pick up in a table?
(c) Can HasAttributeFilter() pick up attributes in nested tables? This table is nested inside another table.
(d) Does the search for the attributes follow certain order? If so, it may mean that order of the HasAttributeFilter() may need to be alter to achieve the desire search.

Many thanks,
Henry
________________________________
Get the name you always wanted with the new y7mail email address.

      Get the name you always wanted with the new y7mail email address.
www.yahoo7.com.au/mail