Thread: Re: [Htmlparser-user] Filter Help
From: Derrick O. <Der...@Ro...> - 2005-12-21 13:15:06
Attachments:
ListItems.java
|
Having the URL makes it easy...
It needs to ignore lists with class or id attributes. I used a NotFilter
containing an OrFilter with the two HasAttributeFilters.
I've added this to the example.

Srinivas Vemula wrote:
> Thanks for the help Derrick. It is not able to filter the Features part
> of it, and I am ending up getting all the <UL> tags in the web page.
> I am attaching the web page, and here is the URL
>
> http://www.avagotech.com/products/product-detail.jsp?navId=H0,C2,C5212,C5255,P88706
>
> Thanks for your time and help on this
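The filter combination described in this message (a NotFilter wrapping an OrFilter of two HasAttributeFilters) boils down to the boolean rule "not (has class or has id)". The sketch below shows only that logic using java.util.function.Predicate. It is not the HTMLParser API and not the attached ListItems.java: SimpleTag, PlainListFilter and the extra check on the UL tag name are hypothetical stand-ins added for illustration.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;

/** Hypothetical stand-in for a parsed tag; not an HTMLParser class. */
class SimpleTag {
    final String name;
    final Map<String, String> attributes = new HashMap<>();
    final List<SimpleTag> children = new ArrayList<>();

    SimpleTag(String name) { this.name = name; }

    SimpleTag attr(String key, String value) { attributes.put(key, value); return this; }

    SimpleTag add(SimpleTag child) { children.add(child); return this; }
}

class PlainListFilter {
    /** Accepts tags with the given name, ignoring case. */
    static Predicate<SimpleTag> tagName(String name) {
        return tag -> tag.name.equalsIgnoreCase(name);
    }

    /** Accepts tags that carry the given attribute, whatever its value. */
    static Predicate<SimpleTag> hasAttribute(String key) {
        return tag -> tag.attributes.containsKey(key);
    }

    /** UL AND NOT (class OR id). The UL check is an assumption of this sketch. */
    static Predicate<SimpleTag> plainList() {
        Predicate<SimpleTag> classOrId = hasAttribute("class").or(hasAttribute("id"));
        return tagName("UL").and(classOrId.negate());
    }

    /** Depth-first collection of every tag the filter accepts. */
    static List<SimpleTag> collect(SimpleTag root, Predicate<SimpleTag> filter) {
        List<SimpleTag> out = new ArrayList<>();
        if (filter.test(root)) out.add(root);
        for (SimpleTag child : root.children) out.addAll(collect(child, filter));
        return out;
    }
}
```

With a page holding one UL with a class, one with an id, and one with neither, only the last should be collected, which is the behaviour the message asks for.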
|
From: Srinivas V. <sv...@al...> - 2005-12-21 13:21:14
|
Thank you very much. Somehow I am not able to use the tool to create that perfect filter, and I am sure it's just me.

http://www.avagotech.com/products/product-detail.jsp?navId=H0,C2,C5213,C5249,P88749

Could you please modify the filter so that it catches all the <UL> tags for the "Features" block? The URL above is an example: this product has sub-lists for a particular list item in the main features list.

Thanks a lot
|
From: Derrick O. <Der...@Ro...> - 2005-12-21 13:33:49
|
If you are talking about the options at the bottom:
<li>Options available are:</ul>500 = Tape and Reel Packaging,
850<br>XXXE = Lead Free Option
they aren't part of a list.
You're probably better off post-processing the list you get to find all
its siblings:
NodeList lists = ....apply the filter to the page;
BulletList list = (BulletList)lists.elementAt (0);
NodeList siblings = list.getParent ().getChildren ();
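The idea in the three lines above is to step from the matched list up to its parent and then look at the parent's children, so that loose text following the closing </ul> can be picked up. A minimal, library-free sketch of that walk is shown below. TreeNode and SiblingWalk are hypothetical names that merely model a parent/children tree; they are not HTMLParser classes, and the choice to return only the nodes after the list is an assumption of this sketch.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical minimal tree node; stands in for a parsed HTML node. */
class TreeNode {
    final String text;
    TreeNode parent;
    final List<TreeNode> children = new ArrayList<>();

    TreeNode(String text) { this.text = text; }

    /** Appends a child and records this node as its parent. */
    TreeNode add(TreeNode child) {
        child.parent = this;
        children.add(child);
        return this;
    }
}

class SiblingWalk {
    /** Returns the nodes that follow the given node under the same parent. */
    static List<TreeNode> followingSiblings(TreeNode node) {
        List<TreeNode> result = new ArrayList<>();
        if (node.parent == null) return result;        // a root has no siblings
        List<TreeNode> all = node.parent.children;     // analogue of getParent().getChildren()
        int index = all.indexOf(node);                 // identity match, equals() is not overridden
        for (int i = index + 1; i < all.size(); i++) result.add(all.get(i));
        return result;
    }
}
```

For a parent whose children are the list, the text "500 = Tape and Reel Packaging, 850", a BR tag and the text "XXXE = Lead Free Option", calling followingSiblings on the list node would return the last three nodes in document order.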
>-------------------------------------------------------
>This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
>for problems? Stop! Download the new AJAX search engine that makes
>searching your log files as easy as surfing the web. DOWNLOAD SPLUNK!
>http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
>_______________________________________________
>Htmlparser-user mailing list
>Htm...@li...
>https://lists.sourceforge.net/lists/listinfo/htmlparser-user
|
|
From: Srinivas V. <sv...@al...> - 2005-12-21 13:28:21
|
One more thing, Derrick. It should only catch the sub-lists, if there are any. Otherwise it should behave as before.
|
From: Srinivas V. <sv...@al...> - 2005-12-21 13:37:37
|
Yes, you are right... But the current filter only returns up to </UL> for
the block, and "500 = Tape and Reel Packaging, 850<br>XXXE = Lead Free
Option" is just text after the list. I am not able to get to that part.
|
|
From: Srinivas V. <sv...@al...> - 2005-12-21 13:45:32
|
Thanks Derrick,
I got it now. That worked...
Srini
P.S.: I really have to get a handle on using these filters...
|