Re: [Htmlparser-user] Help on extracting clean body content from web page
Brought to you by:
derrickoswald
From: Derrick O. <der...@ro...> - 2007-11-13 11:58:12
|
You probably want the StringBean. The main() method of StringBean is an example of its use. ----- Original Message ---- From: cash cash <ca...@ya...> To: htm...@li... Sent: Tuesday, November 13, 2007 1:07:33 AM Subject: [Htmlparser-user] Help on extracting clean body content from web page Hi all, I am new to htmlparser. have download it and tried a few examples. However, i am having problem knowing the" correct way" to achieve my goal. I'm looking for a way to extract body content from web page, exclude all script sections. For example, using the following text <html> <head><title>title</title> <style> css style </style> </head> <body> Hello world <?php phpinfo() ?> </body> The correct code should only extract Hello world. Can any one help me on this? Thanks in advance. ____________________________________________________________________________________ Be a better sports nut! Let your teams follow you with Yahoo Mobile. Try it now. http://mobile.yahoo.com/sports;_ylt=At9_qDKvtAbMuh1G1SQtBI7ntAcJ ------------------------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ _______________________________________________ Htmlparser-user mailing list Htm...@li... https://lists.sourceforge.net/lists/listinfo/htmlparser-user |