Re: [Htmlparser-developer] parsing error of a html page


Ling,

The StringExtractor returns every StringNode, which includes the text
inside script blocks. If you don't want the script comments in the
output, try StringBean instead:

import org.htmlparser.beans.StringBean;

public class TryBeans
{
    public static void main (String[] args)
    {
        // Point the bean at the page and print the text it extracts.
        StringBean sb = new StringBean ();
        sb.setURL ("http://www.cnnfn.com/2001/11/29/companies/enron/");
        System.out.println (sb.getStrings ());
    }
}

See http://htmlparser.sourceforge.net/docs/index.php/JavaBeans for more 
details.
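
To show what "leaving out the script" amounts to, here is a rough sketch
that uses only the Java standard library. It is not how StringBean works
internally, and the names TextOnly and extract are made up for this
example. It removes script and style blocks, then comments, then any
remaining tags, all with regular expressions. That is good enough for a
quick experiment but much less robust than a real parser.

```java
import java.util.regex.Pattern;

final class TextOnly
{
    // (?is): ignore case, and let '.' match line breaks as well.
    private static final Pattern SCRIPT_OR_STYLE =
        Pattern.compile ("(?is)<(script|style)\\b[^>]*>.*?</\\1\\s*>");
    // Comments that sit outside script and style blocks.
    private static final Pattern COMMENT = Pattern.compile ("(?s)<!--.*?-->");
    // Whatever markup is left after the two passes above.
    private static final Pattern TAG = Pattern.compile ("<[^>]+>");

    /** Hypothetical helper: returns the visible text of an HTML fragment. */
    static String extract (String html)
    {
        String text = SCRIPT_OR_STYLE.matcher (html).replaceAll (" ");
        text = COMMENT.matcher (text).replaceAll (" ");
        text = TAG.matcher (text).replaceAll (" ");
        // Collapse the runs of whitespace left behind by the removals.
        return text.replaceAll ("\\s+", " ").trim ();
    }

    public static void main (String[] args)
    {
        // Invented sample input, loosely modelled on the reported output.
        String html = "<p>Enron news</p>"
            + "<script><!--\nadSetTarget('_top');\n//--></script>"
            + "<b>story text</b>";
        System.out.println (extract (html));
    }
}
```

Run as is, it prints the text of the sample fragment without the
adSetTarget line.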

Derrick

Mr LING MA wrote:

>When I try to use the htmlparser StringExtractor on the page:
>
>http://www.cnnfn.com/2001/11/29/companies/enron/
>
>the comment tags below are also output. Could this
>be an error in handling the style tag or the comment tag?
>
>Thanks
>
>Ling Ma
>
>OUTPUT after extracted tag:
><!--
>adSetTarget('_top');
>htmlAdWH( (new
><snip>