Re: [Htmlparser-user] trouble understanding parser data flow

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

Filter the full node list:
  NodeList nl = parser.parse (null);
  NodeList list = nl.extractAllNodesThatMatch (filter);
  NodeList list2 = nl.extractAllNodesThatMatch (filter2);

Antony Sequeira wrote:

>Hi
>
>My first task was to extract links from pages
>I looked at example and tried  the following
>
>                   NodeFilter filter = new NodeClassFilter (LinkTag.class);
>                   NodeList list = parser.extractAllNodesThatMatch (filter);
>                   log("links follow:");
>                   for (int i = 0; i < list.size (); i++)
>                       log (list.elementAt (i).toHtml ());
>
>this works fine if the partser was just constrcuted before running this code
>
>On the other hand, if I put the following code preceeding the code above
>                   NodeList nl = parser.parse (null);
>                   log(nl.asString());
>
>I get nothing for the links.
>
>How do I structure my code when I want to do multiple things while
>parsing a page.
>For example, I want to extract links, I want to extract forms and form
>fields, I want to extract text.
>
>-Antony Sequeira
>
>  
>