[Htmlparser-user] Extract Data from Table Row Question.

Hello all, and thanks for looking at my question.

I am still new to Java and HtmlParser. I have a series of web pages stored
offline that I need to process. The pages are made up of tables. I can find
the table tags and then all the table rows, but the next step is stumping
me: how do I read the TD values, or how do I check individual tags to see
whether there is more processing to do? (See the source example below.)

Many thanks for any help.

import org.htmlparser.Parser;
import org.htmlparser.filters.TagNameFilter;
import org.htmlparser.util.NodeList;
import org.htmlparser.util.ParserException;

// "HtmlTest" is a placeholder class name.
public class HtmlTest {

    public static void process(NodeList listx)
    {
        // Scan for "tr" tags and extract info
        NodeList tableList = listx.extractAllNodesThatMatch(new TagNameFilter("tr"));
        for (int x = 0; x < tableList.size(); x++)
        {
            // Process nodes or tags: this is what is stumping me.
            //
            // 1. How do I read every TD in the row, e.g. <TD class="listi">,
            //    and get its value?
            // 2. Or how do I get the individual tags for further processing?
        }
    }

    public static void main(String[] args) {
        try {
            Parser parser = new Parser("c:\\HtmlTest0002.htm");

            // Look for table tags
            NodeList list = parser.parse(new TagNameFilter("table"));
            for (int x = 0; x < list.size(); x++)
            {
                // Is it the right table?
                if (list.elementAt(x).toString().contains("listme"))
                {
                    // Get all children and process them
                    process(list.elementAt(x).getChildren());
                }
            }
        } catch (ParserException ex) {
            ex.printStackTrace();
        }
    }
}
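
One possible way to approach the two questions in the loop, as a minimal
sketch rather than a definitive answer. It assumes the HtmlParser jar is on
the classpath and that a TD's "value" means its plain text. The class name
TdReader, the method cellTexts and the sample HTML are hypothetical and only
there for illustration. For each row it takes the direct TD children, casts
each one to Tag so its attributes can be inspected, and keeps the text of
the cells whose class attribute matches.

```java
import java.util.ArrayList;
import java.util.List;

import org.htmlparser.Parser;
import org.htmlparser.Tag;
import org.htmlparser.filters.TagNameFilter;
import org.htmlparser.util.NodeList;
import org.htmlparser.util.ParserException;

// Hypothetical helper class, not part of HtmlParser.
public class TdReader {

    // Returns the trimmed plain text of every <td class="cssClass"> cell,
    // visiting rows in document order.
    static List<String> cellTexts(String html, String cssClass) {
        List<String> values = new ArrayList<>();
        try {
            Parser parser = Parser.createParser(html, "UTF-8");
            NodeList rows = parser.parse(new TagNameFilter("tr"));
            for (int r = 0; r < rows.size(); r++) {
                NodeList children = rows.elementAt(r).getChildren();
                if (children == null) {
                    continue; // empty row, nothing to read
                }
                // false = direct children only, so cells of nested tables
                // are not attributed to this row.
                NodeList cells =
                    children.extractAllNodesThatMatch(new TagNameFilter("td"), false);
                for (int c = 0; c < cells.size(); c++) {
                    // TagNameFilter only accepts tags, so the cast is safe.
                    Tag td = (Tag) cells.elementAt(c);
                    if (cssClass.equals(td.getAttribute("class"))) {
                        values.add(td.toPlainTextString().trim());
                    }
                }
            }
        } catch (ParserException ex) {
            throw new IllegalStateException("could not parse HTML", ex);
        }
        return values;
    }

    public static void main(String[] args) {
        // Made-up sample markup, loosely modelled on the post.
        String html = "<table class=\"listme\">"
            + "<tr><td class=\"listi\">Alpha</td><td>skip</td></tr>"
            + "<tr><td class=\"listi\"> Beta </td></tr>"
            + "</table>";
        System.out.println(cellTexts(html, "listi"));
    }
}
```

Inside process() the same inner loop could run on
tableList.elementAt(x).getChildren(). The Tag reference is also the hook for
the second question: td.getChildren() gives the nodes nested in a cell if
they need further processing.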