[Htmlparser-user] Parsing Partial HTML text

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

I am having trouble parsing html tagged text. It seems that I can retrieve 
a node but that element does not have the child nodes as expected. 

       String table =
                "<tbody>\n" +
                "<tr>\n" +
                "<td><span>brain_normal_GSM80627</span></td>\n" +
                "<td><span>normal</span></td>\n" +
                "<td><span>cerebral cortex</span></td>\n" +
                "<td><span>brain</span></td>\n" +
                "</tr>\n" +
                "</tbody>\n";

        Parser parser = new Parser(new Lexer(table));
        try {
            Node tBodyNode = parser.extractAllNodesThatMatch(new 
TagNameFilter("tbody")).elementAt(0);
            System.out.println(tBodyNode.getChildren());  // Prints null 
<---------------
        } catch (ParserException e) {
            e.printStackTrace();  //To change body of catch statement use 
File | Settings | File Templates.
        }

Does HTML Parser not handle text input or partial html files well?

_________________________

CONFIDENTIALITY NOTICE

The information contained in this e-mail message is intended only for the 
exclusive use of the individual or entity named above and may contain 
information that is privileged, confidential or exempt from disclosure 
under applicable law. If the reader of this message is not the intended 
recipient, or the employee or agent responsible for delivery of the 
message to the intended recipient, you are hereby notified that any 
dissemination, distribution or copying of this communication is strictly 
prohibited. If you have received this communication in error, please 
notify the sender immediately by e-mail and delete the material from any 
computer.  Thank you.