lexer issue?

Brought to you by: derrickoswald

lexer issue?

Forum: Help

Creator: Hoang Long Nguyen

Created: 2010-10-28

Updated: 2013-04-27

Hoang Long Nguyen - 2010-10-28

Dear All,

I was trying to extract content of a simple html string, but it failed to do it.
this is the string :

"<p> this dd <blockquote>saf</blockquote> is strong </p>";

and i use TagNameFilter to extract the blockquote.

nl = parser.extractAllNodesThatMatch(new TagNameFilter("blockquote"));

            for(int i =0 ; i< nl.size(); i++)
            {
                Node tn = (Node)nl.elementAt(i);

                System.out.println("result1:" + tn.toPlainTextString());
                System.out.println("result2:" + tn.toHtml());
                System.out.println("result3:" + tn.getText());
                System.out.println("result4:" + tn.toString());
            }

i didn't receive the content text. I looked at the source code of lexer class and i guess the problem is around how the next node and returning virtual end tag achieved.

have you got this problem before? Or if you have some ideas of any possible workaround.

regards,

-Hoang Long

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Log in to post a comment.