Dear All,

I was trying to extract content of a simple html string, but it failed to do it.
this is the string :

"<p> this dd <blockquote>saf</blockquote> is strong </p>";

and i use TagNameFilter to extract the blockquote.

nl = parser.extractAllNodesThatMatch(new TagNameFilter("blockquote"));
           
            for(int i =0 ;  i< nl.size(); i++)
            {
                Node  tn = (Node)nl.elementAt(i);
            
                System.out.println("result1:" + tn.toPlainTextString());
                System.out.println("result2:" + tn.toHtml());
                System.out.println("result3:" + tn.getText());
                System.out.println("result4:" + tn.toString());
            }

i didn't receive the content text. I looked at the source code of lexer class and i guess  the problem is around how the next node and returning virtual end tag achieved.

have you got this problem before? Or if you have some ideas of any possible workaround.

regards,

-Hoang Long