Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv6056/src/org/htmlparser
Modified Files:
Parser.java
Log Message:
Fix bug #995703 Parser Crash and bug #988846 Linkbean getLinks() segmentation fault
by not testing for content type "text/XXX" in Page, but rather issuing a warning when this is
discovered by the Parser level.
Index: Parser.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/Parser.java,v
retrieving revision 1.96
retrieving revision 1.97
diff -C2 -d -r1.96 -r1.97
*** Parser.java 17 Jul 2004 13:45:05 -0000 1.96
--- Parser.java 29 Jul 2004 01:19:21 -0000 1.97
***************
*** 440,443 ****
--- 440,444 ----
{
NodeFactory factory;
+ String type;
if (null != lexer)
***************
*** 449,452 ****
--- 450,460 ----
lexer.setNodeFactory (factory);
mLexer = lexer;
+ // warn about content that's not likely text
+ type = mLexer.getPage ().getContentType ();
+ if (type != null && !type.startsWith ("text"))
+ getFeedback ().warning (
+ "URL "
+ + mLexer.getPage ().getUrl ()
+ + " does not contain text");
}
}
|