[Htmlparser-cvs] htmlparser/src/org/htmlparser Parser.java,1.96,1.97
From: Derrick O. <der...@us...> - 2004-07-29 01:19:30
Update of /cvsroot/htmlparser/htmlparser/src/org/htmlparser
In directory sc8-pr-cvs1.sourceforge.net:/tmp/cvs-serv6056/src/org/htmlparser

Modified Files:
    Parser.java

Log Message:
Fix bug #995703 Parser Crash
and bug #988846 Linkbean getLinks() segmentation fault
by not testing for content type "text/XXX" in Page, but rather issuing
a warning when this is discovered by the Parser level.

Index: Parser.java
===================================================================
RCS file: /cvsroot/htmlparser/htmlparser/src/org/htmlparser/Parser.java,v
retrieving revision 1.96
retrieving revision 1.97
diff -C2 -d -r1.96 -r1.97
*** Parser.java 17 Jul 2004 13:45:05 -0000 1.96
--- Parser.java 29 Jul 2004 01:19:21 -0000 1.97
***************
*** 440,443 ****
--- 440,444 ----
      {
          NodeFactory factory;
+         String type;
  
          if (null != lexer)
***************
*** 449,452 ****
--- 450,460 ----
              lexer.setNodeFactory (factory);
              mLexer = lexer;
+             // warn about content that's not likely text
+             type = mLexer.getPage ().getContentType ();
+             if (type != null && !type.startsWith ("text"))
+                 getFeedback ().warning (
+                     "URL "
+                     + mLexer.getPage ().getUrl ()
+                     + " does not contain text");
          }
      }
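
For readers who want to try the check outside the parser, here is a minimal standalone sketch of the same test. It does not use the htmlparser classes: the class name ContentTypeWarning, the Feedback interface, and the helper methods are hypothetical stand-ins for illustration, and only the condition and the warning text are taken from the diff above.

```java
import java.util.ArrayList;
import java.util.List;

public class ContentTypeWarning {

    /** Hypothetical stand-in for the parser's feedback object. */
    public interface Feedback {
        void warning(String message);
    }

    /**
     * Same condition as in the diff: a missing (null) content type is
     * tolerated, and anything that does not start with "text" is flagged.
     */
    public static boolean isSuspicious(String contentType) {
        return contentType != null && !contentType.startsWith("text");
    }

    /** Warn instead of failing, so the caller can still attempt a parse. */
    public static void warnIfNotText(String url, String contentType, Feedback feedback) {
        if (isSuspicious(contentType)) {
            feedback.warning("URL " + url + " does not contain text");
        }
    }

    public static void main(String[] args) {
        List<String> warnings = new ArrayList<>();
        // Example URLs and content types are made up for the demonstration.
        warnIfNotText("http://example.com/index.html", "text/html; charset=UTF-8", warnings::add);
        warnIfNotText("http://example.com/logo.png", "image/png", warnings::add);
        warnIfNotText("http://example.com/unknown", null, warnings::add);
        for (String w : warnings) {
            System.out.println(w);
        }
        // prints: URL http://example.com/logo.png does not contain text
    }
}
```

Because the check only reports a warning, a non-text or unreported content type no longer stops processing; what to do with the content is left to the caller.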