From: SourceForge.net <no...@so...> - 2012-08-09 12:37:46
|
Bugs item #3549920, was opened at 2012-07-27 04:50 Message generated for change (Comment added) made by You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=952178&aid=3549920&group_id=195122 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: scanner Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: https://www.google.com/accounts () Assigned to: Nobody/Anonymous (nobody) Summary: Incorrect parsing from InputStream at closing script tag Initial Comment: If the InputStream splits an ending script tag as "</" and "script> the parser misses the end script tag and incorretcly parses the page. This bug is driving me mad because it only fails sometimes on the same exact page! As pointed by Janito Vaqueiro Ferreira the problem may be at: HTMLScanner.nextContent(int) but I am having a hard time understanding that code. I have developed a test case with an InputStream that exposes this problem. ---------------------------------------------------------------------- >Comment By: https://www.google.com/accounts () Date: 2012-08-09 05:37 Message: I have attached a potential fix for this issue. Could you try it? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=952178&aid=3549920&group_id=195122 |