[Htmlparser-user] Harvester
From: Mohd-Taqiyuddin Z. <mt...@ec...> - 2003-02-26 16:34:06
Hi,

I'm writing a harvester that collects information from form tags. It works fine on every HTML page I need to parse except for this URL: http://developer.java.sun.com/developer/Quizzes/misc/earlyadopterjxta.html

It seems that the page that gives the error does not have an end tag for the form tag, and the parser seems to loop back looking for it. Is this a bug? Do you know of a solution that would let me still parse the page and still get the FormInput Vector for further processing? I hope you can help me with this.

Below is the generated error:

ERROR: HTMLReader.readElement() : Error occurred while trying to decipher the tag using scanners
Tag being processed : FORM
Current Tag Line : <form action="earlyadopterjxtaanswers.jsp" method="POST">
at Line 690 : null
Previous Line 689 : </HTML>
ERROR: HTMLReader.readElement() : Error occurred while trying to read the next element,
at Line 690 : null
Previous Line 689 : </HTML>
ERROR: Unexpected Exception occurred while reading http://developer.java.sun.com/developer/Quizzes/misc/earlyadopterjxta.html, in nextHTMLNode
at Line 690 : null
Previous Line 689 : </HTML>
org.htmlparser.util.ParserException: Unexpected Exception occurred while reading http://developer.java.sun.com/developer/Quizzes/misc/earlyadopterjxta.html, in nextHTMLNode
at Line 690 : null
Previous Line 689 : </HTML>