Hello,
I'm getting the following exception with HTMLParser 1.1 (built from CVS today):
java.lang.NullPointerException
at com.kizna.html.tags.HTMLTag.find(HTMLTag.java:125)
at com.kizna.html.HTMLReader.readElement(HTMLReader.java:143)
at com.kizna.html.HTMLParser$1.hasMoreElements(HTMLParser.java:200)
at CurlTest.clean(CurlTest.java:88)
at CurlTest.comToServer(CurlTest.java:223)
at CurlTest.main(CurlTest.java:166)
I'm trying to extract all the text elements from an HTML page (regular text and text in links) for the purpose of doing some Web scraping. The URL: https://www.santander.com.mx/SuperNetII/ctrlacceso/Login.jsp.
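For illustration only, the kind of extraction described above (collecting regular text and link text alike) can be sketched with the HTML parser bundled in the JDK rather than with HTMLParser itself. This is a minimal sketch, not HTMLParser's API: the class name TextExtractor and the method extractText are hypothetical, and the sample markup in main is made up for the demonstration, not taken from the page in question.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper: uses the JDK's Swing HTML parser, NOT com.kizna.html.
public class TextExtractor {

    /** Collects every non-blank text node (plain text and link text) from the HTML. */
    public static List<String> extractText(String html) throws IOException {
        final List<String> chunks = new ArrayList<>();

        // The parser invokes handleText once per run of character data it finds.
        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleText(char[] data, int pos) {
                String text = new String(data).trim();
                if (!text.isEmpty()) {
                    chunks.add(text);
                }
            }
        };

        try (Reader reader = new StringReader(html)) {
            // Third argument true: ignore charset declarations inside the document.
            new ParserDelegator().parse(reader, callback, true);
        }
        return chunks;
    }

    public static void main(String[] args) throws IOException {
        // Made-up sample markup, used only to exercise the helper.
        String html = "<html><body><p>Seguridad</p><a href=\"#\">Login</a></body></html>";
        for (String chunk : extractText(html)) {
            System.out.println(chunk);
        }
    }
}
```

Because the callback receives link text through the same handleText hook as ordinary text, no special handling of anchor tags is needed for this use case.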
The exception happens somewhere after line 278 (containing the word "Seguridad") of the HTML source. I suspect it happens relatively near the end of the HTML source.
I'm not very familiar with SourceForge, so if you need more data, please e-mail me at cto@simitel.com. Thanks.
Best Regards,
Serge
Please upgrade to HTMLParser 1.2. Version 1.1 is very old (over 8 months of work has gone into 1.2).
Regards,
Somik