Exception in HTMLParser 1.1 (HTMLTag.find)

Brought to you by: derrickoswald

Exception in HTMLParser 1.1 (HTMLTag.find)

Forum: htmlparser-user

Creator: sergejf

Created: 2002-04-16

Updated: 2003-01-03

sergejf - 2002-04-16

Hello,

I'm getting the following exception with HTMLParser 1.1 (build from CVS today):

java.lang.NullPointerException

    at com.kizna.html.tags.HTMLTag.find(HTMLTag.java:125)

    at com.kizna.html.HTMLReader.readElement(HTMLReader.java:143)

    at com.kizna.html.HTMLParser$1.hasMoreElements(HTMLParser.java:200)

    at CurlTest.clean(CurlTest.java:88)

    at CurlTest.comToServer(CurlTest.java:223)

    at CurlTest.main(CurlTest.java:166)

I'm trying to extract all the text elements from an HTML page (regular text and text in links) for the purpose of doing some Web scraping. The URL: https://www.santander.com.mx/SuperNetII/ctrlacceso/Login.jsp.

The exception happens somewhere after line 278 (containing the word "Seguridad") of the HTML source. I suspect it happens relatively near the end of the HTML source.

I'm not very familiar with SourceForge, hence if you need more data please let me know by e-mail to cto@simitel.com. Thanks.

Best Regards,

Serge

If you would like to refer to this comment somewhere else in this project, copy and paste the following link:
- Somik Raha - 2003-01-03
  
  Pls upgrade to HTMLParser 1.2. 1.1 is very old (over 8 months of work has gone into 1.2).
  
  Regards,
  Somik
  
  If you would like to refer to this comment somewhere else in this project, copy and paste the following link:

Log in to post a comment.