#7 ArrayOutOfBounds exception w/ some HTML

Milestone: v0.1.2
Status: open
Labels: Lexer (4)
Priority: 5
Updated: 2002-04-08
Created: 2002-04-08
Creator: Ed Howland
Private: No

Got the following exception when using cvsgrab to
download jboss/zoap/src/default.dfPackage

Discussion

  • Nobody/Anonymous

    Logged In: NO

    I also get a similar error message when I test some URLs:

    caught exception parsing
    java.lang.ArrayIndexOutOfBoundsException null
    java.lang.ArrayIndexOutOfBoundsException
        at hotsax.html.sax.HtmlLexer.yylex(HtmlLexer.java:612)
        at hotsax.html.sax.HtmlLexer._yylex(HtmlLexer.java:227)
        at hotsax.html.sax.HtmlParser.yylex(HtmlParser.java:377)
        at hotsax.html.sax.HtmlParser.yyparse(HtmlParser.java:602)
        at hotsax.html.sax.SaxParser.parse(SaxParser.java:219)
        at hotsax.html.sax.SaxParser.parse(SaxParser.java:169)
        at HotSAXParser.main(HotSAXParser.java:26)

     
  • Anonymous - 2008-03-16

    Logged In: YES
    user_id=2037509
    Originator: NO

    In my case, the problem was the encoding of some characters in the HTML file I was parsing.
    It seems that HotSAX does not handle UTF-8 characters that are two bytes wide.
    I ran recode -d UTF-8..HTML on the file before passing it to HotSAX, and parsing then worked.
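
For anyone who cannot run recode, the same kind of preprocessing can be sketched in Java: decode the input as UTF-8 and replace every non-ASCII character with a numeric character reference, so that only 7-bit characters reach the lexer. This is only an illustration of the workaround described above. The class and method names (EntityEscaper, escapeNonAscii, escapeUtf8Bytes) are hypothetical and not part of HotSAX, recode emits named entities where this sketch emits numeric ones, and whether HotSAX accepts the result for every document is an assumption.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of HotSAX: rewrites non-ASCII text as
// numeric character references so the parser input is pure ASCII.
public class EntityEscaper {

    // Replaces every code point above 0x7F with "&#<decimal code point>;".
    public static String escapeNonAscii(String text) {
        StringBuilder out = new StringBuilder(text.length());
        text.codePoints().forEach(cp -> {
            if (cp > 0x7F) {
                out.append("&#").append(cp).append(';');
            } else {
                out.append((char) cp);
            }
        });
        return out.toString();
    }

    // Decodes raw bytes as UTF-8 first, then escapes the decoded text.
    public static String escapeUtf8Bytes(byte[] utf8) {
        return escapeNonAscii(new String(utf8, StandardCharsets.UTF_8));
    }
}
```

The escaped string can then be handed to the parser in place of the original file contents. ASCII markup such as tags and existing entities passes through unchanged.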

     
