User Activity

  • Modified a comment on ticket #221 on HtmlCleaner

    I just ran into this myself. I believe the behavior change occurred in https://sourceforge.net/p/htmlcleaner/code/521/ (released in 2.22). That change made it so that most HTML entities (e.g., "ã") are now decoded (which is good for me!), but some characters that previously were not encoded are now being encoded, such as & and >. Even though escapeXml=false is passed into the DomSerializer constructor, Utils.escapeXml() still ends up getting called because recognizeUnicodeChars=true. The name...

  • Posted a comment on ticket #221 on HtmlCleaner

    I just ran into this myself. I believe the behavior change occurred in https://sourceforge.net/p/htmlcleaner/code/521/ (released in 2.22). That change made it so that most HTML entities (e.g., "ã") are now decoded (which is good for me!), but some characters that previously were not encoded are now being encoded, such as & and >. Even though escapeXml=false is passed into the DomSerializer constructor, Utils.escapeXml() still ends up getting called because recognizeUnicodeChars=true. The name of...

  • Posted a comment on ticket #191 on HtmlCleaner

    Here's the HTML (once again eaten by sourceforge): <time><b><li>

  • Created ticket #191 on HtmlCleaner

    Infinite loop on <time><b><li>

  • Modified a comment on ticket #190 on HtmlCleaner

    Oops, sourceforge ate part of the markup. Here's the minimal test case: <html xmlns="x"><ul><a> And the stacktrace: java.lang.NullPointerException at org.htmlcleaner.HtmlCleaner.makeTree(HtmlCleaner.java:1097) at org.htmlcleaner.HtmlTokenizer.addToken(HtmlTokenizer.java:103) at org.htmlcleaner.HtmlTokenizer.tagStart(HtmlTokenizer.java:546) at org.htmlcleaner.HtmlTokenizer.start(HtmlTokenizer.java:480) at org.htmlcleaner.HtmlCleaner.clean(HtmlCleaner.java:461) at org.htmlcleaner.HtmlCleaner.clean...

  • Posted a comment on ticket #190 on HtmlCleaner

    Oops, sourceforge ate part of the markup. Here's the minimal test case: <html xmlns="x"><ul><a>

  • Created ticket #190 on HtmlCleaner

    NullPointerException in HtmlCleaner.makeTree

  • Created ticket #185 on HtmlCleaner

    Unclosed CDATA can cause ArrayIndexOutOfBoundsException

View All

Personal Data

Username:
mryan42
Joined:
2017-04-14 13:42:27

Projects

  • No projects to display.

Personal Tools

MongoDB Logo MongoDB