From: SourceForge.net <no...@so...> - 2010-04-22 08:09:24
Bugs item #2990554, was opened at 2010-04-21 19:09
Message generated for change (Comment added) made by nobody
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153

Please note that this message will contain a full copy of the comment
thread, including the initial issue submission, for this request,
not just the latest update.
Category: Tidy functionality
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
Assigned to: Nobody/Anonymous (nobody)
Summary: Tidy can wrap line in the middle of utf8 byte sequence.

Initial Comment:
I have used JTidy to clean up XML files. In some cases it produces files
with incorrect symbols (a 0x0 symbol in UTF-8 XML files).
I spent some time debugging: Tidy wraps lines by default, and the line
wrapping has a bug. Tidy can wrap a line in the middle of a UTF-8 byte
sequence.

----------------------------------------------------------------------

Comment By: Nobody/Anonymous (nobody)
Date: 2010-04-22 08:09

Message:
Here is a test file:

<root>
<a>1,1′-azobis</a>
</root>

My test settings:

    Tidy tidy = new Tidy();
    tidy.setXmlTags(true);
    tidy.setXmlOut(true);
    tidy.setFixBackslash(true);      // replace \ with / in urls
    tidy.setFixComments(true);       // detect malformed comments
    tidy.setHideComments(true);      // hide all comments
    tidy.setHideEndTags(true);       // prefer self-closing tag where possible
    tidy.setLowerLiterals(true);     // output lower-case attrib names
    tidy.setMakeBare(true);          // clean Microsoft cruft
    tidy.setNumEntities(true);       // prefer number entities to named ones
    tidy.setTidyMark(false);         // don't add meta tag giving tidy credit
    tidy.setOutputEncoding("UTF-8"); // force tidy to report encoding as utf-8 instead of another one
    tidy.setQuoteNbsp(true);         // if char 160 is found, output it as an entity
    tidy.setMakeClean(true);         // remove presentational clutter
    tidy.setDocType("omit");
    tidy.setWraplen(9);
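
For anyone looking at this, the kind of check that would avoid the problem can be sketched as below. This is not JTidy's code, and the report does not show where in JTidy the wrap position is chosen; the class Utf8WrapSketch and its methods are hypothetical names used only for illustration. It assumes the wrap position is an index into the already-encoded UTF-8 bytes, and that the character in the test file is U+2032 (encoded as the three bytes E2 80 B2). A UTF-8 continuation byte always has the bit pattern 10xxxxxx, so a proposed wrap index can be moved back until it no longer points at one.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of JTidy.
class Utf8WrapSketch {

    // True if b is a UTF-8 continuation byte (bit pattern 10xxxxxx).
    static boolean isContinuationByte(byte b) {
        return (b & 0xC0) == 0x80;
    }

    // Moves a proposed wrap index back until it points at the first byte
    // of a character, so a line break inserted before buf[pos] never
    // splits a multi-byte sequence. An index equal to buf.length is
    // returned unchanged.
    static int safeWrapIndex(byte[] buf, int pos) {
        while (pos > 0 && pos < buf.length && isContinuationByte(buf[pos])) {
            pos--;
        }
        return pos;
    }

    public static void main(String[] args) {
        // Assumption: the character in the test file is U+2032.
        byte[] bytes = "1,1\u2032-azobis".getBytes(StandardCharsets.UTF_8);
        for (int pos = 0; pos <= bytes.length; pos++) {
            System.out.println(pos + " -> " + safeWrapIndex(bytes, pos));
        }
    }
}
```

With the test string above, byte indexes 4 and 5 fall inside the three-byte sequence, so both are moved back to index 3; every other index is left where it is.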
----------------------------------------------------------------------

Comment By: Nobody/Anonymous (nobody)
Date: 2010-04-22 07:04

Message:
I will try to fix it today and upload a patch.

----------------------------------------------------------------------