From: SourceForge.net <no...@so...> - 2010-07-31 16:30:18
|
Bugs item #3036734, was opened at 2010-07-30 03:08 Message generated for change (Comment added) made by aditsu You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3036734&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None >Status: Closed >Resolution: Accepted Priority: 5 Private: No Submitted By: Johannes Weberhofer (weberho) >Assigned to: Adrian Sandor (aditsu) Summary: EncodingNameMapperTest.java.patch Initial Comment: Correct value for WINDOWS-31J; make testcase working again ---------------------------------------------------------------------- >Comment By: Adrian Sandor (aditsu) Date: 2010-08-01 00:30 Message: Applied, thanks ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3036734&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-07-30 22:39:49
|
Feature Requests item #2933753, was opened at 2010-01-17 19:02 Message generated for change (Comment added) made by aditsu You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=363153&aid=2933753&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Priority: 5 Private: No Submitted By: Daniel Matuschek (matuschd) Assigned to: Nobody/Anonymous (nobody) Summary: optgroup parser should stop optgroup parsing on unknown tags Initial Comment: Todays implementation processed only <option> tags inside an optgroup. If the optgroup is not closed correctly in the HTML code, all tags after this will be ignored. Browsers seems to close the optgroup if there are other tags then <option>. It seems to be a better idea to close the optgroup if an unknown tag was found instead of ignoring these tags. This should give better results on buggy HTML code. ---------------------------------------------------------------------- >Comment By: Adrian Sandor (aditsu) Date: 2010-07-31 06:39 Message: Can you provide a test case? How does Tidy (the C program) handle it? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=363153&aid=2933753&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-07-30 22:35:30
|
Feature Requests item #1780883, was opened at 2007-08-24 17:44 Message generated for change (Comment added) made by aditsu You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=363153&aid=1780883&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Priority: 5 Private: No Submitted By: JCopistAdmin (jcopistadmin) Assigned to: Nobody/Anonymous (nobody) Summary: Release new version in Maven Central Repo Initial Comment: Hi there ! We currently use Jtidy in our project running with maven2. As a result, we would need a new version to be uploaded to Maven central repository. The last version found there seems really old : http://repo1.maven.org/maven2/jtidy/jtidy/4aug2000r7-dev/. We have noticed that you provide a snapshot repository (http://jtidy.sourceforge.net/snapshots). However, we would need to depend on a fixed version better than on a snapshot. Last version there is from august 2006. Could you publish this one as a new version and provide it in Maven central repo ? ---------------------------------------------------------------------- >Comment By: Adrian Sandor (aditsu) Date: 2010-07-31 06:35 Message: Copying my answer from https://sourceforge.net/projects/jtidy/forums/forum/41437/topic/3673061 : Hi, I'm the current JTidy maintainer. I joined the project last year, after noticing that it was almost abandoned. This feature request is much older. I don't use maven at all, I don't know what needs to be done and I'd rather not bother doing it. But if it is useful to you and you know how to release it to whatever repository you need, then just go ahead. Let me know what you need and I will assist you. ---------------------------------------------------------------------- Comment By: dstrasse (dstrasse) Date: 2010-07-01 18:19 Message: Hi there, is there any progress? 
It would be very nice to provide jtidy via maven central for us, too. Kind regards ---------------------------------------------------------------------- Comment By: Tjeerd Verhagen (verhagent) Date: 2010-04-12 18:36 Message: Would indeed be nice to see the artifact appear soon on the public Maven repository. From what I understand, from the http://maven.apache.org/guides/mini/guide-central-repository-upload.html the group id should change to the domain name, that JTidy owns. Which mean it should be updated to: <groupId>net.sourceforge.jtidy</groupId> I'm the project lead of http://docbook-utils.sourceforge.net/maven-tidy-plugin_1.0/docbook/article-project-overview.html, this Maven Plug-in depends on JTidy release r938, which can not be resolved through a central Maven repository. Maybe the JTidy release manager should have a look into setting up a Sonatype Forge account, so your fixed release get through that repo uploaded in the central Maven repository. And also a SNAPSHOT repository, will be available there. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=363153&aid=1780883&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-07-30 22:03:05
|
Bugs item #3025841, was opened at 2010-07-06 21:22
Message generated for change (Comment added) made by aditsu
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3025841&group_id=13153

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

Category: Tidy functionality
Group: None
>Status: Closed
>Resolution: Invalid
Priority: 5
Private: No
Submitted By: Dirk Przybylla (dpr42)
>Assigned to: Adrian Sandor (aditsu)
Summary: unescaped & or unknown entity in URL

Initial Comment:
Hello,

i implemented some links
The eclipsetidy version 1.2.2 marks the following line after a check

<a href="http://cds.sun.com/is-bin/INTERSHOP.enfinity/WFS/CDS-CDS_Developer-Site/en_US/-/USD/VerifyItem-Start/jre-6u20-windows-i586.exe?BundledLineItemUUID=2RKJ_hCyQ_oAAAEpg7o9a6NS&OrderID=pPSJ_hCyy5UAAAEpd7o9a6NS&ProductID=MUNIBe.pkFEAAAEns3oZAPYl&FileName=/jre-6u20-windows-i586.exe">Java Runzime Environment ab Version 1.5"</a>

with

Multiple annotations found at this line:
- unescaped & or unknown entity "&ProductID"
- column 222
- unescaped & or unknown entity "&OrderID"
- column 189
- unescaped & or unknown entity "&FileName"
- column 257

However, in this case the ampersand is the separator of the arguments of an Url and not an unescaped & or an unknown entity

With best regards
Dirk

----------------------------------------------------------------------

>Comment By: Adrian Sandor (aditsu)
Date: 2010-07-31 06:03

Message:
First of all, this is not the eclipsetidy project.
However, your problem is quite clear: ampersands in urls need to be escaped in html pages, see http://htmlhelp.com/tools/validator/problems.html#amp

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3025841&group_id=13153
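Editor's note: the maintainer's point is that a raw URL must be HTML-escaped before it is placed in an href attribute, so each `&` separating query parameters becomes `&amp;`. A minimal sketch of that step follows. The class name `HrefEscaper` and the shortened example URL are hypothetical, not part of JTidy, and the sketch assumes its input is a raw URL that has not been HTML-escaped already (escaping twice would produce `&amp;amp;`).

```java
/** Hypothetical helper (not part of JTidy): escapes a raw URL for use in an HTML attribute. */
final class HrefEscaper {

    private HrefEscaper() {
    }

    /** Escapes the characters that are special inside a double-quoted HTML attribute value. */
    static String escapeAttribute(String rawUrl) {
        StringBuilder out = new StringBuilder(rawUrl.length() + 16);
        for (int i = 0; i < rawUrl.length(); i++) {
            char c = rawUrl.charAt(i);
            switch (c) {
                case '&':
                    out.append("&amp;");
                    break;
                case '"':
                    out.append("&quot;");
                    break;
                case '<':
                    out.append("&lt;");
                    break;
                case '>':
                    out.append("&gt;");
                    break;
                default:
                    out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Shortened, made-up URL with the same parameter names as in the report.
        String raw = "http://example.com/get?OrderID=1&ProductID=2&FileName=/jre.exe";
        System.out.println("<a href=\"" + escapeAttribute(raw) + "\">download</a>");
    }
}
```

A browser decodes `&amp;` back to `&` before following the link, so the server still receives the original query string.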
From: SourceForge.net <no...@so...> - 2010-07-30 21:52:24
|
Bugs item #3022414, was opened at 2010-06-29 00:13
Message generated for change (Comment added) made by aditsu
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3022414&group_id=13153

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

Category: DOM Support
Group: None
>Status: Closed
>Resolution: Fixed
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
>Assigned to: Adrian Sandor (aditsu)
Summary: getNodeValue() returning "value" for empty string

Initial Comment:
The fix for bug 2922337 made in revision 1094 looks to have introduced a problem parsing/retrieving empty string values. After parsing, the getNodeValue() method seems to return a value of "value" instead of empty string.

For example,

StringReader inReader = new StringReader(strHTML);
Tidy tidy = new Tidy();
tidy.setShowWarnings(false);
Document doc = tidy.parseDOM(inReader, null);

Where strHTML contains...

<input type="text" name="TEXT1" id="TEXT1" value="" maxlength="255" class="ltext" />

Using the getNodeValue() on the input node within the returned DOM returns "value" rather than ""

node.getAttributes().getNamedItem("value").getNodeValue()

Changing line 658 of the getString() function within TidyUtils.java to return an empty string rather than null seems to prevent the problem:

return length == 0 ? "" : new String(bytes, offset, Math.min(length, bytes.length - offset), "UTF8");

Seen in revision 1094 onwards. Revision 938 does not have this problem.

----------------------------------------------------------------------

>Comment By: Adrian Sandor (aditsu)
Date: 2010-07-31 05:52

Message:
The fix for bug 2922337 was actually the Math.min part; I added the "return null for empty" part for full compatibility with Tidy, but that seems to lose the distinction between empty attribute values and attributes with no value (for which the implicit value is the attribute name).
I checked the "null for empty" change and it doesn't seem to affect anything else (except some messages in certain cases) so I changed it back now.
Thanks for the report!

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3022414&group_id=13153
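Editor's note: the distinction the maintainer describes (an attribute written as `value=""` versus an attribute with no value at all) can be illustrated with a small standalone decoder. This is only a sketch modelled on the one-line change quoted in the report. `AttrValueDecoder` is a hypothetical name, it is not the actual `TidyUtils.getString()`, and treating a `null` buffer as "no value" is an assumption made for the illustration. It also assumes a non-negative offset.

```java
import java.nio.charset.StandardCharsets;

/** Hypothetical helper (not JTidy code): decodes an attribute value from a UTF-8 byte buffer. */
final class AttrValueDecoder {

    private AttrValueDecoder() {
    }

    /**
     * Returns null when there is no buffer at all (assumed here to mean "attribute has no value"),
     * an empty string for an explicitly empty value, and the decoded text otherwise.
     */
    static String decode(byte[] bytes, int offset, int length) {
        if (bytes == null) {
            return null; // e.g. <input disabled>: the caller may substitute the attribute name
        }
        if (length <= 0 || offset >= bytes.length) {
            return ""; // e.g. value="": explicitly empty, must not collapse to null
        }
        // Clamp the length so a too-large request cannot run past the end of the buffer.
        int safeLength = Math.min(length, bytes.length - offset);
        return new String(bytes, offset, safeLength, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        byte[] buffer = "TEXT1".getBytes(StandardCharsets.UTF_8);
        System.out.println("[" + decode(buffer, 0, 0) + "]");
        System.out.println("[" + decode(buffer, 0, 5) + "]");
        System.out.println(decode(null, 0, 0));
    }
}
```

Keeping `""` and `null` as separate results is what lets a DOM layer report an empty string for `value=""` while still falling back to the attribute name for valueless attributes.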
From: SourceForge.net <no...@so...> - 2010-07-30 20:59:04
|
Bugs item #2977242, was opened at 2010-03-27 03:47 Message generated for change (Comment added) made by aditsu You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2977242&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: Tidy functionality Group: None >Status: Closed Resolution: None Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Nobody/Anonymous (nobody) Summary: Not correctly Initial Comment: Use this INPUT for jtidy : http://www.senado.gov.ar/web/comisiones/listado.php it works in every browser but jtidy is destroying it. ---------------------------------------------------------------------- >Comment By: Adrian Sandor (aditsu) Date: 2010-07-31 04:59 Message: Closing due to lack of feedback ---------------------------------------------------------------------- Comment By: Adrian Sandor (aditsu) Date: 2010-04-03 20:40 Message: Could you explain what the problem is? How is it destroying it? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2977242&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-07-30 08:50:35
|
Bugs item #3037011, was opened at 2010-07-30 10:50 Message generated for change (Tracker Item Submitted) made by weberho You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3037011&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Johannes Weberhofer (weberho) Assigned to: Nobody/Anonymous (nobody) Summary: Patch: JTidyTask.java.patch Initial Comment: This patch fixes the check for a valid properies-file ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3037011&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-07-29 19:11:13
|
Bugs item #3036736, was opened at 2010-07-29 21:11 Message generated for change (Tracker Item Submitted) made by weberho You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3036736&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Johannes Weberhofer (weberho) Assigned to: Nobody/Anonymous (nobody) Summary: Patch: Force tests to run always with english locale Initial Comment: This patch forces usage of english error messages to match the expected english test-cases ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3036736&group_id=13153 |
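Editor's note: the patch itself is not included in this message, so the following is only a sketch of one common way to pin the locale while a test body runs, using the standard `java.util.Locale` API. The class name `EnglishLocale` is hypothetical and this is not the submitted patch.

```java
import java.util.Locale;
import java.util.function.Supplier;

/** Hypothetical test helper: runs a block with the JVM default locale set to English. */
final class EnglishLocale {

    private EnglishLocale() {
    }

    /** Switches the default locale to English, runs the body, then restores the previous locale. */
    static <T> T call(Supplier<T> body) {
        Locale saved = Locale.getDefault();
        Locale.setDefault(Locale.ENGLISH);
        try {
            return body.get();
        } finally {
            // Restore even if the body throws, so later tests see the original locale.
            Locale.setDefault(saved);
        }
    }

    public static void main(String[] args) {
        String language = call(() -> Locale.getDefault().getLanguage());
        System.out.println(language);
    }
}
```

Because `Locale.setDefault` changes JVM-wide state, a helper like this is only safe when tests do not run in parallel within the same JVM.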
From: SourceForge.net <no...@so...> - 2010-07-29 19:08:30
|
Bugs item #3036734, was opened at 2010-07-29 21:08 Message generated for change (Tracker Item Submitted) made by weberho You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3036734&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Johannes Weberhofer (weberho) Assigned to: Nobody/Anonymous (nobody) Summary: EncodingNameMapperTest.java.patch Initial Comment: Correct value for WINDOWS-31J; make testcase working again ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3036734&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-07-06 13:22:44
|
Bugs item #3025841, was opened at 2010-07-06 15:22 Message generated for change (Tracker Item Submitted) made by dpr42 You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3025841&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: Tidy functionality Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Dirk Przybylla (dpr42) Assigned to: Nobody/Anonymous (nobody) Summary: unescaped & or unknown entity in URL Initial Comment: Hello, i implemented some links The eclipsetidy version 1.2.2 marks the following line after a check <a href="http://cds.sun.com/is-bin/INTERSHOP.enfinity/WFS/CDS-CDS_Developer-Site/en_US/-/USD/VerifyItem-Start/jre-6u20-windows-i586.exe?BundledLineItemUUID=2RKJ_hCyQ_oAAAEpg7o9a6NS&OrderID=pPSJ_hCyy5UAAAEpd7o9a6NS&ProductID=MUNIBe.pkFEAAAEns3oZAPYl&FileName=/jre-6u20-windows-i586.exe">Java Runzime Environment ab Version 1.5"</a> with Multiple annotations found at this line: - unescaped & or unknown entity "&ProductID" - column 222 - unescaped & or unknown entity "&OrderID" - column 189 - unescaped & or unknown entity "&FileName" - column 257 However, in this case the ampersand is the separator of the arguments of an Url and not an unescaped & or an unknown entity With best regards Dirk ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3025841&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-07-01 10:19:33
|
Feature Requests item #1780883, was opened at 2007-08-24 11:44 Message generated for change (Comment added) made by dstrasse You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=363153&aid=1780883&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Priority: 5 Private: No Submitted By: JCopistAdmin (jcopistadmin) Assigned to: Nobody/Anonymous (nobody) Summary: Release new version in Maven Central Repo Initial Comment: Hi there ! We currently use Jtidy in our project running with maven2. As a result, we would need a new version to be uploaded to Maven central repository. The last version found there seems really old : http://repo1.maven.org/maven2/jtidy/jtidy/4aug2000r7-dev/. We have noticed that you provide a snapshot repository (http://jtidy.sourceforge.net/snapshots). However, we would need to depend on a fixed version better than on a snapshot. Last version there is from august 2006. Could you publish this one as a new version and provide it in Maven central repo ? ---------------------------------------------------------------------- Comment By: dstrasse (dstrasse) Date: 2010-07-01 12:19 Message: Hi there, is there any progress? It would be very nice to provide jtidy via maven central for us, too. Kind regards ---------------------------------------------------------------------- Comment By: Tjeerd Verhagen (verhagent) Date: 2010-04-12 12:36 Message: Would indeed be nice to see the artifact appear soon on the public Maven repository. From what I understand, from the http://maven.apache.org/guides/mini/guide-central-repository-upload.html the group id should change to the domain name, that JTidy owns. 
Which mean it should be updated to: <groupId>net.sourceforge.jtidy</groupId> I'm the project lead of http://docbook-utils.sourceforge.net/maven-tidy-plugin_1.0/docbook/article-project-overview.html, this Maven Plug-in depends on JTidy release r938, which can not be resolved through a central Maven repository. Maybe the JTidy release manager should have a look into setting up a Sonatype Forge account, so your fixed release get through that repo uploaded in the central Maven repository. And also a SNAPSHOT repository, will be available there. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=363153&aid=1780883&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-06-28 16:13:01
|
Bugs item #3022414, was opened at 2010-06-28 16:13 Message generated for change (Tracker Item Submitted) made by nobody You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3022414&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: DOM Support Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Nobody/Anonymous (nobody) Summary: getNodeValue() returning "value" for empty string Initial Comment: The fix for bug 2922337 made in revision 1094 looks to have introduced a problem parsing/retrieving empty string values. After parsing, the getNodeValue() method seems to return a value of "value" instead of empty string. For example, StringReader inReader = new StringReader(strHTML); Tidy tidy = new Tidy(); tidy.setShowWarnings(false); Document doc = tidy.parseDOM(inReader, null); Where strHTML contains... <input type="text" name="TEXT1" id="TEXT1" value="" maxlength="255" class="ltext" /> Using the getNodeValue() on the input node within the returned DOM returns "value" rather than "" node.getAttributes().getNamedItem("value").getNodeValue() Changing line 658 of the getString() function within TidyUtils.java to return an emptry string rather than null seems to prevent the problem: return length == 0 ? "" : new String(bytes, offset, Math.min(length, bytes.length - offset), "UTF8"); Seen in revision 1094 onwards. Revision 938 does not have this problem. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=3022414&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-05-05 12:09:32
|
Bugs item #2996942, was opened at 2010-05-05 14:43
Message generated for change (Settings changed) made by aditsu
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2996942&group_id=13153

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

>Category: Tidy functionality
Group: None
Status: Closed
>Resolution: Works For Me
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
>Assigned to: Adrian Sandor (aditsu)
Summary: Encoding bug

Initial Comment:
"Grün" is not encoded properly even with output and input encoding set to utf-8. see attached file.

----------------------------------------------------------------------

Comment By: Adrian Sandor (aditsu)
Date: 2010-05-05 20:08

Message:
After further discussion with the submitter, we determined that:
- the attached file is not the original html
- the following test program works correctly:

import java.net.URL;
import org.w3c.tidy.Tidy;

public class JTest {
    public static void main(String[] args) throws Exception {
        final Tidy tidy = new Tidy();
        final URL url = new URL("http://www.autoscout24.de/Details.aspx?id=bheebqbxn4eb");
        tidy.setInputEncoding("UTF-8");
        tidy.setOutputEncoding("UTF-8");
        tidy.parse(url.openStream(), System.out);
    }
}

Therefore closing this bug now.

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2996942&group_id=13153
From: SourceForge.net <no...@so...> - 2010-05-05 12:08:26
|
Bugs item #2996942, was opened at 2010-05-05 14:43
Message generated for change (Comment added) made by aditsu
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2996942&group_id=13153

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

>Category: None
Group: None
>Status: Closed
Resolution: None
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
>Assigned to: Nobody/Anonymous (nobody)
Summary: Encoding bug

Initial Comment:
"Grün" is not encoded properly even with output and input encoding set to utf-8. see attached file.

----------------------------------------------------------------------

>Comment By: Adrian Sandor (aditsu)
Date: 2010-05-05 20:08

Message:
After further discussion with the submitter, we determined that:
- the attached file is not the original html
- the following test program works correctly:

import java.net.URL;
import org.w3c.tidy.Tidy;

public class JTest {
    public static void main(String[] args) throws Exception {
        final Tidy tidy = new Tidy();
        final URL url = new URL("http://www.autoscout24.de/Details.aspx?id=bheebqbxn4eb");
        tidy.setInputEncoding("UTF-8");
        tidy.setOutputEncoding("UTF-8");
        tidy.parse(url.openStream(), System.out);
    }
}

Therefore closing this bug now.

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2996942&group_id=13153
From: SourceForge.net <no...@so...> - 2010-04-24 07:21:49
|
Bugs item #2990554, was opened at 2010-04-22 03:09
Message generated for change (Comment added) made by aditsu
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153

Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update.

Category: Tidy functionality
Group: None
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Nobody/Anonymous (nobody)
>Assigned to: Adrian Sandor (aditsu)
Summary: Tidy can wrap line in the middle of utf8 byte sequence.

Initial Comment:
I have used jtidy to clean up xml files. In some cases it produce files with incorrect symbols(0x0 symbol in utf8 xml files).
I spent some time to debug: Tidy use line wrapping by default and have a bug with line wrapping. Tidy can wrap line in the middle of utf8 byte sequence.

----------------------------------------------------------------------

>Comment By: Adrian Sandor (aditsu)
Date: 2010-04-24 15:21

Message:
Hi, first of all, what JTidy version are you using? Your patch doesn't compile with the current code.
By the way, you can attach files to bug reports.

----------------------------------------------------------------------

Comment By: Nobody/Anonymous (nobody)
Date: 2010-04-22 20:05

Message:
I have fixed it. Sorry, my first assumption was wrong.
Problem in PPrint.java lines 706, 721.
Code is:

wraphere = linelen + 2; // 2, because AddChar is not till later

but it should be:

wraphere = linelen + 1; // 1, because we have arrays uses 0 as first char index and position of last symbol is len - 1 and because AddChar is not till later

I have uploaded file with my changes.
http://tuzikbottle.com/images/exchange/PPrint.java
If you have any question mail me to vyacheslav.gudkov at gmail.com

----------------------------------------------------------------------

Comment By: Nobody/Anonymous (nobody)
Date: 2010-04-22 16:09

Message:
There is a test file:

<root>
<a>1,1′-azobis</a>
</root>

My test settings:

Tidy tidy = new Tidy();
tidy.setXmlTags(true);
tidy.setXmlOut(true);
tidy.setFixBackslash(true); // replace \ with / in urls
tidy.setFixComments(true); // detect mal-formed comments
tidy.setHideComments(true); // hide all comments
tidy.setHideEndTags(true); // prefer self-closing tag where possible
tidy.setLowerLiterals(true); // output lower-case attrib names
tidy.setMakeBare(true); // clean Microsoft cruft
tidy.setNumEntities(true); // prefer number entities to named ones
tidy.setTidyMark(false); // don't add meta tag giving tidy credit
tidy.setOutputEncoding("UTF-8");// force tidy to report encoding as utf-8 instead of other
tidy.setQuoteNbsp(true); // if char 160 is found, output as
tidy.setMakeClean(true); // remove presentational clutter
tidy.setDocType("omit");
tidy.setWraplen(9);

----------------------------------------------------------------------

Comment By: Nobody/Anonymous (nobody)
Date: 2010-04-22 15:04

Message:
I try to fix it today and upload patch.

----------------------------------------------------------------------

You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153
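Editor's note: the underlying invariant in this report is that a line break must never be placed between the bytes of one UTF-8 encoded character. The sketch below illustrates that invariant in isolation. It is not the logic of JTidy's PPrint.java and not the reporter's patch, and the class name `Utf8Wrap` is hypothetical. It relies on the fact that UTF-8 continuation bytes always have the bit pattern 10xxxxxx.

```java
import java.nio.charset.StandardCharsets;

/** Hypothetical helper (not JTidy code): finds a wrap position that does not split a UTF-8 character. */
final class Utf8Wrap {

    private Utf8Wrap() {
    }

    /**
     * Moves the requested break index backwards until it no longer points at a
     * UTF-8 continuation byte (10xxxxxx), so the break falls on a character boundary.
     */
    static int safeBreak(byte[] utf8, int index) {
        int i = Math.min(Math.max(index, 0), utf8.length);
        while (i > 0 && i < utf8.length && (utf8[i] & 0xC0) == 0x80) {
            i--;
        }
        return i;
    }

    public static void main(String[] args) {
        // Same text as the reporter's test file; U+2032 (prime) takes three bytes in UTF-8.
        byte[] text = "1,1\u2032-azobis".getBytes(StandardCharsets.UTF_8);
        for (int wanted = 3; wanted <= 6; wanted++) {
            System.out.println(wanted + " -> " + safeBreak(text, wanted));
        }
    }
}
```

With the reporter's sample text, requested breaks at byte offsets 4 and 5 fall inside the three-byte prime character and are moved back to offset 3, while offsets 3 and 6 are already on character boundaries and stay where they are.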
From: SourceForge.net <no...@so...> - 2010-04-22 12:05:26
|
Bugs item #2990554, was opened at 2010-04-21 19:09 Message generated for change (Comment added) made by nobody You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: Tidy functionality Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Nobody/Anonymous (nobody) Summary: Tidy can wrap line in the middle of utf8 byte sequence. Initial Comment: I have used jtidy to clean up xml files. In some cases it produce files with incorrect symbols(0x0 symbol in utf8 xml files). I spent some time to debug: Tidy use line wrapping by default and have a bug with line wrapping. Tidy can wrap line in the middle of utf8 byte sequence. ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-22 12:05 Message: I have fixed it. Sorry, my first assumption was wrong. Problem in PPrint.java lines 706, 721. Code is: wraphere = linelen + 2; // 2, because AddChar is not till later but is should be: wraphere = linelen + 1; // 1, because we have arrays uses 0 as first char index and position of last symbol is len - 1 and because AddChar is not till later I have uploaded file with my changes. 
http://tuzikbottle.com/images/exchange/PPrint.java If you have any question mail me to vyacheslav.gudkov at gmail.com ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-22 08:09 Message: There is a test file: <root> <a>1,1′-azobis</a> </root> My test settings: Tidy tidy = new Tidy(); tidy.setXmlTags(true); tidy.setXmlOut(true); tidy.setFixBackslash(true); // replace \ with / in urls tidy.setFixComments(true); // detect mal-formed comments tidy.setHideComments(true); // hide all comments tidy.setHideEndTags(true); // prefer self-closing tag where possible tidy.setLowerLiterals(true); // output lower-case attrib names tidy.setMakeBare(true); // clean Microsoft cruft tidy.setNumEntities(true); // prefer number entities to named ones tidy.setTidyMark(false); // don't add meta tag giving tidy credit tidy.setOutputEncoding("UTF-8");// force tidy to report encoding as utf-8 instead of other tidy.setQuoteNbsp(true); // if char 160 is found, output as tidy.setMakeClean(true); // remove presentational clutter tidy.setDocType("omit"); tidy.setWraplen(9); ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-22 07:04 Message: I try to fix it today and upload patch. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-04-22 08:09:24
|
Bugs item #2990554, was opened at 2010-04-21 19:09 Message generated for change (Comment added) made by nobody You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: Tidy functionality Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Nobody/Anonymous (nobody) Summary: Tidy can wrap line in the middle of utf8 byte sequence. Initial Comment: I have used jtidy to clean up xml files. In some cases it produce files with incorrect symbols(0x0 symbol in utf8 xml files). I spent some time to debug: Tidy use line wrapping by default and have a bug with line wrapping. Tidy can wrap line in the middle of utf8 byte sequence. ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-22 08:09 Message: There is a test file: <root> <a>1,1′-azobis</a> </root> My test settings: Tidy tidy = new Tidy(); tidy.setXmlTags(true); tidy.setXmlOut(true); tidy.setFixBackslash(true); // replace \ with / in urls tidy.setFixComments(true); // detect mal-formed comments tidy.setHideComments(true); // hide all comments tidy.setHideEndTags(true); // prefer self-closing tag where possible tidy.setLowerLiterals(true); // output lower-case attrib names tidy.setMakeBare(true); // clean Microsoft cruft tidy.setNumEntities(true); // prefer number entities to named ones tidy.setTidyMark(false); // don't add meta tag giving tidy credit tidy.setOutputEncoding("UTF-8");// force tidy to report encoding as utf-8 instead of other tidy.setQuoteNbsp(true); // if char 160 is found, output as tidy.setMakeClean(true); // remove presentational clutter tidy.setDocType("omit"); tidy.setWraplen(9); 
---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-22 07:04 Message: I try to fix it today and upload patch. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-04-22 07:04:50
|
Bugs item #2990554, was opened at 2010-04-21 19:09 Message generated for change (Comment added) made by nobody You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: Tidy functionality Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Nobody/Anonymous (nobody) Summary: Tidy can wrap line in the middle of utf8 byte sequence. Initial Comment: I have used JTidy to clean up XML files. In some cases it produces files with incorrect symbols (a 0x0 symbol in UTF-8 XML files). I spent some time debugging: Tidy uses line wrapping by default and has a bug in it. Tidy can wrap a line in the middle of a UTF-8 byte sequence. ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-22 07:04 Message: I will try to fix it today and upload a patch. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-04-21 19:09:30
|
Bugs item #2990554, was opened at 2010-04-21 19:09 Message generated for change (Tracker Item Submitted) made by nobody You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: Tidy functionality Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Nobody/Anonymous (nobody) Summary: Tidy can wrap line in the middle of utf8 byte sequence. Initial Comment: I have used JTidy to clean up XML files. In some cases it produces files with incorrect symbols (a 0x0 symbol in UTF-8 XML files). I spent some time debugging: Tidy uses line wrapping by default and has a bug in it. Tidy can wrap a line in the middle of a UTF-8 byte sequence. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2990554&group_id=13153 |
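The kind of failure described in this report can be illustrated without JTidy. The sketch below is a hypothetical helper (`Utf8Wrap` and `safeBreak` are names invented here, not JTidy API, and this is not the patch mentioned in the thread). It assumes the output is a valid UTF-8 byte array and moves a proposed cut position backwards until it no longer lands on a continuation byte.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of JTidy.
final class Utf8Wrap {
    private Utf8Wrap() {}

    // Bytes of the form 10xxxxxx only occur inside a multi-byte UTF-8 sequence.
    static boolean isContinuationByte(byte b) {
        return (b & 0xC0) == 0x80;
    }

    // Returns the largest index <= limit where the buffer can be cut
    // without separating a lead byte from its continuation bytes.
    static int safeBreak(byte[] utf8, int limit) {
        int pos = Math.max(0, Math.min(limit, utf8.length));
        // Cutting at pos is unsafe while utf8[pos] continues an earlier lead byte.
        while (pos > 0 && pos < utf8.length && isContinuationByte(utf8[pos])) {
            pos--;
        }
        return pos;
    }

    public static void main(String[] args) {
        // U+2032 (prime) encodes as E2 80 B2 and occupies byte offsets 3..5.
        byte[] text = "1,1\u2032-azobis".getBytes(StandardCharsets.UTF_8);
        System.out.println(safeBreak(text, 4)); // prints 3
        System.out.println(safeBreak(text, 6)); // prints 6
    }
}
```

A wrapper that works on characters rather than bytes avoids the problem entirely; the byte-level check is only needed when the wrap position is computed on encoded output.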
From: SourceForge.net <no...@so...> - 2010-04-17 19:19:12
|
Bugs item #2985849, was opened at 2010-04-12 19:23 Message generated for change (Comment added) made by aditsu You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2985849&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None >Status: Closed >Resolution: Fixed Priority: 5 Private: No Submitted By: Rajorshi Biswas (rajorshi) Assigned to: Adrian Sandor (aditsu) Summary: Spaces are lost between elements Initial Comment: I think this is fairly serious. Please run the attached HTML through jtidy.jar. You will see that the input HTML: private String parseDescription becomes: privateString parseDescription The space between the span tags is lost. HTML Tidy works fine for this. ---------------------------------------------------------------------- >Comment By: Adrian Sandor (aditsu) Date: 2010-04-18 03:19 Message: Fixed in svn (r1106) ---------------------------------------------------------------------- Comment By: Adrian Sandor (aditsu) Date: 2010-04-12 21:56 Message: Simplified test case: foo<font><span> bar</span></font> ---------------------------------------------------------------------- Comment By: Adrian Sandor (aditsu) Date: 2010-04-12 21:09 Message: Confirmed ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2985849&group_id=13153 |
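One way to catch a regression like the one in this thread is to compare the visible text of a fixture before and after tidying. The following is only a rough sketch: `VisibleText` is a hypothetical helper, the regex tag stripper is an assumption that holds for tiny fixtures and is not an HTML parser, and the `broken` string is a hand-written stand-in for the faulty output, not captured JTidy output.

```java
// Hypothetical regression helper for illustration only; not part of JTidy.
final class VisibleText {
    private VisibleText() {}

    // Crude tag stripper: drops everything between '<' and '>' and
    // collapses runs of whitespace. Adequate for tiny fixtures only.
    static String of(String html) {
        return html.replaceAll("<[^>]*>", "").replaceAll("\\s+", " ").trim();
    }

    public static void main(String[] args) {
        // Simplified test case taken from the thread.
        String input = "foo<font><span> bar</span></font>";
        // Hand-written stand-in for output that lost the space.
        String broken = "foo<font><span>bar</span></font>";
        System.out.println(of(input));                    // prints: foo bar
        System.out.println(of(input).equals(of(broken))); // prints: false
    }
}
```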
From: SourceForge.net <no...@so...> - 2010-04-17 19:17:23
|
Bugs item #2984038, was opened at 2010-04-09 01:56 Message generated for change (Settings changed) made by aditsu You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2984038&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None >Status: Closed >Resolution: Works For Me Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Adrian Sandor (aditsu) Summary: Posible issue with attributes manipulation Initial Comment: Using JTidy with some HTML files that have attribute values without quotes ("), I got an error because the separator space between attributes was deleted. Example: Original: <A NAME='JD_CdigoTributarioArt.6RESOLUCIONN55'> After JTidy: <a id="JD_CdigoTributarioArt.6RESOLUCIONN55"name='JD_CdigoTributarioArt.6RESOLUCIONN55'></a> In this case, JTidy added a 'name' attribute with the same value as the 'id' attribute but without a separator space between the attributes. The options used for this case are: Tidy tidy = new Tidy(); tidy.setXmlOut(true); tidy.setXHTML(true); tidy.setPrintBodyOnly(true); tidy.setShowWarnings(false); tidy.setQuiet(true); tidy.setNumEntities(true); tidy.setDropProprietaryAttributes(true); tidy.setLiteralAttribs(true); I solved this using the option: tidy.setIndentAttributes(true); I am attaching the original .htm file for the example. Sorry for my English and for posting as a guest user. ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-12 22:05 Message: You're right, I'm sorry. Now, I'm using jtidy-r938.jar. I will test it much more. Thanks for your answer. ---------------------------------------------------------------------- Comment By: Adrian Sandor (aditsu) Date: 2010-04-12 21:46 Message: No, I used your exact code, without setIndentAttributes. And you haven't answered my question. ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-12 21:43 Message: Hi, I think that you have tested it with setIndentAttributes(true). Try it WITHOUT this option. I solved my problem with this option. Without setIndentAttributes(true) you can see my error. ---------------------------------------------------------------------- Comment By: Adrian Sandor (aditsu) Date: 2010-04-11 21:14 Message: I tried your input and your code, and the output I get is:
<a id="JD_CdigoTributarioArt.6RESOLUCIONN55"
name='JD_CdigoTributarioArt.6RESOLUCIONN55'></a>
As you can see, the name attribute is on a separate line. What version of JTidy are you using? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2984038&group_id=13153 |
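The symptom discussed in this thread (a quoted attribute value followed immediately by the next attribute name) can be checked for mechanically. The sketch below is an assumption-laden illustration: `AttributeSpacing` and `hasGluedAttributes` are hypothetical names, the regular expression is a heuristic meant for a single start tag rather than a whole document, and none of it is JTidy API.

```java
import java.util.regex.Pattern;

// Hypothetical checker for illustration only; not part of JTidy.
final class AttributeSpacing {
    private AttributeSpacing() {}

    // Matches a quoted attribute value that is followed directly by a
    // character other than whitespace, '/' or '>'.
    private static final Pattern GLUED =
            Pattern.compile("=(\"[^\"]*\"|'[^']*')(?=[^\\s/>])");

    // Intended for the text of one start tag; free text containing
    // '=' and quotes could produce false positives.
    static boolean hasGluedAttributes(String tag) {
        return GLUED.matcher(tag).find();
    }

    public static void main(String[] args) {
        String reported = "<a id=\"JD_CdigoTributarioArt.6RESOLUCIONN55\""
                + "name='JD_CdigoTributarioArt.6RESOLUCIONN55'></a>";
        String expected = "<a id=\"JD_CdigoTributarioArt.6RESOLUCIONN55\"\n"
                + "name='JD_CdigoTributarioArt.6RESOLUCIONN55'></a>";
        System.out.println(hasGluedAttributes(reported)); // prints: true
        System.out.println(hasGluedAttributes(expected)); // prints: false
    }
}
```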
From: SourceForge.net <no...@so...> - 2010-04-12 14:05:41
|
Bugs item #2984038, was opened at 2010-04-08 17:56 Message generated for change (Comment added) made by nobody You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2984038&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Adrian Sandor (aditsu) Summary: Posible issue with attributes manipulation Initial Comment: Using JTidy with some HTML files that have attribute values without quotes ("), I got an error because the separator space between attributes was deleted. Example: Original: <A NAME='JD_CdigoTributarioArt.6RESOLUCIONN55'> After JTidy: <a id="JD_CdigoTributarioArt.6RESOLUCIONN55"name='JD_CdigoTributarioArt.6RESOLUCIONN55'></a> In this case, JTidy added a 'name' attribute with the same value as the 'id' attribute but without a separator space between the attributes. The options used for this case are: Tidy tidy = new Tidy(); tidy.setXmlOut(true); tidy.setXHTML(true); tidy.setPrintBodyOnly(true); tidy.setShowWarnings(false); tidy.setQuiet(true); tidy.setNumEntities(true); tidy.setDropProprietaryAttributes(true); tidy.setLiteralAttribs(true); I solved this using the option: tidy.setIndentAttributes(true); I am attaching the original .htm file for the example. Sorry for my English and for posting as a guest user. ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-12 14:05 Message: You're right, I'm sorry. Now, I'm using jtidy-r938.jar. I will test it much more. Thanks for your answer. ---------------------------------------------------------------------- Comment By: Adrian Sandor (aditsu) Date: 2010-04-12 13:46 Message: No, I used your exact code, without setIndentAttributes. And you haven't answered my question. ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-12 13:43 Message: Hi, I think that you have tested it with setIndentAttributes(true). Try it WITHOUT this option. I solved my problem with this option. Without setIndentAttributes(true) you can see my error. ---------------------------------------------------------------------- Comment By: Adrian Sandor (aditsu) Date: 2010-04-11 13:14 Message: I tried your input and your code, and the output I get is:
<a id="JD_CdigoTributarioArt.6RESOLUCIONN55"
name='JD_CdigoTributarioArt.6RESOLUCIONN55'></a>
As you can see, the name attribute is on a separate line. What version of JTidy are you using? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2984038&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-04-12 13:56:51
|
Bugs item #2985849, was opened at 2010-04-12 19:23 Message generated for change (Comment added) made by aditsu You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2985849&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Rajorshi Biswas (rajorshi) Assigned to: Adrian Sandor (aditsu) Summary: Spaces are lost between elements Initial Comment: I think this is fairly serious. Please run the attached HTML through jtidy.jar. You will see that the input HTML: private String parseDescription becomes: privateString parseDescription The space between the span tags is lost. HTML Tidy works fine for this. ---------------------------------------------------------------------- >Comment By: Adrian Sandor (aditsu) Date: 2010-04-12 21:56 Message: Simplified test case: foo<font><span> bar</span></font> ---------------------------------------------------------------------- Comment By: Adrian Sandor (aditsu) Date: 2010-04-12 21:09 Message: Confirmed ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2985849&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-04-12 13:46:18
|
Bugs item #2984038, was opened at 2010-04-09 01:56 Message generated for change (Comment added) made by aditsu You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2984038&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Adrian Sandor (aditsu) Summary: Posible issue with attributes manipulation Initial Comment: Using JTidy with some HTML files that have attribute values without quotes ("), I got an error because the separator space between attributes was deleted. Example: Original: <A NAME='JD_CdigoTributarioArt.6RESOLUCIONN55'> After JTidy: <a id="JD_CdigoTributarioArt.6RESOLUCIONN55"name='JD_CdigoTributarioArt.6RESOLUCIONN55'></a> In this case, JTidy added a 'name' attribute with the same value as the 'id' attribute but without a separator space between the attributes. The options used for this case are: Tidy tidy = new Tidy(); tidy.setXmlOut(true); tidy.setXHTML(true); tidy.setPrintBodyOnly(true); tidy.setShowWarnings(false); tidy.setQuiet(true); tidy.setNumEntities(true); tidy.setDropProprietaryAttributes(true); tidy.setLiteralAttribs(true); I solved this using the option: tidy.setIndentAttributes(true); I am attaching the original .htm file for the example. Sorry for my English and for posting as a guest user. ---------------------------------------------------------------------- >Comment By: Adrian Sandor (aditsu) Date: 2010-04-12 21:46 Message: No, I used your exact code, without setIndentAttributes. And you haven't answered my question. ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-12 21:43 Message: Hi, I think that you have tested it with setIndentAttributes(true). Try it WITHOUT this option. I solved my problem with this option. Without setIndentAttributes(true) you can see my error. ---------------------------------------------------------------------- Comment By: Adrian Sandor (aditsu) Date: 2010-04-11 21:14 Message: I tried your input and your code, and the output I get is:
<a id="JD_CdigoTributarioArt.6RESOLUCIONN55"
name='JD_CdigoTributarioArt.6RESOLUCIONN55'></a>
As you can see, the name attribute is on a separate line. What version of JTidy are you using? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2984038&group_id=13153 |
From: SourceForge.net <no...@so...> - 2010-04-12 13:43:15
|
Bugs item #2984038, was opened at 2010-04-08 17:56 Message generated for change (Comment added) made by nobody You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2984038&group_id=13153 Please note that this message will contain a full copy of the comment thread, including the initial issue submission, for this request, not just the latest update. Category: None Group: None Status: Open Resolution: None Priority: 5 Private: No Submitted By: Nobody/Anonymous (nobody) Assigned to: Adrian Sandor (aditsu) Summary: Posible issue with attributes manipulation Initial Comment: Using JTidy with some HTML files that have attribute values without quotes ("), I got an error because the separator space between attributes was deleted. Example: Original: <A NAME='JD_CdigoTributarioArt.6RESOLUCIONN55'> After JTidy: <a id="JD_CdigoTributarioArt.6RESOLUCIONN55"name='JD_CdigoTributarioArt.6RESOLUCIONN55'></a> In this case, JTidy added a 'name' attribute with the same value as the 'id' attribute but without a separator space between the attributes. The options used for this case are: Tidy tidy = new Tidy(); tidy.setXmlOut(true); tidy.setXHTML(true); tidy.setPrintBodyOnly(true); tidy.setShowWarnings(false); tidy.setQuiet(true); tidy.setNumEntities(true); tidy.setDropProprietaryAttributes(true); tidy.setLiteralAttribs(true); I solved this using the option: tidy.setIndentAttributes(true); I am attaching the original .htm file for the example. Sorry for my English and for posting as a guest user. ---------------------------------------------------------------------- Comment By: Nobody/Anonymous (nobody) Date: 2010-04-12 13:43 Message: Hi, I think that you have tested it with setIndentAttributes(true). Try it WITHOUT this option. I solved my problem with this option. Without setIndentAttributes(true) you can see my error. ---------------------------------------------------------------------- Comment By: Adrian Sandor (aditsu) Date: 2010-04-11 13:14 Message: I tried your input and your code, and the output I get is:
<a id="JD_CdigoTributarioArt.6RESOLUCIONN55"
name='JD_CdigoTributarioArt.6RESOLUCIONN55'></a>
As you can see, the name attribute is on a separate line. What version of JTidy are you using? ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=113153&aid=2984038&group_id=13153 |