I have been unable to get Tidy to leave the content of my attribute values unchanged. These attribute values are produced by MS Access and are not SGML-compliant: one of them contains an unescaped XML document holding the data model for the page. Every option I have tried in Tidy ends up converting the "<" and ">" characters in the attribute value to "&lt;" and "&gt;".
I realize that this is the "right" thing to do from the standpoint of fixing HTML errors, but it is wrong from the standpoint of readability. It makes it very difficult to tell whether two of these generated pages have the same data model, because the elements sit inside an attribute and so never get indented.
Tidy already has
quote-ampersand, which allows standalone "&" characters to be left alone if desired (though it defaults to yes),
and quote-marks, which is no by default and leaves '"' and "'" alone,
and quote-nbsp, which allows non-breaking spaces to be left alone if desired.
Would it be possible to implement a similar feature for "<" and ">" inside attributes and other places where Tidy would otherwise guess they should be escaped? The literal-attributes option, which allows whitespace inside attributes to be left alone, doesn't seem to do the trick (whether it is set to yes or to no).
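In case it helps explain what I am after, here is a rough Python sketch of the comparison I currently have to do by hand. It is not Tidy code and only a stopgap: it takes an attribute value after Tidy has escaped it, reverses the escaping, and indents the embedded XML so two data models can be diffed. The function name `readable_model` and the sample value are made up for illustration, and it assumes the embedded document is well-formed XML.

```python
import html
from xml.dom import minidom


def readable_model(escaped_value: str) -> str:
    """Undo the entity escaping and indent the embedded XML (hypothetical helper)."""
    # html.unescape turns "&lt;" back into "<", "&gt;" into ">", and so on.
    raw_xml = html.unescape(escaped_value)
    # Assumption: the attribute holds a single well-formed XML document.
    document = minidom.parseString(raw_xml)
    pretty = document.documentElement.toprettyxml(indent="  ")
    # Drop blank lines so two models can be compared line by line.
    return "\n".join(line for line in pretty.splitlines() if line.strip())


# Made-up attribute value as it might look after Tidy has escaped it.
sample = "&lt;model&gt;&lt;field name=&quot;id&quot;/&gt;&lt;/model&gt;"
print(readable_model(sample))
```

Running both pages' attribute values through something like this and diffing the results works, but an option in Tidy itself would save the extra step.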
(P.S. Sorry for posting this request directly to the mailing list as well; on my first visit to this web site I couldn't find where to add a new request.)