From: Tomas K. <to...@us...> - 2004-09-17 14:00:34
> I received a mail with an encoded subject, which was not decoded:
> Subject: =?windows-1252?Q?Nikon Information 06-04: Photokina 2004?=

http://www.faqs.org/rfcs/rfc2047.html

2. Syntax of encoded-words
...
   IMPORTANT: 'encoded-word's are designed to be recognized as 'atom's
   by an RFC 822 parser. As a consequence, unencoded white space
   characters (such as SPACE and HTAB) are FORBIDDEN within an
   'encoded-word'. For example, the character sequence

      =?iso-8859-1?q?this is some text?=

   would be parsed as four 'atom's, rather than as a single 'atom' (by
   an RFC 822 parser) or 'encoded-word' (by a parser which understands
   'encoded-words'). The correct way to encode the string "this is some
   text" is to encode the SPACE characters as well, e.g.

      =?iso-8859-1?q?this=20is=20some=20text?=

   The characters which may appear in 'encoded-text' are further
   restricted by the rules in section 5.
...
4.2. The "Q" encoding
...
   3.) 8-bit values which correspond to printable ASCII characters other
       than "=", "?", and "_" (underscore), MAY be represented as those
       characters. (But see section 5 for restrictions.) In
       particular, SPACE and TAB MUST NOT be represented as themselves
       within encoded words.

--
Tomas
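
To illustrate the rule quoted above, here is a minimal Python sketch, not taken from any particular mail client. The names `q_encode_word` and `looks_like_encoded_word` are hypothetical helpers made up for this example. The encoder passes through only letters, digits and "!*+-/" (the conservative set section 5 allows in a phrase) and hex-escapes everything else, including SPACE. The checker is a simplified pattern test, not a full RFC 2047 parser, and neither function handles the 75-character limit on encoded-words.

```python
import re

# Bytes that may appear literally in 'Q' encoded-text (conservative set
# from RFC 2047 section 5, rule 3). Everything else is hex-escaped.
_SAFE = set(b"ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789!*+-/")

# Simplified shape of an encoded-word: =?charset?encoding?encoded-text?=
# with no '?' and no whitespace inside the charset or the encoded-text.
_ENCODED_WORD = re.compile(r"=\?[^?\s]+\?[QqBb]\?[^?\s]+\?=")


def q_encode_word(text: str, charset: str = "iso-8859-1") -> str:
    """Return text as a single 'Q' encoded-word (hypothetical helper)."""
    parts = []
    for byte in text.encode(charset):
        if byte in _SAFE:
            parts.append(chr(byte))
        else:
            # SPACE becomes =20, so no bare white space ends up in the word.
            parts.append(f"={byte:02X}")
    return f"=?{charset}?q?{''.join(parts)}?="


def looks_like_encoded_word(word: str) -> bool:
    """True if word matches the simplified encoded-word pattern."""
    return _ENCODED_WORD.fullmatch(word) is not None


if __name__ == "__main__":
    bad = "=?windows-1252?Q?Nikon Information 06-04: Photokina 2004?="
    print(looks_like_encoded_word(bad))
    print(q_encode_word("this is some text"))
```

With the subject from the report, the check fails because of the unencoded spaces, which is why a strict decoder leaves that header as it is.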