Thread: [Yaml-core] open issues: 8-bit BOM and lookahead

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 454-5900

Ok.  In the last week we seem to have two open issues:

1. The spec does not require a BOM for UTF-8 and it seems
   that this is industry practice so that legacy encodings
   can be handled.  Also, without requiring UTF-8, it is 
   a bit harder to do a #ENCODING:ISO8859-1 at a later date,
   for example.  

   I suggest we ammend the specification to allow strict ASCII
   (only 7 bit characters) without a BOM and require all streams
   which use unicode to start with the UTF BOM.  This should have 
   zero little impact on existing YAML users since there arn't
   any unicode parsers yet...

2. We have a potentially large lookahead for the series/key 
   short-hand.  I suggest limiting this case to only allow
   for in-line string (without anchors and type family).
   The impact is that the following become illegal:

   ---
   - this: is legal
   - this: [ is, also, legal]
   - this: !!remains &001 legal
   ---
   - !!this is: illegal
   - &so is: this
   - [and,this,is]: illegal too

If we all agree, I can patch up the spec in the next few days.

Best,

Clark

Thread: [Yaml-core] open issues: 8-bit BOM and lookahead

yaml-core