Thread: RE: [Yaml-core] New Draft (31-Jul-2001)


Clark C. Evans [mailto:cc...@cl...] wrote:
> | >      document ::= bom? (topmap (sep topmap)* eol)?
> | >      topmap ::= pair(0) (eol(0) pair(0))*
> | 
> | Clark seems to agree with you so I modified the productions accordingly.
> | Note that this means a document can't be an empty list, and no top-level
> | map may be empty. Are we comfortable with this?
> 
> Now that you point out the disadvantages, I'm not sure.
> I can see where an empty map may be useful, as well as
> an empty list.  Having the 'empty' document be valid
> could also be rather useful.  I can see "touch fname" being
> used to create a valid YAML text.  Perhaps it was better 
> the other way...

It is easy enough to change back...

> I wonder if the "class/type/kind" indicator
> should just be a property of each node (an attribute
> of each node in the information model), and not
> be an implicit map...

I'm dead set against it. The model is and should be a simple
map/list/scalar, no hidden attributes etc. Want an attribute? use a map.
Also, what about comments?

Putting some more churn into the loop: How about we restrict the shorthand
just to '!'. If you want a comment, write:

    delivery: %
        =:  1/3/2001
        !:  date
        #:  John, if you don't make this
            date, you'll be DOA, not Doe!

Seems acceptable enough. Comments just don't mix well with shorthand forms.
They quickly tend to get multi-line. As for the class vs. format issue, I've
had the idea that we should follow IANA's notion:

    mug shot: !image/bmp/gzip/base64
        ...base64 data...

That is, the 'class' would be multi-part. The interpretation would be
schema-specific, of course (each application having its own set of types),
but the notion would be that the first part(s) would specify the interface
and/or the concrete class; further parts would specify transfer encoding
steps. The above is an "image" (interface?) in "bmp" format (concrete
class?) which was gzip-ed and then base64-ed to obtain the text value placed
in the YAML file. IANA could be used as a source for both "first parts" and
"further parts"...
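
To make the multi-part idea concrete, here is a minimal Python sketch of how an application might read such an indicator. Everything in it is an assumption for illustration only: the names `DECODERS`, `split_type` and `decode` are hypothetical, and the rule "trailing parts that name a known decoder are transfer-encoding steps" is just one possible schema-specific interpretation, not something the draft defines.

```python
import base64
import gzip

# Hypothetical registry of transfer-encoding steps (an assumption, not part
# of any YAML draft). Each entry undoes one encoding step on bytes.
DECODERS = {
    "base64": base64.b64decode,
    "gzip": gzip.decompress,
}


def split_type(indicator):
    """Split '!image/bmp/gzip/base64' into (kind parts, decoding steps)."""
    parts = indicator.lstrip("!").split("/")
    steps = []
    # Assumption: trailing parts that name a known decoder are encoding steps.
    # Popping from the end yields them in the order they must be undone.
    while parts and parts[-1] in DECODERS:
        steps.append(parts.pop())
    return parts, steps


def decode(indicator, text):
    """Undo the encoding steps named in the indicator, last applied first."""
    kind, steps = split_type(indicator)
    data = text.encode("ascii")
    for step in steps:
        data = DECODERS[step](data)
    return kind, data


# Round trip: gzip first, then base64, as in the "mug shot" example.
raw = b"BM placeholder bitmap bytes"
text = base64.b64encode(gzip.compress(raw)).decode("ascii")
kind, data = decode("!image/bmp/gzip/base64", text)
assert kind == ["image", "bmp"]
assert data == raw
```

An application with an empty type set would simply never call anything like `decode` and would treat the value as plain text.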

Again, YAML-CORE would also allow for '!' to be used as a shorthand, and not
enforce any special semantics on it beyond saying that "by convention" it is
used this way. Every application would be free to define its own set of
types (in particular, the empty set would be a valid, common choice).

I guess I'm getting sidetracked into debating this issue too soon, before
you are ready to devote time for it (and Brian is completely away). Let's
table it until we are all back, OK?

> Before I address the next one, I think I'd like
> to limit the simple scalar so that the following...
> 
>    bad: this is a simple scalar
>        that continues on the next line.
> 
> is not allowed.

Bye-bye being able to parse/emit RFC0822 headers, then... Also, no more
being able to write:

    point: %
        # : This is a long
            comment.
        x : 12.5
        y : 3.7

I don't know... What's the gain?

> This is necessary to make the following 
> unambiguous:
> 
>   one:
> 
> 
>   two: The above is a single new line
>   xxx:
>       Without this limitation, "one"
>       above is ambiguous, does it have
>       a single new line or two...

No, today it is completely unambiguous. "one" should have the value
"\n\n\n".

> | In short, it seems the wording for the simple scalar section needs a
> | rewrite, the examples need to clarify these points, and maybe we should
> | change the way all the productions handle newlines (at the end instead of
> | at the beginning). Wow!
> 
> You could try to do that... but methinks it'd be
> massively ugly.  I'm sure there is a way we can 
> decompose this with meaningful productions while 
> still keeping the new line at the beginning of
> each production.

I'm not convinced. I inherited the "eol at start of production" from your
early drafts, so I never really tried it the other way around. I will just
have to try to see for myself :-)

It seems obvious, though, that placing the eol at the end of productions
would make the eol at the end of the document compulsory. I have no problem
with that.
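
A toy Python sketch of that last point, using regular expressions in place of the real productions. The `PAIR` pattern is a made-up stand-in for pair(0), and treating the final eol as optional in the "eol at start" form is an assumption of this sketch rather than a statement about the draft.

```python
import re

PAIR = r"[a-z]+: [a-z0-9]+"  # toy stand-in for the draft's pair(0)
EOL = r"\n"

# eol at the start of each continuation; the final eol is a separate item,
# assumed optional here.
LEADING = re.compile(rf"(?:{PAIR}(?:{EOL}{PAIR})*(?:{EOL})?)?")

# eol at the end of each pair; every pair must be terminated by one.
TRAILING = re.compile(rf"(?:{PAIR}{EOL})*")


def accepts(grammar, text):
    """True if the whole text matches the toy grammar."""
    return grammar.fullmatch(text) is not None


assert accepts(LEADING, "x: 1\ny: 2")       # no final eol: accepted
assert accepts(LEADING, "x: 1\ny: 2\n")
assert accepts(TRAILING, "x: 1\ny: 2\n")
assert not accepts(TRAILING, "x: 1\ny: 2")  # missing final eol: rejected
assert accepts(TRAILING, "")                # empty document still matches
```

With the eol folded into the end of each pair, there is no place left to make the last one optional without a special case, which is why the final eol becomes compulsory.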

Have fun,

    Oren Ben-Kiki
