Thread: RE: [Yaml-core] A modest proposal

Clark C. Evans [mailto:cc...@cl...] wrote:
> | File/value starts with '@' or ':' -> list.
> | File/value starts with '=' or '|' -> scalar.
> | File/value starts with '%' or anything else -> map.
> 
> Nice, but "=" should not be the indicator for a 
> scalar (since it represents the default value).
> Even $ is better, although I like ^ the best.

Hmmm. I don't want to make $ an indicator, to allow "price: $12". As for
"^", I don't like it somehow. How about "`"?

key1: `% this simple scalar starts with an indicator.
key2: `
    This simple scalar starts at the next line.
    It has no leading or trailing white space.
key3:
    This is: a map
key4:
    : This is a list entry.
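
To make the rule concrete, here is a minimal sketch of the first-character test, with "`" swapped in for "=" as suggested above. The function name classify_value is made up for illustration, and treating an empty value as a map is my assumption (it falls under "anything else"); none of this is part of any agreed spec.

```python
def classify_value(text: str) -> str:
    """Guess the node kind from the first non-blank character of a value."""
    # Skip leading spaces and newlines so that a value starting on the
    # next line (key3, key4 above) is judged by its first real character.
    first = text.lstrip()[:1]
    if first in ("@", ":"):
        return "list"
    if first in ("`", "|"):
        return "scalar"
    # '%' or anything else (including an empty value, by assumption) is a map.
    return "map"


print(classify_value("`% this simple scalar starts with an indicator."))
print(classify_value("\n    This is: a map"))
print(classify_value("\n    : This is a list entry."))
```

Run against the four keys above, this gives scalar, scalar, map and list.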

Brian's implicit indicators scheme has the advantage that it removes many
indicators from the YAML file: most maps and lists won't need any. But it has
a downside in that multi-line simple scalars will have to add an indicator.
OK, there are far fewer multi-line simple scalars than there are maps and
lists, I guess. But I think we should strive to minimize the visual impact
of these. "`" has much less "polluting visual presence" than "^", "$" or
"=". And it is a kind of quote, after all...

Taking this to its extreme (as I've been known to do :-), here's a much less
modest proposal. Let's use " for quoted strings, ' for simple scalars, ` for
blocks. In all cases the full form uses the quote as a wrapper (`this is a
block`). However, the start and/or end quote may be omitted if this doesn't
lead to an ambiguity (the default style is '). For example, if a " value
happens to start or end with a ", then the leading/trailing " becomes
mandatory. If a block doesn't contain the final newline, the trailing `
becomes mandatory, etc.

Examples:

: "
    This is a string with escapes
    (what we call a quoted string).
    Lines are folded.
    It can use any printable character, but
    escapes such as \n are expanded.
    Using " is consistent with Perl.
: '
    This is a string without escapes
    (what we call a simple scalar).
    Lines are folded.
    It can use any printable character since
    escapes such as \n are not expanded.
    Using ' is consistent with Perl.
: `
    This is an "as if" string
    (what we call a block).
    It can use any character including
    newlines since no line folding is
    done and escapes are not expanded.
    Since ` is the "odd bird" in the
    quotes family, its use is at least
    not inconsistent with anything.
: `
    This block doesn't have a trailing newline.`
: `
    This block does have it `
    `
: This is implicitly single-quote
: 'This is explicitly single quote
: 'Likewise'
: 'This one ends with a ''
: "This is explicitly double quote
: "Likewise"
: `This is legal for orthogonality`
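
For the single-line forms just above, the "optional wrapper" rule can be sketched roughly as follows. This is only an illustration under my own assumptions: the name read_inline_scalar is hypothetical, a trailing quote matching the style is always taken to be the closing wrapper, and no escape expansion or line folding is attempted. The style names are the ones suggested in the Pros below.

```python
from typing import Tuple

# Quote character -> scalar style name.
STYLES = {'"': "escaped", "'": "folded", "`": "block"}


def read_inline_scalar(text: str) -> Tuple[str, str]:
    """Return (style, raw content) for a one-line scalar."""
    if text[:1] in STYLES:
        # An explicit leading quote selects the style.
        quote, body = text[0], text[1:]
    else:
        # No leading quote: the default style is '.
        quote, body = "'", text
    # Assumption: a trailing matching quote is the (optional) closing
    # wrapper, so a value that really ends with a quote must double it.
    if body.endswith(quote):
        body = body[:-1]
    return STYLES[quote], body


print(read_inline_scalar("'This one ends with a ''"))
print(read_inline_scalar("`This is legal for orthogonality`"))
```

Under this reading 'Likewise' and Likewise give the same folded scalar, and the value with the doubled quote keeps exactly one trailing '.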

Pros:
- We use the three quote types for the three scalar types (I'd suggest we
call them 'block', 'folded' and 'escaped', in an increasing level of
processing).
- We avoid the need to escape quotes - we only have to quote
unprintable characters.
- The choice of quote types is intuitive to the Perl-aware.
- It looks better visually (IMVHO).
- There is no need for a special format for a block without a trailing
newline.
- The scalar types become more consistent with each other (specifying them
would be fun - each could be defined in terms of the next-simpler format).
- Using all types in a key becomes "inevitable":

this is an implicit simple key: ...
'this is an explicit simple key' : ...
"this is an escaped key" : ...
`this is a block key` : ...

I rather like this combination. Thoughts?

Have fun,

    Oren Ben-Kiki
