Thread: [Yaml-core] The case against comments.


Hi.

Recently, Oren posted an example that looked like this:

my-machine: 192.168.1.17

Which was then modified to look like this:

my-machine: %=
  =: 192.168.1.17
  #: DON'T CHANGE THIS! several clients
    access this address directly - talk
    with the Network Administrator first.

I think that this is an excellent example of why substitutability is
important and why our APIs should support it.

However, I think that introducing a comment construct into the language
would be a mistake.

Do we really need a special comment key? If we were going to let people
attach comments inline using an indicator, the way classes are marked, then I
could see the argument for it, but I still don't think it would be appropriate.

For example:

my-machine: #"DON'T CHANGE THIS!" 192.168.1.17

The problem with this approach is that people will, I believe, be surprised
to find out that the simple act of adding a comment to the source file
changes their data from a simple scalar into a map. This is unlike any other
language in existence (that I know of). Most lexers actually skip over
comments so that they never even end up in the parse tree.
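
To make the point about lexers concrete, here is a small sketch using
Python's standard ast module. Python is only one example of such a language
and the variable names are invented for illustration. It parses the same
assignment with and without a trailing comment and compares the resulting
trees.

```python
import ast

# The same assignment, once with a trailing comment and once without.
with_comment = "address = '192.168.1.17'  # DON'T CHANGE THIS!"
without_comment = "address = '192.168.1.17'"

# ast.dump() omits line/column attributes by default, so two sources that
# differ only by a comment produce identical dumps.
same_tree = ast.dump(ast.parse(with_comment)) == ast.dump(ast.parse(without_comment))
print(same_tree)  # True: the comment never reaches the parse tree
```

The value stays a plain string in both cases, which is the behaviour most
people would expect when they add a comment.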

It looks like you might not be allowing this syntax, though, and would
rather make the user explicitly turn a scalar with a comment into a map like
in Oren's example above. My question is then, why make them use the # key?
If it's just adding an extra pair that will get ignored by the application,
then why don't we let them use whatever key they want?

For example, I'd much rather see this:

my-machine: %=
  address: 192.168.1.17
  warning: DON'T CHANGE THIS! several clients
    access this address directly - talk
    with the Network Administrator first.

Here the first pair is the default (do we still like this?) and the second
pair is just an extra one. The application reading this file would ignore it
because it doesn't know about the key, and it is probably going to use
Clark's asScalar() or Oren's v() function anyway, so it would always get the
default value and never even see the warning key.
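
As a rough sketch of what that looks like from the application side, the
Python below models a YAML map as a dict and treats the first pair as the
default. The as_scalar() helper is hypothetical: it borrows the name of
Clark's asScalar(), but the real function's behaviour is not shown in this
thread, so treat the logic as an assumption.

```python
def as_scalar(node):
    """Return the scalar value a consumer cares about.

    Hypothetical helper (assumed semantics): a plain scalar is returned
    unchanged; for a map, the value of the first pair is the default.
    """
    if isinstance(node, dict):
        if not node:
            raise ValueError("an empty map has no default pair")
        # dicts preserve insertion order, so the first pair is the default.
        return next(iter(node.values()))
    return node


# The original document: a simple scalar.
plain = {"my-machine": "192.168.1.17"}

# The annotated document: a map whose extra key carries the warning.
annotated = {
    "my-machine": {
        "address": "192.168.1.17",
        "warning": "DON'T CHANGE THIS! several clients access this address "
                   "directly - talk with the Network Administrator first.",
    }
}

# The application sees the same value either way.
assert as_scalar(plain["my-machine"]) == as_scalar(annotated["my-machine"])
```

An application written this way never looks at the warning key, so adding or
removing it does not change what the application reads.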

The advantages to this approach are:

1) Users are explicitly adding a key and so can't claim to be surprised when
they encounter a map during parsing.

2) The # indicator is free to be used or not used elsewhere.

3) More than one comment can be added to a node.

4) There will never be any question as to whether or not comments are part
of the information model and can be ignored since there will no longer be
comments (in the traditional sense).

5) Requiring users to make up an appropriate key name is more verbose but
conveys more information to anybody reading the source from then on (like
how I renamed # to warning above). This is true even if they use the key
"comment"--that's still clearer to me than "#".

Can somebody come up with some disadvantages? I can't think of any.

Jason.
