|
From: Mike C. <tu...@gm...> - 2006-10-06 06:48:25
|
On 10/5/06, Brian Pratt <bri...@in...> wrote:
>...the unsuitability of XML for eyeballing what is essentially
columnar data, ...
I do think "eyeballability" is important, but I also feel uneasy
placing the key spectrum data beyond the reach of XML in an XML
spectrum format. In essence, in the current version the XML encodes
spectrum metadata--the peaks themselves become an afterthought, hidden
away in a relatively inaccessible appendix.
This would be easier to justify if this were image data, for which
there is no reasonable textual representation. But in this case there
is a trivial representation, and the code to read and write it is
probably simpler than for the base64-encoded case.
There's some discussion here
http://c2.com/cgi/wiki?IsolateEachDatum
that touches on this issue. Also, an example on that page suggests
another possibility for the encoding of peaklists that I prefer to
those discussed so far:
<peaklist>
<peak mz="234.56" i="789" />
<peak mz="3456.43" i="2" />
<peak mz="3457.22" i="234" />
</peaklist>
This would have the virtue of being highly accessible to eyeball and
quick-and-dirty scripts as well. It would also clearly compress well.
And it keeps the peak data within the realm of XML. It would be
conceivable, I think, to use XSLT to create a table of peak data or
even an SVG image of the spectrum, for example, since everything would
be living in XML-land.
> ...A standard that provides n>1 ways
> to state the same thing is n times as difficult to implement and maintain,
> which reduces vendor enthusiasm by a factor of n (squared?), which hinders
> widespread adoption. ...
I generally agree with this, and in particular, I suspect that if the
specification allowed both representations, possibly most vendors
would only produce base64 output. For this reason, if the textual
representation is preferred, maybe the base64 alternative should be
deprecated and marked for removal in a future version.
However, I think that there is still an advantage to having the
textual alternative in the specification, even if instrument vendors
never produce it. It would allow those of us who prefer the textual
format to do convert to it in a standard way, in a way that
coordinates with the mzData standard.
Mike
|