From: Angel P. <an...@ma...> - 2007-08-08 13:00:35
On 8/7/07, Brian Pratt <bri...@in...> wrote:
> Hi Angel,
>
> If I understand your question to be about identifying current mismatches between terminology in the schema and the ontology, I'm not sure there are any - but probably only because the schema has so little actual terminology in it.

My question was more of a pragmatic one, about where you would add specificity to the mzML schema. Your selectionWindow example below is a good one, in that the specification of a selectionWindow is probably a range value, and we should have two sub-elements corresponding to the typed cvParam values that define the window (or just a well-defined range sub-element, skipping cvParam altogether).

I don't think your second example is a good one, though, since there are so many permutations of an ionSelection protocol, and more are certainly on the way; that is better handled by an ontology specification. Yes, this does make parsers slightly harder, since now you must pay attention to the incoming ontology, but it is the same amount of work as if everything were in the schema.

mzXML could get away with a tight specification of these complex and changing annotations, since its sole purpose was support of the ISB pipeline. Its open-source status only served to increase the user base, but the schema changes were driven solely by the needs of that pipeline and solely by the community that used it. Trying to build consensus across many different groups has led to the current version of mzML, and the major structure of mzML will not change at this point, so please let's just get to the specifics of going through the schema and identifying where you think an annotation should be promoted to the level of a schema element, and we'll discuss as a group.

-angel

> Consider this example:
>
>   <xs:element name="selectionWindow" maxOccurs="unbounded">
>     <xs:complexType>
>       <xs:sequence>
>         <xs:element name="cvParam" type="dx:CVParamType" minOccurs="2" maxOccurs="unbounded"/>
>       </xs:sequence>
>     </xs:complexType>
>   </xs:element>
>
> which says absolutely nothing at all about what a selectionWindow element can be expected to contain when you encounter it. It just says it will contain at least two "parameters". Not much of an aid to software development.
>
> The schema, if we can call it that, doesn't even specify what some of the most fundamental information about a scan looks like. For example, it specifies that a scan may have a list of precursors, each of which will contain an ionSelection, but stops short of telling you what an ionSelection looks like:
>
>   <xs:element name="ionSelection" type="dx:ParamGroupType">
>     <xs:annotation>
>       <xs:documentation>This captures the type of ion selection being performed, and trigger m/z (or m/z's), neutral loss criteria etc. for tandem-MS or data dependent scans.</xs:documentation>
>     </xs:annotation>
>   </xs:element>
>
> Nearly all the details of nearly all the elements are just unspecified blobs. Normally with an XML format you can expect to at least start your work by running it through something like XMLSpy that will autogenerate a reader and a writer that you can then polish up (to handle, for example, the necessary weirdness of base64+zlib in the peaklists). But with this, you get no kind of a head start at all, since the vast majority of the syntax is hidden behind blobs like dx:CVParamType and dx:ParamGroupType. It's just not a specification.
>
> The statement that led to your question, I think, was just me saying that if we *did* create an actual schema, we'd want its terminology to agree with the ontology wherever possible. But it has to actually contain some terminology, unlike the current schema.
>
> Brian
>
> ------------------------------
> From: del...@gm... [mailto:del...@gm...] On Behalf Of Angel Pizarro
> Sent: Tuesday, August 07, 2007 1:10 PM
> To: Brian Pratt
> Cc: psi...@li...
> Subject: Re: [Psidev-ms-dev] cvParams using name attribute as value
>
> On 8/7/07, Brian Pratt <bri...@in...> wrote:
>> Hey, the horse just twitched: by placing CVparam information in attributes of the elements of a conventionally structured XML schema (ala mzXML) we can make use of the OBO work without adding a lot of unwanted complexity to software systems that aren't really interested in it. An mzML that integrates well with OBO-aware systems is an excellent idea, but an mzML that demands you BE an OBO-aware system seems less likely to achieve widespread adoption.
>
> Can you name specific attributes that you want to have cv terms be the value for that are currently not in the schema?
> -angel

--
Angel Pizarro
Director, Bioinformatics Facility
Institute for Translational Medicine and Therapeutics
University of Pennsylvania
806 BRB II/III, 421 Curie Blvd.
Philadelphia, PA 19104-6160
P: 215-573-3736  F: 215-573-9004
From: Eric D. <ede...@sy...> - 2007-08-08 06:35:22
Thank you all for the lively discussion.

One proposal I once made in Lyon (which was roundly dismissed, I believe) was something like this: instead of

  <cvParam cvLabel="MS" accession="MS:1000554" name="LCQ Deca" value=""/>

have

  <cvParam cvLabel="MS" parentAccession="MS:1000031" accession="MS:1000554" name="LCQ Deca" value=""/>

Thus the parser can easily be coded to know that any cvParam with a parentAccession="MS:1000031" is going to be an instrument model whether or not it's in the CV. The mzML semantic validator tool would, of course, check all this. The main argument against this was the potential for inconsistency, I seem to recall.

The decision was made to make individual models CV terms to avoid problems like:

  <cvParam cvLabel="MS" accession="MS:1000031" name="instrument model" value="LCQ Deca"/>
  <cvParam cvLabel="MS" accession="MS:1000031" name="instrument model" value="LCQ DECA"/>
  <cvParam cvLabel="MS" accession="MS:1000031" name="instrument model" value="LTQ FT"/>
  <cvParam cvLabel="MS" accession="MS:1000031" name="instrument model" value="LTQ-FT"/>
  <cvParam cvLabel="MS" accession="MS:1000031" name="instrument model" value="LTQFT"/>

I would argue that your code snippet below would better look like:

  #define MS_CV_POLARITY_TYPE "MS:1000037"
  if( element.parent == "spectrumDescription" ) {
    for each child {
      if (child.name=="cvParam") then {
        if( cv.isChildOf(child.attrs['accession'], MS_CV_POLARITY_TYPE) )  // if a polarity type
          spectrum.polarity = cv.getName(child.attrs['accession']);
      }
  }

Note that the cvParam name (should that be "positive" or "Positive" or "positive polarity" or "Polarity" or "polarity"?) is not in the code, just MS:1000037, which can be considered final.

This does require a CV class and some methods:

  cv.loadFromFile()
  cv.isChildOf()
  cv.getName()

but this is not really complicated.

Take cover!
Eric

________________________________
From: psi...@li... [mailto:psi...@li...] On Behalf Of Matthew Chambers
Sent: Tuesday, August 07, 2007 1:43 PM
To: psi...@li...
Subject: Re: [Psidev-ms-dev] cvParams using name attribute as value

As long as the name/value paradigm is used, the loop doesn't get much more complicated than:

  if( element.parent == "spectrumDescription" ) {
    for each child {
      if (child.name=="cvParam") then {
        if( child.attrs['name'] == "Polarity" )
          spectrum.polarity = child.attrs['value'];
      }
  }

But if you have to do:

  if( element.parent == "spectrumDescription" ) {
    for each child {
      if (child.name=="cvParam") then {
        if( child.attrs['name'] == "Positive" )
          spectrum.polarity = "positive";
        else if( child.attrs['name'] == "Negative" )
          spectrum.polarity = "negative";
      }
  }

...parsers will be painful to write and adoption will suffer because of it, I think. Not to mention the fact that the idea of adding these things that should really be values as "terms" in the vocabulary is indeed not future-proof. In the future, there might be another IS_A relationship for "LCQ Deca" so that merely by seeing LCQ Deca you won't know that you're looking at an instrument model parameter. Of course, the accession number would tell you uniquely, but then you'll have two accession numbers in the vocabulary with the name "LCQ Deca." Yuck!

I think values for terms should be given a special relationship in the CV; they shouldn't be given an "IS_A" relationship and expect the parser to look up the implication of that relationship every time a value-as-term is encountered.

-Matt

________________________________
From: psi...@li... [mailto:psi...@li...] On Behalf Of Brian Pratt
Sent: Tuesday, August 07, 2007 3:00 PM
To: psi...@li...
Subject: Re: [Psidev-ms-dev] cvParams using name attribute as value

Upon reflection, I realize that this is, for me, actually a new objection to mzML. My original problem with the reliance on CV/OBO is that an XML parser for it looks something like this:

  for each element {
    if (element.name=="cvParam") then {
      a whole bunch of handrolled logic to pick this apart
    } else {
      there isn't much else
    }
  }

That's not really an XML parser, therefore I conclude that mzML isn't really XML. But I have previously beaten that horse to death.

Now we have something new not to like: it's impossible to write a parser that's even remotely future-proof. Or maybe it's not new, and I just missed it before. Either way, this all looks increasingly ill conceived to me. Sorry to be such a downer.

Hey, the horse just twitched: by placing CVparam information in attributes of the elements of a conventionally structured XML schema (ala mzXML) we can make use of the OBO work without adding a lot of unwanted complexity to software systems that aren't really interested in it. An mzML that integrates well with OBO-aware systems is an excellent idea, but an mzML that demands you BE an OBO-aware system seems less likely to achieve widespread adoption.

I do understand the desire to maintain an ontology instead of an ontology and an XML schema, but I'm not sure we can really get away with it. By having a schema that offloads most of its work to an external ontology, we're just pushing the work that having a proper schema saves onto the folks creating the readers and writers, making their job much more complicated than it ought to be - you can't autogenerate a parser or serializer without a fully realized schema. I think we risk them deciding that mzXML and mzData aren't really all that broken after all.

Brian
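For readers following along, the CV helper Eric sketches above (loadFromFile / isChildOf / getName) can be quite small. The snippet below is only an illustration in Python, not part of any mzML tooling; the OBO file name and term IDs are assumptions, and only plain id:/name:/is_a: lines of [Term] stanzas are handled:

```python
class CV:
    def __init__(self):
        self.names = {}      # accession -> term name, e.g. "MS:1000554" -> "LCQ Deca"
        self.parents = {}    # accession -> set of is_a parent accessions

    def load_from_file(self, path):
        current = None
        for line in open(path):
            line = line.strip()
            if line == "[Term]":
                current = None
            elif line.startswith("id: "):
                current = line[4:]
                self.parents.setdefault(current, set())
            elif current and line.startswith("name: "):
                self.names[current] = line[6:]
            elif current and line.startswith("is_a: "):
                # "is_a: MS:1000125 ! thermo finnigan" -> "MS:1000125"
                self.parents[current].add(line[6:].split("!")[0].strip())

    def get_name(self, accession):
        return self.names.get(accession, accession)

    def is_child_of(self, accession, ancestor):
        # walk the is_a ancestry; also true when accession == ancestor
        seen, stack = set(), [accession]
        while stack:
            term = stack.pop()
            if term == ancestor:
                return True
            if term not in seen:
                seen.add(term)
                stack.extend(self.parents.get(term, ()))
        return False

# usage sketch, assuming a local copy of the PSI-MS OBO file:
# cv = CV(); cv.load_from_file("psi-ms.obo")
# if cv.is_child_of("MS:1000554", "MS:1000031"):        # LCQ Deca under instrument model
#     print("instrument model:", cv.get_name("MS:1000554"))
```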
From: Brian P. <bri...@in...> - 2007-08-07 21:20:28
Hi Angel, If I understand your question to be about identifying current mismatches between terminology in the schema and the ontology, I'm not sure there are any - but probably only because the schema has so little actual terminology in it. Consider this example: <xs:element name="selectionWindow" maxOccurs="unbounded"> <xs:complexType> <xs:sequence> <xs:element name="cvParam" type="dx:CVParamType" minOccurs="2" maxOccurs="unbounded"/> </xs:sequence> </xs:complexType> </xs:element> which says absolutely nothing at all about what a selectionWindow element can be expected to contain when you encounter it. It just says it will contain at least two "parameters". Not much of an aid to software development. The schema, if we can call it that, doesn't even specify what some of the most fundamental information about a scan looks like. For example, it specifies that a scan may have a list of precursors, each of which will contain an ionSelection, but stops short of telling you what an ionSelection looks like: <xs:element name="ionSelection" type="dx:ParamGroupType"> <xs:annotation> <xs:documentation>This captures the type of ion selection being performed, and trigger m/z (or m/z's), neutral loss criteria etc. for tandem-MS or data dependent scans.</xs:documentation> </xs:annotation> </xs:element> Nearly all the details of nearly all the elements are just unspecified blobs. Normally with an XML format you can expect to at least start your work by running it through something like XMLSpy that will autogenerate a reader and a writer that you can then polish up (to handle, for example, the necessary weirdness of base64+zlib in the peaklists). But with this, you get no kind of a head start at all, since the vast majority of the syntax is hidden behind blobs like dx:CVParamType and dx:ParamGroupType. It's just not a specification. The statement that led to your question, I think, was just me saying that if we *did* create an actual schema, we'd want its terminology to agree with the ontology where ever possible. But it has to actually contain some terminology, unlike the current schema. Brian _____ From: del...@gm... [mailto:del...@gm...] On Behalf Of Angel Pizarro Sent: Tuesday, August 07, 2007 1:10 PM To: Brian Pratt Cc: psi...@li... Subject: Re: [Psidev-ms-dev] cvParams using name attribute as value On 8/7/07, Brian Pratt <bri...@in...> wrote: Hey, the horse just twitched: by placing CVparam information in attributes of the elements of a conventionally structured XML schema (ala mzXML) we can make use of the OBO work without adding a lot of unwanted complexity to software systems that aren't really interested in it. An mzML that integrates well with OBO-aware systems is an excellent idea, but an mzML that demands you BE an OBO-aware system seems less likely to achieve widespread adoption. Can you name specific attributes that you want to have cv terms be the value for that are currently not in the schema? -angel |
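As an aside, the "necessary weirdness of base64+zlib in the peaklists" that Brian mentions boils down to a short round trip. The sketch below assumes little-endian 64-bit floats and zlib compression; a real reader would take precision and compression from the spectrum's cvParams rather than hard-coding them:

```python
import base64, struct, zlib

def decode_peaks(b64_text, compressed=True, precision=64):
    """Return a tuple of floats from an encoded binary data array."""
    raw = base64.b64decode(b64_text)
    if compressed:
        raw = zlib.decompress(raw)
    code = "d" if precision == 64 else "f"               # 64- or 32-bit floats
    return struct.unpack("<%d%s" % (len(raw) // (precision // 8), code), raw)

def encode_peaks(values, precision=64):
    code = "d" if precision == 64 else "f"
    raw = struct.pack("<%d%s" % (len(values), code), *values)
    return base64.b64encode(zlib.compress(raw)).decode("ascii")

# round trip check:
# peaks = (445.12, 445.35, 446.01)
# assert decode_peaks(encode_peaks(peaks)) == peaks
```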
From: Matthew C. <mat...@va...> - 2007-08-07 20:44:18
As long as the name/value paradigm is used, the loop doesn't get much more complicated than: if( element.parent == "spectrumDescription" ) { for each child { if (child.name=="cvParam") then { if( child.attrs['name'] == "Polarity" ) spectrum.polarity = child.attrs['value']; } } But if you have to do: if( element.parent == "spectrumDescription" ) { for each child { if (child.name=="cvParam") then { if( child.attrs['name'] == "Positive" ) spectrum.polarity = "positive"; else if( child.attrs['name'] == "Negative" ) spectrum.polarity = "negative"; } } ...parsers will be painful to write and adoption will suffer because of it I think. Not to mention the fact that the idea of adding these things that should really be values as "terms" in the vocabulary is indeed not future-proof. In the future, there might be another IS_A relationship for "LCQ Deca" so that merely by seeing LCQ Deca you won't know that you're looking at an instrument model parameter. Of course, the accession number would tell you uniquely, but then you'll have two accession numbers in the vocabulary with the name "LCQ Deca." Yuck! I think values for terms should be given a special relationship in the CV, they shouldn't be given an "IS_A" relationship and expect the parser to look up the implication of that relationship every time a value-as-term is encountered. -Matt _____ From: psi...@li... [mailto:psi...@li...] On Behalf Of Brian Pratt Sent: Tuesday, August 07, 2007 3:00 PM To: psi...@li... Subject: Re: [Psidev-ms-dev] cvParams using name attribute as value Upon reflection, I realize that this is, for me, actually a new objection to mzML. My original problem with the reliance on CV/OBO is that an XML parser for it looks something like this: for each element { if (element.name=="cvParam") then { a whole bunch of handrolled logic to pick this apart } else { there isn't much else } } That's not really an XML parser, therefore I conclude that mzML isn't really XML. But I have previously beaten that horse to death. Now we have something new not to like: it's impossible to write a parser that's even remotely future-proof. Or maybe it's not new, and I just missed it before. Either way, this all looks increasingly ill conceived to me. Sorry to be such a downer. Hey, the horse just twitched: by placing CVparam information in attributes of the elements of a conventionally structured XML schema (ala mzXML) we can make use of the OBO work without adding a lot of unwanted complexity to software systems that aren't really interested in it. An mzML that integrates well with OBO-aware systems is an excellent idea, but an mzML that demands you BE an OBO-aware system seems less likely to achieve widespread adoption. I do understand the desire to maintain an ontology instead of an ontology and an XML schema, but I'm not sure we can really get away with it. By having a schema that offloads most of its work to an external ontology, we're just pushing the work that having a proper schema saves onto the folks creating the readers and writers, making their job much more complicated that it ought to be - you can't autogenerate a parser or serializer without a fully realized schema. I think we risk them deciding that mzXML and mzData aren't really all that broken after all. Brian |
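A runnable rendering of the two loops Matthew contrasts, for anyone who wants to experiment; the element and attribute names simply follow his pseudocode and the mzData/mzML snippets quoted later in the thread, not any final schema:

```python
import xml.etree.ElementTree as ET

MZDATA_STYLE = """<spectrumDescription>
  <cvParam cvLabel="psi" accession="PSI:1000037" name="Polarity" value="positive"/>
</spectrumDescription>"""

MZML_STYLE = """<spectrumDescription>
  <cvParam cvLabel="MS" accession="MS:1000130" name="Positive Scan" value=""/>
</spectrumDescription>"""

def polarity_from_name_value(xml_text):
    # the name identifies the variable, the value carries the data
    for cv in ET.fromstring(xml_text).iter("cvParam"):
        if cv.get("name") == "Polarity":
            return cv.get("value")

def polarity_from_term_as_value(xml_text):
    # every possible polarity term must be known to the parser up front
    known = {"Positive Scan": "positive", "Negative Scan": "negative"}
    for cv in ET.fromstring(xml_text).iter("cvParam"):
        if cv.get("name") in known:
            return known[cv.get("name")]

print(polarity_from_name_value(MZDATA_STYLE))    # -> positive
print(polarity_from_term_as_value(MZML_STYLE))   # -> positive
```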
From: Angel P. <an...@ma...> - 2007-08-07 20:10:15
On 8/7/07, Brian Pratt <bri...@in...> wrote: > > > Hey, the horse just twitched: by placing CVparam information in > attributes of the elements of a conventionally structured XML schema (ala > mzXML) we can make use of the OBO work without adding a lot of unwanted > complexity to software systems that aren't really interested in it. An > mzML that integrates well with OBO-aware systems is an excellent idea, but > an mzML that demands you BE an OBO-aware system seems less likely to achieve > widespread adoption. > Can you name specific attributes that you want to have cv terms be the value for that are currently not in the schema? -angel |
From: Brian P. <bri...@in...> - 2007-08-07 20:00:51
Upon reflection, I realize that this is, for me, actually a new objection to mzML. My original problem with the reliance on CV/OBO is that an XML parser for it looks something like this: for each element { if (element.name=="cvParam") then { a whole bunch of handrolled logic to pick this apart } else { there isn't much else } } That's not really an XML parser, therefore I conclude that mzML isn't really XML. But I have previously beaten that horse to death. Now we have something new not to like: it's impossible to write a parser that's even remotely future-proof. Or maybe it's not new, and I just missed it before. Either way, this all looks increasingly ill conceived to me. Sorry to be such a downer. Hey, the horse just twitched: by placing CVparam information in attributes of the elements of a conventionally structured XML schema (ala mzXML) we can make use of the OBO work without adding a lot of unwanted complexity to software systems that aren't really interested in it. An mzML that integrates well with OBO-aware systems is an excellent idea, but an mzML that demands you BE an OBO-aware system seems less likely to achieve widespread adoption. I do understand the desire to maintain an ontology instead of an ontology and an XML schema, but I'm not sure we can really get away with it. By having a schema that offloads most of its work to an external ontology, we're just pushing the work that having a proper schema saves onto the folks creating the readers and writers, making their job much more complicated that it ought to be - you can't autogenerate a parser or serializer without a fully realized schema. I think we risk them deciding that mzXML and mzData aren't really all that broken after all. Brian _____ From: psi...@li... [mailto:psi...@li...] On Behalf Of Matthew Chambers Sent: Tuesday, August 07, 2007 11:57 AM To: psi...@li... Subject: Re: [Psidev-ms-dev] cvParams using name attribute as value In addition to Mike's and Brian's concerns, I am wondering how "LCQ Deca" is called a "term/concept?" "Instrument model" is the closest relevant term/concept as I understand those words. Is the cvParam not capable of controlling both the name and possible values of its definitions? Also, why are the different instrument models part of the CV anyway? It seems that the CV should support controlling both terms and the values (or instances) of those terms: "LCQ Deca" IS A VALID INSTANCE OF "thermo finnigan" IS A "thermo fisher scientific" IS A "instrument model" I don't really understand the middle two jumps either, i.e. why are they redundant? _____ From: Eric Deutsch [mailto:ede...@sy...] Sent: Tuesday, August 07, 2007 12:13 PM To: Matthew Chambers; psi...@li... Subject: RE: [Psidev-ms-dev] cvParams using name attribute as value Hi Matt, the agree-upon rule here is that the cvParams should always refer to the most detailed concept, and the value attribute should *only* be filled if there is a scalar value associated with the concept that cannot be in the CV itself. So: <cvParam cvLabel="MS" accession="MS:1000554" name="LCQ Deca" value=""/> <cvParam cvLabel="MS" accession="MS:1000529" name="Instrument Serial Number" value="23433"/> So for the first, the term/concept is "LCQ Deca". For the CV, one can learn that an "LCQ Deca" IS A "instrument model", and so there's no need (and is perhaps a little dangerous) to put "LCQ Deca" as a value of "instrument model". However, "instrument serial number" is the most specific concept in the CV, and thus the actual SN is the value. 
This was discussed at some length and this is the new way of doing things, that will be uniform across all PSI and FuGE implementations. At least, that is my understanding. This does mean that parsers need to be a little smarter and be "CV-aware". The parser/interpreter can no longer assume that there will be a term "instrument model" and look for its value. But rather, the parser/interpreter must now look to see if any of the terms provided are a child of "instrument model" in the CV. Regards, Eric _____ From: psi...@li... [mailto:psi...@li...] On Behalf Of Matthew Chambers Sent: Tuesday, August 07, 2007 9:40 AM To: psi...@li... Subject: [Psidev-ms-dev] cvParams using name attribute as value I'm a little confused about the parameters which use the accession number as a kind of value instead of the accession number identifying a variable and then using the value attribute to assign the value. I don't understand why: <cvParam cvLabel="MS" accession="MS:1000130" name="Positive Scan" value=""/> (from mzML) Is preferable to: <cvParam cvLabel="psi" accession="PSI:1000037" name="Polarity" value="positive"/> (from mzData) There are other examples of this as well. What's the logic here? -Matt Chambers |
From: Matthew C. <mat...@va...> - 2007-08-07 18:57:21
In addition to Mike's and Brian's concerns, I am wondering how "LCQ Deca" is called a "term/concept?" "Instrument model" is the closest relevant term/concept as I understand those words. Is the cvParam not capable of controlling both the name and possible values of its definitions? Also, why are the different instrument models part of the CV anyway? It seems that the CV should support controlling both terms and the values (or instances) of those terms: "LCQ Deca" IS A VALID INSTANCE OF "thermo finnigan" IS A "thermo fisher scientific" IS A "instrument model" I don't really understand the middle two jumps either, i.e. why are they redundant? _____ From: Eric Deutsch [mailto:ede...@sy...] Sent: Tuesday, August 07, 2007 12:13 PM To: Matthew Chambers; psi...@li... Subject: RE: [Psidev-ms-dev] cvParams using name attribute as value Hi Matt, the agree-upon rule here is that the cvParams should always refer to the most detailed concept, and the value attribute should *only* be filled if there is a scalar value associated with the concept that cannot be in the CV itself. So: <cvParam cvLabel="MS" accession="MS:1000554" name="LCQ Deca" value=""/> <cvParam cvLabel="MS" accession="MS:1000529" name="Instrument Serial Number" value="23433"/> So for the first, the term/concept is "LCQ Deca". For the CV, one can learn that an "LCQ Deca" IS A "instrument model", and so there's no need (and is perhaps a little dangerous) to put "LCQ Deca" as a value of "instrument model". However, "instrument serial number" is the most specific concept in the CV, and thus the actual SN is the value. This was discussed at some length and this is the new way of doing things, that will be uniform across all PSI and FuGE implementations. At least, that is my understanding. This does mean that parsers need to be a little smarter and be "CV-aware". The parser/interpreter can no longer assume that there will be a term "instrument model" and look for its value. But rather, the parser/interpreter must now look to see if any of the terms provided are a child of "instrument model" in the CV. Regards, Eric _____ From: psi...@li... [mailto:psi...@li...] On Behalf Of Matthew Chambers Sent: Tuesday, August 07, 2007 9:40 AM To: psi...@li... Subject: [Psidev-ms-dev] cvParams using name attribute as value I'm a little confused about the parameters which use the accession number as a kind of value instead of the accession number identifying a variable and then using the value attribute to assign the value. I don't understand why: <cvParam cvLabel="MS" accession="MS:1000130" name="Positive Scan" value=""/> (from mzML) Is preferable to: <cvParam cvLabel="psi" accession="PSI:1000037" name="Polarity" value="positive"/> (from mzData) There are other examples of this as well. What's the logic here? -Matt Chambers |
From: Brian P. <bri...@in...> - 2007-08-07 18:45:49
Piling on with Mike, here: So the first thing any parser must do is load up the OBO file. In practice, such a software system will need to bundle an OBO in some fashion, in the extremely likely event that the OBO used by the mzML file in question is not present. Don't forget to update your distro each time the OBO gets updated, and make sure that in the event the OBO used by the mzML file IS present, you use that intead. Then, read: <cvParam cvLabel="MS" accession="MS:1000554" name="LCQ Deca" value=""/> then ask yourself, "whazzat?", and look up: id: MS:1000554 name: LCQ Deca def: "ThermoFinnigan LCQ Deca." [PSI:MS] is_a: MS:1000125 ! thermo finnigan which leads you to: id: MS:1000125 name: thermo finnigan def: "ThermoFinnigan from Thermo Electron Corporation" [PSI:MS] is_a: MS:1000483 ! thermo fisher scientific which leads you to: id: MS:1000483 name: thermo fisher scientific def: "Thermo Fisher Scientific. Also known as Thermo Finnigan corporation." [PSI:MS] related_synonym: "Thermo Scientific" [] is_a: MS:1000031 ! model by vendor which leads you to: id: MS:1000031 name: model by vendor def: "Instrument's model name (everything but the vendor's name) ---Free text ?" [PSI:MS] relationship: part_of MS:1000463 ! instrument description which leads you to: id: MS:1000463 name: instrument description def: "Device which performs a measurement." [PSI:MS] relationship: part_of MS:0000000 ! mzOntology aha! now populate the "instrument description" element in your database. Which is all fine, in its way, until a new instrument "LCQ Spiff-o" comes out and the OBO isn't immediately updated to match, in which case the parser can't even tell that it's an instrument declaration. This is a curiously upside down way to write XML. If I were designing it I'd make the CV stuff an attribute of the instrument info, for anyone that cares to dive into the OBO, but allow the XML to stand alone in the absence of a suitable OBO. I'd make an effort to use the same terminology in the XML element and attribute names as in the OBO just to reduce confusion. I guess what I'm describing is something like mzXML with the addition of CV info as attributes of the existing element types to aid those interested in using OBO to unify data from different sources, without annoying those uninterested in unifying data from different systems. But, some of you will recall that the use of the CV stuff in lieu of proper XML (in the sense that you have no real hope of making full sense of mzML without access to an external file) is a longstanding crank of mine, and I don't really expect to change it this late in the game. - Brian _____ From: psi...@li... [mailto:psi...@li...] On Behalf Of Eric Deutsch Sent: Tuesday, August 07, 2007 10:13 AM To: Matthew Chambers; psi...@li... Subject: Re: [Psidev-ms-dev] cvParams using name attribute as value Hi Matt, the agree-upon rule here is that the cvParams should always refer to the most detailed concept, and the value attribute should *only* be filled if there is a scalar value associated with the concept that cannot be in the CV itself. So: <cvParam cvLabel="MS" accession="MS:1000554" name="LCQ Deca" value=""/> <cvParam cvLabel="MS" accession="MS:1000529" name="Instrument Serial Number" value="23433"/> So for the first, the term/concept is "LCQ Deca". For the CV, one can learn that an "LCQ Deca" IS A "instrument model", and so there's no need (and is perhaps a little dangerous) to put "LCQ Deca" as a value of "instrument model". 
However, "instrument serial number" is the most specific concept in the CV, and thus the actual SN is the value. This was discussed at some length and this is the new way of doing things, that will be uniform across all PSI and FuGE implementations. At least, that is my understanding. This does mean that parsers need to be a little smarter and be "CV-aware". The parser/interpreter can no longer assume that there will be a term "instrument model" and look for its value. But rather, the parser/interpreter must now look to see if any of the terms provided are a child of "instrument model" in the CV. Regards, Eric _____ From: psi...@li... [mailto:psi...@li...] On Behalf Of Matthew Chambers Sent: Tuesday, August 07, 2007 9:40 AM To: psi...@li... Subject: [Psidev-ms-dev] cvParams using name attribute as value I'm a little confused about the parameters which use the accession number as a kind of value instead of the accession number identifying a variable and then using the value attribute to assign the value. I don't understand why: <cvParam cvLabel="MS" accession="MS:1000130" name="Positive Scan" value=""/> (from mzML) Is preferable to: <cvParam cvLabel="psi" accession="PSI:1000037" name="Polarity" value="positive"/> (from mzData) There are other examples of this as well. What's the logic here? -Matt Chambers |
From: Mike C. <tu...@gm...> - 2007-08-07 18:06:11
On 8/7/07, Eric Deutsch <ede...@sy...> wrote: > <cvParam cvLabel="MS" accession="MS:1000554" name="LCQ Deca" value=""/> > > <cvParam cvLabel="MS" accession="MS:1000529" name="Instrument Serial Number" > value="23433"/> > > > So for the first, the term/concept is "LCQ Deca". For the CV, one can learn > that an "LCQ Deca" IS A "instrument model", and so there's no need (and is > perhaps a little dangerous) to put "LCQ Deca" as a value of "instrument > model". > > > However, "instrument serial number" is the most specific concept in the CV, > and thus the actual SN is the value. > > > This was discussed at some length and this is the new way of doing things, > that will be uniform across all PSI and FuGE implementations. At least, that > is my understanding. This does mean that parsers need to be a little smarter > and be "CV-aware". The parser/interpreter can no longer assume that there > will be a term "instrument model" and look for its value. But rather, the > parser/interpreter must now look to see if any of the terms provided are a > child of "instrument model" in the CV. Actually, the parser really should not only check whether the term provided *is* a child in the current CV, but also whether it ever *will be* in a future version of the CV. Unfortunately, the technology required to make such a check is not yet available. :-) I'm not very familiar with how CV is supposed to work, but from this example it appears that the namespaces for different kinds of things have been merged together, and that there is an assumption that there will be no collisions. And that anything that doesn't currently have a name basically doesn't exist. In the example given of writing a parser, the task of extracting the name of the instrument, given just the mzML file, is changed from being trivial to being essentially impossible. The mzML file becomes meaningless in itself, and only has meaning relative to a particular version of the CV, which the parser must have access to. Am I misunderstanding something? Mike |
From: Eric D. <ede...@sy...> - 2007-08-07 17:12:46
Hi Matt, the agree-upon rule here is that the cvParams should always refer to the most detailed concept, and the value attribute should *only* be filled if there is a scalar value associated with the concept that cannot be in the CV itself. So:

  <cvParam cvLabel="MS" accession="MS:1000554" name="LCQ Deca" value=""/>
  <cvParam cvLabel="MS" accession="MS:1000529" name="Instrument Serial Number" value="23433"/>

So for the first, the term/concept is "LCQ Deca". For the CV, one can learn that an "LCQ Deca" IS A "instrument model", and so there's no need (and is perhaps a little dangerous) to put "LCQ Deca" as a value of "instrument model".

However, "instrument serial number" is the most specific concept in the CV, and thus the actual SN is the value.

This was discussed at some length and this is the new way of doing things, that will be uniform across all PSI and FuGE implementations. At least, that is my understanding. This does mean that parsers need to be a little smarter and be "CV-aware". The parser/interpreter can no longer assume that there will be a term "instrument model" and look for its value. But rather, the parser/interpreter must now look to see if any of the terms provided are a child of "instrument model" in the CV.

Regards,
Eric

________________________________
From: psi...@li... [mailto:psi...@li...] On Behalf Of Matthew Chambers
Sent: Tuesday, August 07, 2007 9:40 AM
To: psi...@li...
Subject: [Psidev-ms-dev] cvParams using name attribute as value

I'm a little confused about the parameters which use the accession number as a kind of value instead of the accession number identifying a variable and then using the value attribute to assign the value. I don't understand why:

  <cvParam cvLabel="MS" accession="MS:1000130" name="Positive Scan" value=""/>  (from mzML)

is preferable to:

  <cvParam cvLabel="psi" accession="PSI:1000037" name="Polarity" value="positive"/>  (from mzData)

There are other examples of this as well. What's the logic here?

-Matt Chambers
From: Matthew C. <mat...@va...> - 2007-08-07 16:40:18
I'm a little confused about the parameters which use the accession number as a kind of value instead of the accession number identifying a variable and then using the value attribute to assign the value. I don't understand why: <cvParam cvLabel="MS" accession="MS:1000130" name="Positive Scan" value=""/> (from mzML) Is preferable to: <cvParam cvLabel="psi" accession="PSI:1000037" name="Polarity" value="positive"/> (from mzData) There are other examples of this as well. What's the logic here? -Matt Chambers |
From: Randy J. <rkj...@in...> - 2007-08-06 04:14:21
There are many reasons why people might want to put multiple runs in a single file - and mostly it is the MRM-type experiments where this makes sense. In reality, it is just a convenience mechanism for dealing with the large number of files created by high-throughput experiments. I probably reminded the group about this use case, but adopting the schema to do this can be thought of as one of many possible optimizations.

Personally, I like the tarball (zip, jar, etc.) approach, because it allows us to use standard approaches for managing the multiple files. There was a thread earlier that reminded us that while you can stream from zip files, you cannot easily parse XML from inside a zip file - so maybe we have to think through this.

My vote today would be to leave the "multiple run" use case out for now, since broad adoption is helped by the existence of APIs so much (as was mentioned earlier in this thread).

Randy

-----Original Message-----
From: psi...@li... [mailto:psi...@li...] On Behalf Of Eric Deutsch
Sent: Friday, August 03, 2007 3:00 AM
To: psi...@li...
Subject: Re: [Psidev-ms-dev] mzML 0.93 ready for first review

> From: psi...@li... [mailto:psi...@li...] On Behalf Of Matthew Chambers
>
> What's wrong with the schema supporting multiple runs per file and letting implementers gradually add support for it? There are many features of mzML that will require substantial rewrites of the existing parser APIs. Parameter groups, multiple runs, multiple precursors, and compressed binary data are all major "completely predictable trouble spots." As long as the file readers develop faster than the file writers, there won't be a problem. ;) I very much doubt that writers (e.g. ReAdW) will be writing multiple instrument files into one mzML file any time soon (unless somebody is itching to do this without saying so?). The parameter groups and multiple precursors are more problematic, IMO, but still good improvements.

If I recall correctly, the feature of multiple runs crept in at the end of the Seattle meeting last fall. Can anyone articulate a compelling use case for multiple runs per file? I seem to recall a scenario where at least one vendor encodes multiple runs in a single (wiff??) file, but I don't know about any of that for sure. Anyone have such a case?

> I have a few comments:
>
> - There seems to be a timestamp on the run element now (maybe I just missed it before), of type xs:dateTime. It's an optional attribute and it has an ambiguous meaning. Why isn't this expanded into a start and stop timestamp for the run? Also, why is it optional?

I believe the intent was that the timestamp is the UT at the start of the run. We should clarify this. Is it useful to encode the stop timestamp? As for why optional, we imagined that in the real world this value might not be known properly. Imagine a scenario where someone is converting a legacy mzXML file to mzML. This information may not be available, sadly. It is certainly encouraged that modern converters/writers include it.

> - Most every cvParam has a "cvLabel" attribute that is "MS" but the accession attribute of each cvParam seems to include the cvLabel in it ("MS:xxxxxxxx"). If that is just a coincidence, I think it should be changed so that it is required and the cvLabel can be eliminated. If it's not a coincidence, why is it like that? If the parser needs to know which vocabulary an accession number is from, it can parse until the colon delimiter. Alternatively, keeping cvLabel and getting rid of the "MS:" in the accession attribute would allow somewhat more efficient parsing. In the alternative case, I suggest a required default cvLabel somewhere in the header, similar to setting the default XML namespace.

cvLabel is really just an id to indicate which CV (as more completely defined within <cvList> above) the term comes from. It seems to be current best practice that life science CV accession numbers begin with an OBO namespace, :, and a number. But not all CVs will necessarily follow this convention as far as I'm aware.

> - I see a TODO item is giving the binaryDataArray's "dataType" attribute a CV entry. I agree with this. But I think the values should be more machine-oriented, like "float32", "float64", "int32", "uint64", etc.

You mean "float32" is preferred over "32-bit float"?

> - Parameter groups are good, especially since the spectrum headers seem to have ballooned to be more flexible. Anything that makes the file-dominating spectrum elements smaller and faster to parse is nice - indexing the shared parameters is a good way to do this.
>
> - I'd still like to see a clear definition of "run" relative to "sample" and "source file." Seems like these three are all tightly coupled.

For LC-MSn ion trap data, this is relatively straightforward and is what is depicted in the examples. A run is a series of scans, usually counted consecutively by the instrument, obtained as a sample is injected into the instrument. A sample is the biomaterial that is injected into an instrument over a run. The source file is the one or more files from which the mzML was generated. It will usually be a single vendor-format raw file. It could be an mzXML file. It could (unfortunately) be a series of dta files.

For MALDI or gel spot processing, however, this might be quite different. We had previously entertained "analyte identifier" or "MALDI spot identifier" CV terms to allow annotating each spot. This might make the run-based sample undefined. I think LC-MSn heavily colored our thinking during development. It would be extremely nice to have a detailed example of data where individual scans refer to different "analyte identifiers" or the like. Would someone contribute this?

Regards,
Eric

> -Matt Chambers
>
> Vanderbilt MSRC
>
> ________________________________
>
> From: psi...@li... [mailto:psi...@li...] On Behalf Of Brian Pratt
> Sent: Thursday, August 02, 2007 2:47 PM
> To: psi...@li...
> Subject: Re: [Psidev-ms-dev] mzML 0.93 ready for first review
>
> (Note: I know I'm late to the party with this comment, but I think it's important)
>
> I noticed this in the todo file:
>
> " - Now that we're allowing multiple runs in a file, how will the index look to handle this?"
>
> Better question: what will software that uses such an index look like?
>
> Answer: it won't look much like anything that currently reads mzXML and mzData - including X!Tandem or anything using RAMP (TPP and others) or JRAP (CPAS and others). These programs easily deal with both mzData and mzXML in their various versions by using APIs which, as it happens, assume one file per run and one run per file. Breaking this one to one correspondence in mzML means you can't just slide mzML support in behind the API, and of course also violates a fundamental assumption which flows through the code that calls these APIs, right out to the user interface in most cases. This means extensive surgery to any program that wants to read mzML properly, and my guess is that means mzML is DOA. At a minimum it becomes a completely predictable trouble spot since you can now write legal mzML files that the majority of mzML readers will simply not know how to handle. They'll be OK with RunList::count == 1, but no more - so, why set ourselves up for trouble?
>
> Multiple runs per file are probably useful in some cases, but if the stated goal of mzML is to replace mzXML and mzData then I think this feature is actually scope creep which threatens the mission and should be dropped. Let those who really want this feature come up with a wrapper schema, but don't call it mzML lest you force the vast majority of mzML consuming software to be broken from the start.
>
> - Brian
>
> ________________________________
>
> From: psi...@li... [mailto:psi...@li...] On Behalf Of Eric Deutsch
> Sent: Thursday, August 02, 2007 1:02 AM
> To: len...@eb...; Jimmy Eng; lu...@eb...; Puneet Souda; Joshua Tasman; Pierre-Alain Binz; Henning Hermjakob; Randy Julian; Andy Jones; David Creasy; Sean L Seymour; Angel Pizarro; David Fenyo; Jam...@wa...; Mike Coleman; Matthew Chambers; Helen Jenkins; Philip Jones; Shofstahl, Jim; Brian Pratt; Andreas Römpp; Kent Laursen; Martin Eisenacher; Fredrik Levander; Jayson Falkner; Pedrioli Patrick Gino Angelo; Hans Vissers; Eric Deutsch; cl...@br...; dav...@ag...; rb...@be...; psidev-ms-de...@li...
> Cc: Rolf Apweiler; Ruedi Aebersold
> Subject: [Psidev-ms-dev] mzML 0.93 ready for first review
>
> Hi everyone, after considerable hard work from many people, we have a prerelease of mzML (the union of mzData and mzXML) available for comment by you, a major stakeholder in mzML.
>
> You may download a kit of material to examine at:
>
> http://db.systemsbiology.net/projects/PSI/mzML/mzML_beta1R1.zip
>
> The general mzML development page is at:
>
> http://psidev.info/index.php?q=node/257
>
> Please send feedback to:
>
> psi...@li...
>
> We ask that you respond by August 20.
>
> Additional releases with more information may be provided during the coming month.
>
> The current format has been guided by these principles:
>
> - Keep the format simple
> - Minimize alternate ways of encoding the same information
> - Allow some flexibility for encoding new important information
> - Support the features of mzData and mzXML but not a lot more
> - But do provide clear support for SRM data
> - Finish the format soon with the resources available
>
> There are many enhancements that have been suggested, but the small group of volunteers that have actively developed this format have opted to focus on the primary goal set before us: develop a single format that the vendors and current software can easily support and thereby obsolete mzData and mzXML. The enhancements not considered compatible with this goal will be entertained for mzML 2.0.
>
> We are committed to providing not just the format, but also a set of working implementations, converters and readers, as well as a format validator, all to ensure that mzML is a format that will be adopted quickly and implemented uniformly. Prior to submission to the PSI document process, the following software will implement mzML:
>
> - 2 or more converters from vendor formats to mzML
> - the popular reader library RAMP that currently supports mzData and mzXML
> - an mzML semantic validator that checks for correct implementation
>
> We hope to follow this schedule:
>
> 2007-08-02  Release of mzML beta1R1 to major stakeholders for comment
> 2007-08-20  Comments from major stakeholders received
> 2007-09-01  Revised mzML 1.0 submitted to PSI document process, beginning 30 days internal review
> 2007-10-01  Revised mzML 1.01 begins 60 days community review
> 2007-10-06  Formal announcement that feedback is sought at HUPO world congress
> 2007-12-01  Formal 60 days community review closes
> 2008-01-01  Revised mzML 1.02 officially released
>
> Thank you for your help! Feel free to forward this message to someone whom you think should review the format at this stage.
>
> Regards,
>
> Eric
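On the zip question Randy raises: streaming XML out of a zip member is workable with standard libraries (what gets awkward is random access via a byte-offset index). A small sketch, with made-up archive and member names:

```python
import zipfile
import xml.etree.ElementTree as ET

def count_spectra_in_zip(archive_path, member_name):
    """Stream-parse one mzML-like member of a zip archive without extracting it."""
    with zipfile.ZipFile(archive_path) as archive:
        with archive.open(member_name) as stream:              # decompresses on the fly
            count = 0
            for _, elem in ET.iterparse(stream, events=("end",)):
                if elem.tag.endswith("spectrum"):
                    count += 1
                elem.clear()                                   # keep memory flat
            return count

# count_spectra_in_zip("runs.zip", "runA.mzML")
```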
From: Matt C. <mat...@va...> - 2007-08-05 00:44:28
Brian, I'm glad to hear discussion is going on. We know the index style has to change if multi-run support is kept, so it ought to be done properly. When this was last discussed on this list (right after I subscribed), I thought the same as you: that a hierarchical index would work best for a multi-run file. This is what I suggested in the earlier list discussion about the index: <source name="someSourceName"> <spectrum scan="15"> ... </spectrum> </source> <index> <indexedSource name="someSourceName" offset="0"> <indexedSpectrum scan="15" offset="33"> ... </indexedSpectrum> </indexedSource> </index> It was exactly the same structure as yours, obviously more verbose than necessary. I have changed my mind about the structure since then though. :) I do not think the index should have a separate XML element for each offset. That actually makes it HARDER to parse. There is no question that a space delimited list is easier and faster to parse than fully structured XML. You may be right about some XML parser libs not appreciating several kilobytes of characters for a single attribute. That could be solved by the following style of index: <index run_id="runA" size="15"> <scanNumberList>2 3 4 6 7 8 10 11 12 14 15 16</scanNumberList> <offsetList>42 142 242 342 442 542 642 742 842 942 1042 1142</offsetList> </index> <index run_id="runB" size="15"> <scanNumberList>2 3 4 6 7 8 10 11 12</scanNumberList> <offsetList>1242 1342 1442 1542 1642 1742 1842 1942 2042 2142 2242 2342</offsetList> </index> Even if the rest of the parsing is done with an XML lib like expat or xerces, simple delimiter/token parsing (like RAMP uses) is the best way to read this style of index, and nothing could be faster (without going binary). As we both agree, XML purists dislike any of these index styles, so we needn't try to please. :) As I already mentioned, once the scan numbers are in an array (which is by definition sorted since scan numbers are guaranteed to be in ascending order), a binary search can find the index into the array for a queried scan number. Alternatively, the query could be done in a streaming fashion, without storing the index in memory, by incrementing a counter while tokenizing the scan number list, and then using that counter to know how many tokens to advance in the offset list. But I don't see the point of not reading the entire index whenever an mzML file is opened (assuming the index exists and it's trusted to be correct), so I would opt for the binary search idea. If we wanted to be even faster, save space, and reuse the base64 functions, both lists could be binary and base64 encoded. I don't think that binary encoding would be terribly useful here, but it's an option. My main idea is to get away from the overhead of having each offset be wrapped by an XML element. It all depends how much you adhere to the motto: "If you're gonna go, go all out." -Matt Brian Pratt wrote: > Matt - > > Thanks for the info. Hopefully it shows up in the schema comments. > > Josh and I were kicking around the index idea off list (by accident, the > reply-to default tripped us up...). I agree that my initial idea is more > verbose than it should be. I worry though that those lists you propose > could get very long very fast and become troublesome to some XML parser > libs. Josh also suggested a more structured approach, here's my tweak of > that: (same example as before, but with runs named "Bob" and "fizzle" to > avoid implying any structured name conventions) > > <index run_id="Bob"> > ... 
> <offset scan="1041" 4212696/> > <offset scan="1042" 4218791/> > </index> > <index run_id="fizzle"> > <offset scan="1" 4221806/> > <offset scan="2" 4227580/> > <offset scan="3" 4231174/> > ... > </index> > > Not nearly as tight as yours, but easier to write an XML handler for it, and > less likely to upset XML purists (who are already outraged by the idea of an > index, so maybe that's not a problem...). We could use smaller names, too, > "fpos" for "offset", etc. > > Brian > > -----Original Message----- > From: psi...@li... > [mailto:psi...@li...] On Behalf Of Matt > Chambers > Sent: Friday, August 03, 2007 7:44 PM > To: psi...@li... > Subject: Re: [Psidev-ms-dev] compressionType (RE: mzML 0.93 ready for first > review) > > Brian Pratt wrote: > >> Oh, and to revisit the original question, a snippet of a multi-run >> index would look like something like this, I think, assuming the >> runList contained a couple of runs with id="runA" and id="runB" >> respectively: >> >> <offset run_id="runA" spectrum_id="1041">4212696</offset><offset >> run_id="runA" spectrum_id="1042">4218791</offset> <offset >> run_id="runB" spectrum_id="1">4221806</offset> <offset run_id="runB" >> spectrum_id="2">4227580</offset> <offset run_id="runB" >> spectrum_id="3">4231174</offset> >> >> > Wouldn't a simpler, faster, and easier index be something like: > <index run_id="runA" size="15" scanNumberList="2 3 4 6 7 8 10 11 12 14 > 15 16 18 19 20" offsetList="42 142 242 342 442 542 642 742 842 942 1042 > 1142 1242 1342 1442" /> > <index run_id="runB" size="15" scanNumberList="2 3 4 6 7 8 10 11 12 14 > 15 16 18 19 20" offsetList="1542 1642 1742 1842 1942 2042 2142 2242 2342 > 2442 2542 2642 2742 2842 2942" /> > > These space delimited lists seem much easier to parse and deal with, unless > there is some attribute length limit I'm ignorant of? Certainly human > readability of the index is not an issue because a human can just do a find > on the scan number and jump straight to it (even with, heaven forbid it, > Notepad.exe). Adherence to XML principles needn't be a concern because the > whole concept of the index is outside the realm of pure XML principles. :) > I like the idea of reading in two sorted arrays, doing a binary search on > the first one to find the index of the desired scan number, and then using > that index to look up the offset in the second sorted array. I feel the > need... the need for speed. > |
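A sketch of how a reader might consume the index style Matt proposes above, resolving a scan number to a byte offset with a binary search; the element layout mirrors his example and is not an agreed-upon schema:

```python
import bisect
import xml.etree.ElementTree as ET

INDEX_XML = """<index run_id="runA" size="12">
  <scanNumberList>2 3 4 6 7 8 10 11 12 14 15 16</scanNumberList>
  <offsetList>42 142 242 342 442 542 642 742 842 942 1042 1142</offsetList>
</index>"""

def load_index(xml_text):
    elem = ET.fromstring(xml_text)
    scans = [int(s) for s in elem.findtext("scanNumberList").split()]
    offsets = [int(s) for s in elem.findtext("offsetList").split()]
    return scans, offsets

def offset_for_scan(scans, offsets, scan_number):
    # scan numbers are ascending by definition, so a binary search applies
    i = bisect.bisect_left(scans, scan_number)
    if i == len(scans) or scans[i] != scan_number:
        raise KeyError(scan_number)
    return offsets[i]

scans, offsets = load_index(INDEX_XML)
print(offset_for_scan(scans, offsets, 10))   # -> 642
```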
From: Brian P. <bri...@in...> - 2007-08-04 23:18:01
Matt - Thanks for the info. Hopefully it shows up in the schema comments. Josh and I were kicking around the index idea off list (by accident, the reply-to default tripped us up...). I agree that my initial idea is more verbose than it should be. I worry though that those lists you propose could get very long very fast and become troublesome to some XML parser libs. Josh also suggested a more structured approach, here's my tweak of that: (same example as before, but with runs named "Bob" and "fizzle" to avoid implying any structured name conventions) <index run_id="Bob"> ... <offset scan="1041" 4212696/> <offset scan="1042" 4218791/> </index> <index run_id="fizzle"> <offset scan="1" 4221806/> <offset scan="2" 4227580/> <offset scan="3" 4231174/> ... </index> Not nearly as tight as yours, but easier to write an XML handler for it, and less likely to upset XML purists (who are already outraged by the idea of an index, so maybe that's not a problem...). We could use smaller names, too, "fpos" for "offset", etc. Brian -----Original Message----- From: psi...@li... [mailto:psi...@li...] On Behalf Of Matt Chambers Sent: Friday, August 03, 2007 7:44 PM To: psi...@li... Subject: Re: [Psidev-ms-dev] compressionType (RE: mzML 0.93 ready for first review) Brian Pratt wrote: > Oh, and to revisit the original question, a snippet of a multi-run > index would look like something like this, I think, assuming the > runList contained a couple of runs with id="runA" and id="runB" > respectively: > > <offset run_id="runA" spectrum_id="1041">4212696</offset><offset > run_id="runA" spectrum_id="1042">4218791</offset> <offset > run_id="runB" spectrum_id="1">4221806</offset> <offset run_id="runB" > spectrum_id="2">4227580</offset> <offset run_id="runB" > spectrum_id="3">4231174</offset> > Wouldn't a simpler, faster, and easier index be something like: <index run_id="runA" size="15" scanNumberList="2 3 4 6 7 8 10 11 12 14 15 16 18 19 20" offsetList="42 142 242 342 442 542 642 742 842 942 1042 1142 1242 1342 1442" /> <index run_id="runB" size="15" scanNumberList="2 3 4 6 7 8 10 11 12 14 15 16 18 19 20" offsetList="1542 1642 1742 1842 1942 2042 2142 2242 2342 2442 2542 2642 2742 2842 2942" /> These space delimited lists seem much easier to parse and deal with, unless there is some attribute length limit I'm ignorant of? Certainly human readability of the index is not an issue because a human can just do a find on the scan number and jump straight to it (even with, heaven forbid it, Notepad.exe). Adherence to XML principles needn't be a concern because the whole concept of the index is outside the realm of pure XML principles. :) I like the idea of reading in two sorted arrays, doing a binary search on the first one to find the index of the desired scan number, and then using that index to look up the offset in the second sorted array. I feel the need... the need for speed. Regarding the spectrumID stuff: > Unsure, actually, whether spectrum_id (the spectrum::id string value) > or spectrum_scanNumber (the spectrum::scanNumber int value) is better > since we don't specify a uniqueness constraint for either. > > These could use a bit of commenting, I think: > > <xs:complexType name="SpectrumType"> ... > > > <xs:attribute name="id" type="xs:string" use="required"/> > <xs:attribute name="scanNumber" type="xs:int" use="required"/> > > I'm guessing that in practice these will tend to be one and the same, > which leads one to wonder why we have them both, which suggests a bit > more documentation would be in order. 
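To make the lookup strategy discussed in this thread concrete, here is a minimal C++ sketch of the idea Matt describes: keep each run's scan numbers and byte offsets as two parallel sorted arrays, binary-search the scan number, and use the matching position to read the offset. The type and function names are illustrative assumptions, not part of any proposed schema or reader API.

#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <cstdio>
#include <map>
#include <string>
#include <vector>

// Per-run index: scan numbers and byte offsets kept as parallel arrays,
// both sorted ascending so position i in one pairs with position i in the other.
struct RunIndex {
    std::vector<int> scanNumbers;
    std::vector<std::int64_t> offsets;
};

// Binary search for a scan number; returns the byte offset, or -1 if the
// scan is not present in this run's index.
std::int64_t findOffset(const RunIndex& idx, int scan) {
    auto it = std::lower_bound(idx.scanNumbers.begin(), idx.scanNumbers.end(), scan);
    if (it == idx.scanNumbers.end() || *it != scan) return -1;
    return idx.offsets[static_cast<std::size_t>(it - idx.scanNumbers.begin())];
}

int main() {
    // Values taken from the hypothetical "fizzle" run in the example above.
    std::map<std::string, RunIndex> indexByRun;
    indexByRun["fizzle"] = RunIndex{ {1, 2, 3}, {4221806, 4227580, 4231174} };

    std::printf("run fizzle, scan 2 -> offset %lld\n",
                static_cast<long long>(findOffset(indexByRun["fizzle"], 2)));
    return 0;
}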
From: Matt C. <mat...@va...> - 2007-08-04 02:45:54
|
Brian Pratt wrote: > Oh, and to revisit the original question, a snippet of a multi-run > index would look like something like this, I think, assuming the > runList contained a couple of runs with id="runA" and id="runB" > respectively: > > <offset run_id="runA" spectrum_id="1041">4212696</offset><offset > run_id="runA" spectrum_id="1042">4218791</offset> <offset > run_id="runB" spectrum_id="1">4221806</offset> <offset run_id="runB" > spectrum_id="2">4227580</offset> <offset run_id="runB" > spectrum_id="3">4231174</offset> > Wouldn't a simpler, faster, and easier index be something like: <index run_id="runA" size="15" scanNumberList="2 3 4 6 7 8 10 11 12 14 15 16 18 19 20" offsetList="42 142 242 342 442 542 642 742 842 942 1042 1142 1242 1342 1442" /> <index run_id="runB" size="15" scanNumberList="2 3 4 6 7 8 10 11 12 14 15 16 18 19 20" offsetList="1542 1642 1742 1842 1942 2042 2142 2242 2342 2442 2542 2642 2742 2842 2942" /> These space delimited lists seem much easier to parse and deal with, unless there is some attribute length limit I'm ignorant of? Certainly human readability of the index is not an issue because a human can just do a find on the scan number and jump straight to it (even with, heaven forbid it, Notepad.exe). Adherence to XML principles needn't be a concern because the whole concept of the index is outside the realm of pure XML principles. :) I like the idea of reading in two sorted arrays, doing a binary search on the first one to find the index of the desired scan number, and then using that index to look up the offset in the second sorted array. I feel the need... the need for speed. Regarding the spectrumID stuff: > Unsure, actually, whether spectrum_id (the spectrum::id string value) > or spectrum_scanNumber (the spectrum::scanNumber int value) is > better since we don't specify a uniqueness constraint for either. > > These could use a bit of commenting, I think: > > <xs:complexType name="SpectrumType"> ... > > > <xs:attribute name="id" type="xs:string" use="required"/> > <xs:attribute name="scanNumber" type="xs:int" use="required"/> > > I'm guessing that in practice these will tend to be one and the same, > which leads one to wonder why we have them both, which suggests a > bit more documentation would be in order. Also, any uniqueness > constraints should be documented (we probably don't want either > scanNumber or id being reused within a Run). > > Perhaps the "id" attribute could be made optional, for those cases > where it's just going to be a repeat of scanNumber? Or the other way > around (for a relative savings of 8 characters per scan!). > > Out of curiousity, what's the scenario in which id isn't the same as > scanNumber? > > - Brian > To quote from an earlier mailing list posting by Eric Deutsch, responding to me (after which there was strangely over a month of no list activity): > >> - The validator will enforce that scan numbers are in ascending > >> order, but not necessarily without gaps - The validator will > >> enforce that scan numbers and identifiers must be unique within a > >> run (but there could be multiple runs in a file) > >> > > I'm confused about the difference between identifiers and scan > > numbers. Since a mzML file can have more than one spectra source > > (e.g. multiple RAW files), scan numbers could only be unique within > > a run, as you say, but I would expect that the "SpectrumID" > > identifier, if it is different from the scan number, should be > > unique to the whole file. What is the reasoning > > You are correct, my error. 
> > > behind the SpectrumID identifier being unique only to a run, or am > > I misunderstanding? What is the purpose of having a separate > > SpectrumID identifier anyway? > > To allow LSIDs for individual spectra or some other non-integer IDs > if desired. I like the capability of having arbitrary spectrum ids, but it must be made clear whether they are unique to a FILE or unique to a RUN. It has already been given that the scan number will be unique within a run and so I assume that means the id must be unique in some way as well. If the id is just an S and then a scan number, then I agree with Brian. I suggest that unless the file writer has a compelling reason to do so (e.g. the user told it to or it comes from the source file as something special), the id attribute should be left out (so it should be optional). File readers should treat the id as bonus information unless there is a compelling reason to require it. -Matt |
From: Brian P. <bri...@in...> - 2007-08-04 00:48:42
|
These could use a bit of commenting, I think: <xs:complexType name="SpectrumType"> ... <xs:attribute name="id" type="xs:string" use="required"/> <xs:attribute name="scanNumber" type="xs:int" use="required"/> I'm guessing that in practice these will tend to be one and the same, which leads one to wonder why we have them both, which suggests a bit more documentation would be in order. Also, any uniqueness constraints should be documented (we probably don't want either scanNumber or id being reused within a Run). Perhaps the "id" attribute could be made optional, for those cases where it's just going to be a repeat of scanNumber? Or the other way around (for a relative savings of 8 characters per scan!). Out of curiosity, what's the scenario in which id isn't the same as scanNumber? - Brian |
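If the id attribute did become optional, a reader could fall back to the scan number, along the lines of the "S" plus scan number pattern mentioned elsewhere in this thread. A tiny hedged C++ sketch; the helper name and the fallback format are assumptions, not anything the schema defines.

#include <string>

// Hypothetical helper, not part of any schema or API: if the optional id
// attribute is missing, derive an identifier from the scan number.
std::string effectiveSpectrumId(const std::string& idAttr, int scanNumber) {
    if (!idAttr.empty()) return idAttr;          // writer supplied an explicit id
    return "S" + std::to_string(scanNumber);     // fall back to the scan number
}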
From: Brian P. <bri...@in...> - 2007-08-04 00:26:46
|
Do we want to place any constraints on compressionType? I'd at least start with "none" and "zlib". And it seems kind of a shame to make writers declare compressionType="none"; can we default it to that? At a minimum, we should place this info in a comment if we don't want to get fancy with an actual constrained type. - Brian |
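For a sense of what a reader would have to do when compressionType="zlib" is allowed, here is a hedged C++ sketch. It assumes the peak list arrives as base64 text wrapping zlib-compressed 64-bit floats, in the spirit of how mzXML 3.0 handles compressed peak lists; the draft schema itself does not spell these details out, so treat the encoding chain as an assumption. Only standard zlib calls are used.

#include <cstddef>
#include <cstring>
#include <stdexcept>
#include <string>
#include <vector>
#include <zlib.h>

// Compact base64 decoder; skips '=' padding and whitespace.
std::vector<unsigned char> base64Decode(const std::string& text) {
    static const std::string alphabet =
        "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";
    std::vector<unsigned char> out;
    int value = 0, bits = 0;
    for (char c : text) {
        if (c == '=' || c == '\n' || c == '\r' || c == ' ' || c == '\t') continue;
        std::string::size_type pos = alphabet.find(c);
        if (pos == std::string::npos) throw std::runtime_error("bad base64 character");
        value = (value << 6) | static_cast<int>(pos);
        bits += 6;
        if (bits >= 8) {
            bits -= 8;
            out.push_back(static_cast<unsigned char>((value >> bits) & 0xFF));
        }
    }
    return out;
}

// Decode one peak array: base64 text -> zlib inflate -> 64-bit floats.
// expectedBytes would come from array-length metadata (8 bytes per value).
std::vector<double> decodeZlibPeaks(const std::string& base64Text, std::size_t expectedBytes) {
    std::vector<unsigned char> compressed = base64Decode(base64Text);
    std::vector<unsigned char> raw(expectedBytes);
    uLongf rawLen = static_cast<uLongf>(raw.size());
    if (uncompress(raw.data(), &rawLen, compressed.data(),
                   static_cast<uLong>(compressed.size())) != Z_OK)
        throw std::runtime_error("zlib uncompress failed");
    std::vector<double> values(rawLen / sizeof(double));
    std::memcpy(values.data(), raw.data(), values.size() * sizeof(double));
    return values;  // note: assumes the writer's byte order matches the reader's
}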
From: Brian P. <bri...@in...> - 2007-08-04 00:05:52
|
I note in mzML0.93.xsd that use of AcquisitionListType::spectrumType is "prohibited" - probably an errant mouse click in XMLSpy. - Brian |
From: Brian P. <bri...@in...> - 2007-08-03 23:58:41
|
Thanks all for clarifying points and thoughtful discussion. And apparently I *was* there when the decision was made... :) Chris, as he so often does, has hit the nail on the head in mentioning the need to "try to get people to move away from informal mechanisms (like use of folder trees or zips, ad hoc file naming 'formalisms' etc.)". The use of that sort of tribal knowledge (especially ad hoc file naming formalisms) is deeply ingrained in the current mass spec software ecosystem, and the less we have of it in future the better. But I'm left wondering whether mzML is meant to be evolutionary (it seems to be described that way, as a merging of the best aspects of mzData and mzXML), or revolutionary ("...get people to move away from..."). The thing with APIs, systems, data format standards etc, is that you can try to evolve and extend them toward something better, but you risk winding up with something like Windows Vista that's just an utter mess, compared to Apple's OSX where there was a pretty clean break with the past, to good effect. I guess what's bugging me is that I'm not sure this feature breaks things thoroughly enough. While it's certainly easy enough to slip mzML in behind existing mass spec reader APIs and just throw an exception when the run count is greater than one, it feels half baked. I worry about going out the gate with features we don't think anyone will actually support (not that RAMP and its users are the whole world of mzML consumers, it's just the part of the world I'm familiar with, so this fear may be unfounded). Of course if we wait to do anything new until the Grand Unified Schema is ready, we won't get anything new done. In the end I defer to the wisdom of the group. Oh, and to revisit the original question, a snippet of a multi-run index would look something like this, I think, assuming the runList contained a couple of runs with id="runA" and id="runB" respectively: <offset run_id="runA" spectrum_id="1041">4212696</offset> <offset run_id="runA" spectrum_id="1042">4218791</offset> <offset run_id="runB" spectrum_id="1">4221806</offset> <offset run_id="runB" spectrum_id="2">4227580</offset> <offset run_id="runB" spectrum_id="3">4231174</offset> Unsure, actually, whether spectrum_id (the spectrum::id string value) or spectrum_scanNumber (the spectrum::scanNumber int value) is better since we don't specify a uniqueness constraint for either. Thanks, Brian -----Original Message----- From: psi...@li... [mailto:psi...@li...] On Behalf Of Chris Taylor Sent: Friday, August 03, 2007 2:19 AM Cc: psi...@li... Subject: Re: [Psidev-ms-dev] mzML 0.93 ready for first review <snip> |
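Brian's remark above, that an existing one-file/one-run API could accept mzML by handling only the single-run case and raising an error otherwise, would look roughly like this. A hedged C++ sketch; the Run type and function name are placeholders, not RAMP or any real API.

#include <stdexcept>
#include <string>
#include <vector>

// Stand-in for whatever a reader keeps per run; not a real mzML object model.
struct Run {
    std::string id;
    // spectra, index, instrument metadata, ...
};

// Shim for a one-file/one-run API: accept a multi-run-capable file only when
// the run list actually contains a single run, and refuse anything else.
const Run& selectOnlyRun(const std::vector<Run>& runList) {
    if (runList.size() != 1)
        throw std::runtime_error("this reader handles exactly one run per mzML file");
    return runList.front();
}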
From: Matthew C. <mat...@va...> - 2007-08-03 17:27:08
|
> -----Original Message----- > From: psi...@li... [mailto:psidev-ms-dev- > bo...@li...] On Behalf Of Eric Deutsch > Sent: Friday, August 03, 2007 2:00 AM > To: psi...@li... > Subject: Re: [Psidev-ms-dev] mzML 0.93 ready for first review > > If I recall correct the feature of multiple runs crept in at the end of > the Seattle meeting last fall. Can anyone articulate a compelling use case > for multiple runs per file? I seem to recall a scenario where at least > one vendor encodes multiple runs in a single (wiff??) file, but I don't > know about any of that for sure. Anyone have such a case? > Yes. Our protein assembly software compiles and validates IDs from multiple runs into a single, assembled file. At the same time, it supports an arbitrary hierarchical organization of those runs using the file system tree paradigm: / all IDs ("root") /foo IDs in /foo /foo/run1 /foo/run2 /bar IDs in /bar /bar/run1 /bar/run2 We put the entire assembly into one XML file (our own schema, but ideally analysisXML will eventually supplant it), but we currently have no way of linking the IDs back to the source spectra. Copying the peak lists and metadata into our protein assembly file seems absurd, and linking each run to a separate mzData or mzXML file that must be in a relative path to that file is also not a pleasing idea. On the other hand, if we could assemble a single spectra data file consisting of only the identified spectra and containing all the runs that are in the protein assembly file, it allows us to keep a one-to-one correspondence between analysis and spectra - without having dozens of files lying around to support that single analysis. This would allow us to easily support spectrum/ID visualization without a lot of file organization overhead. As long as analysisXML will also support multiple runs per file, this feature in mzML makes a lot of sense - at least to me. > > I have a few comments: > > > > - There seems to be a timestamp on the run element now (maybe I just > > missed it before), of type xs:dateTime. It's an optional attribute and it > > has an ambiguous meaning. Why isn't this expanded into a start and stop > > timestamp for the run? Also, why is it optional? > > The believe the intent was that the timestamp is the UT at the start of > the run. We should clarify this. > > Is it useful to encode the stop timestamp? > > As for why optional, we imagined that in the real world this value might > not be known properly. Imagine a scenario were someone is converting a > legacy mzXML file to mzML. This information may not be available, sadly. > It is certainly encouraged that modern converters/writers include it. Yes the stop time is useful. It clarifies the meaning of the timestamp and otherwise, the stop time would have to be inferred from the last spectrum's retention time, which seems unnecessarily messy. Also, I think it should be required, but if a writer doesn't know or the time is inapplicable, some equivalent of "NaN" should be acceptable (like 01/01/0001@00:00:00 or something). > cvLabel is really just an id to indicate which CV (as more completely > defined within <cvList> above) the term comes from. It seems to be current > best practice that life science CV accession numbers begin with an OBO > namespace, :, and a number. But not all CVs will necessarily follow this > convention as far as I'm aware. Further clarification and/or standardization would be good on this point. > > - I see a TODO item is giving the binaryDataArray's "dataType" attribute a > > CV entry. 
I agree with this. But I think the values should be more > > machine-oriented, like "float32", "float64", "int32", "uint64", etc. > > You mean "float32" is preferred over "32-bit float" Yes. Anybody who knows what "32-bit float" means also knows what "float32" means and the latter is easier to parse and looks nicer as an attribute IMO. :) > For LC-MSn ion trap data, this is relatively straightforward and is what > is depicted in the examples. A run is a series of scans, usually counted > consecutively by the instrument, obtained as a sample is injected into the > instrument. A sample is the biomaterial that is injected into an > instrument over a run. The source file is the one or more files from > which the mzML was generated. It will usually be a single vendor-format > raw file. It could be an mzXML file. It could (unfortunately) be a series > of dta files. > > For MALDI or gel spot processing, however, this might be quite different. > We had previously entertained "analyte identifier" or "MALDI spot > identifier" CV terms to allow annotating each spot. This might make the > run-based sample undefined. I think LC-MSn heavily colored our thinking > during development. It would be extremely nice to have a detailed example > of data where individual scan refer to different "analyte identifiers" or > the like. Would someone contribute this? > Are you saying that the source file list could be a list of 10,000 DTA files? Oh my aching heart. In that case, a run element's sourceFileRefList would have to point to each sourceFile element in turn. That would be absurd. Multiple sourceFileLists would have to be supported and the run's reference to it would need to reference the sourceFileList instead of each individual sourceFile. Nevertheless, you see what I mean about them being tightly coupled? It seems like each run could have its own source file (list) and sample element/attribute instead of references to a source file (list) and/or sample. The references concept seems to represent these tightly coupled files/IDs as loosely coupled. Unless multiple runs can be generated from the exact same sample and/or file? I think I'm bordering on hypocritical here (vs. multiple runs per file) but that's probably because I'm ignorant of the mass spec concepts in play. -Matt |
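Matt's point about machine-oriented dataType names is easy to see in code: a reader can dispatch directly on the string when widening decoded bytes. A hedged C++ sketch, assuming the "float32"/"float64" values proposed above (not an adopted CV) and a byte buffer that has already been base64/zlib decoded; nothing here comes from the schema itself.

#include <cstring>
#include <stdexcept>
#include <string>
#include <vector>

// Widen a decoded byte buffer to doubles, dispatching on a machine-oriented
// dataType string. Integer types would be handled the same way.
std::vector<double> widenToDouble(const std::vector<unsigned char>& bytes,
                                  const std::string& dataType) {
    std::vector<double> out;
    if (dataType == "float64") {
        out.resize(bytes.size() / sizeof(double));
        std::memcpy(out.data(), bytes.data(), out.size() * sizeof(double));
    } else if (dataType == "float32") {
        std::vector<float> tmp(bytes.size() / sizeof(float));
        std::memcpy(tmp.data(), bytes.data(), tmp.size() * sizeof(float));
        out.assign(tmp.begin(), tmp.end());
    } else {
        throw std::runtime_error("unsupported dataType: " + dataType);
    }
    return out;
}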
From: Matthew C. <mat...@va...> - 2007-08-03 15:23:02
|
> -----Original Message----- > From: psi...@li... [mailto:psidev-ms-dev- > bo...@li...] On Behalf Of Brian Pratt > Sent: Thursday, August 02, 2007 7:03 PM > To: psi...@li... > Subject: Re: [Psidev-ms-dev] mzML 0.93 ready for first review > > So why all the excitement about API stability? Consider this: originally, > RAMP read mzXML only. Then we added the ability to read mzData. Now, all > of the many programs that employ RAMP suddenly could read both mzData and > mzXML with nothing more than a recompilation (OK, that first time actually > required a small RAMP API tweak - using a RAMPFILE handle instead of a FILE > handle). Later we added mzXML 3.0 with its compressed peak lists, and RAMP > users only needed to recompile to get this additional capability - no > "downstream" changes needed. There have even been unreleased versions of > RAMP that read intermediate proposed forms of mzML. Such ease of adoption > is very powerful when trying to establish a new data standard. But guess > what? RAMP can't be made to transparently handle the current proposed mzML > format due to the breaking of the one file / one run mapping. OK. So when changes are made to RAMP/JRAP's internal parser to support mzML, they can by default operate on only the first run in a file. Crisis averted. :) Existing programs depending on RAMP/JRAP will then work with mzML (assuming the input files are using the highly probable one run per file paradigm) and the API developers are then free to work on a new or extended API to support multi-run files, which RAMP/JRAP users can adopt at their leisure (i.e. whenever they need the multi-run support). To be honest, looking at the RAMP header, it doesn't seem like it would be that daunting to extend support to multi-runs. From the point of view of the program, instead of just asking for a scan number, an extended API function might require both the run id (by index or by name) and the scan number. Or there might be a function to change between the multiple runs and the existing by-scan-number functions could work on the current run only. After that is implemented, supporting multiple runs in an input file is all up to the API users to support changing between the multiple runs. That could be as simple as a selection box to choose the run (or an outer loop to iterate through all runs in a file) in addition to the existing selection box to choose the scan number (or an inner loop to iterate through all scans in a run). > Truly new mass spec behaviors will eventually make it necessary to expand or > even break the current mass spec data reader APIs. Multiple precursors are > actually a good example of this (as an expansion, hopefully). But, breaking > the one run / one file relationship isn't driven by new mass spec behaviors > that I know of. Getting API users accustomed to API extensions to support new features is a good thing, don't you agree? The multiple precursors feature will break RAMP more than multiple runs will as far as I can tell. No longer would API users be able to look at the precursorMz field of the scan header struct, it would be an array. Even in this case though, you can implement it as an extension so that existing programs can use the precursorMz field which always refers to the first precursor in the list, while updated API users can use the array field, which would contain all the precursors in the list. You could even deprecate the precursorMz field and say it'll go away eventually, that's always fun. > What is the use case for this feature, anyway? 
What's so > compelling about having multiple runs in a single mzML file that everyone > will want to massively rejigger their code to implement this? Seems like > we're just creating an orphan feature that will only serve to trip up unwary > mzML output writers ("nice multi-run output ya got there - too bad nobody > can read it"), which I think is exactly the kind of thing the committee said > they wanted to avoid. > > - Brian I don't think I knew what a mass spectrometer was 10 years ago or so when the DTA format was coming around (I wasn't in high school yet!), but I imagine that the argument for DTA files (i.e. there aren't very many spectra, so why concatenate them?) was not very unlike your argument here. It is of course a completely different scale and I doubt anyone will ever have I/O overhead problems due to reading too many (relatively) tiny mzML files, but it's the principle of the thing. :) I can definitely see reasons for grouping multiple SCX fractions or IEP bands from a single sample into one file, if only for organizational purposes. -Matt |
From: Chris T. <chr...@eb...> - 2007-08-03 09:19:09
|
Hi all. It's verging on tangential, but in my group we recently drafted up a schema to index multi-omics data sets in three (omics-specific) repositories. That meant we had to have a concept of an 'assaying process' (shortened to assay). For a microarray that was basically a single hybridisation to one array; some chips come with four arrays on one slide = four assays; technical replicates = separate arrays. Basically the rationale was to address the smallest (atomic, if you like) unit. For MS (proteomics first, but also metabolomics) we settled on one run = one assay for inline LCMS (i.e. a run lasting many minutes). This automatically gives offline a cardinality of one assay per (previously collected) fraction run. For MALDI (the point of all this mumbling), we settled on one spot (no matter how many shots) = one assay, because although the various spots will likely have a connection, that isn't guaranteed. So one plate (if one were recording plates) links to n assays (in reality the fact that there was a set of spots on a particular plate is kind of ignorable but for QC and the like). Where I see a further problem is for MS 'imaging' (i.e. shooting lots at tissue slices to map protein distributions) and isn't there a MALDI-like source that isn't discrete spots? Maybe I misremembered that one though... In either (the latter admittedly imaginary) case though I suspect all one could robustly consider as a 'run' or assay would be the analysis of one set of coordinates for the laser. Don't know if that helps much :) Btw although I take the point that was made about tarballs, to play devil's advocate for a sec, the point of some of the standardisation stuff is to try to get people to move away from informal mechanisms (like use of folder trees or zips, ad hoc file naming 'formalisms' etc.) to associate files (/datasets). If the ability to combine multiple runs/assays in one mzML file _is_ more trouble than it is worth, a compromise might be to leverage whatever can be used to uniquely ID an mzML file (have LSIDs died yet or what?) to x-ref one file to another (i.e. insert a [0..*] element somewhere near the top with an attribute to hold an external file ref and a sibling string attribute to hold a free text description)? Or, as was suggested, produce a wrapper schema. This should probably be a (lightweight) FuGE-based thing as it is all in there already (CPAS does this iirc, although with an earlier version of FuGE). Cheers, Chris. Eric Deutsch wrote: >> From: psi...@li... [mailto:psidev-ms-dev- >> bo...@li...] On Behalf Of Matthew Chambers >> >> What's wrong with the schema supporting multiple runs per file and letting >> implementers gradually add support for it? There are many features of >> mzML that will require substantial rewrites of the existing parser APIs. >> Parameter groups, multiple runs, multiple precursors, and compressed >> binary data are all major "completely predictable trouble spots." As long >> as the file readers develop faster than the file writers, there won't be a >> problem. ;) I very much doubt that writers (e.g. ReAdW) will be writing >> multiple instrument files into one mzML file any time soon (unless >> somebody is itching to do this without saying so?). The parameter groups >> and multiple precursors are more problematic, IMO, but still good >> improvements. > > If I recall correct the feature of multiple runs crept in at the end of the Seattle meeting last fall. Can anyone articulate a compelling use case for multiple runs per file? 
I seem to recall a scenario where at least one vendor encodes multiple runs in a single (wiff??) file, but I don't know about any of that for sure. Anyone have such a case? > >> I have a few comments: >> >> - There seems to be a timestamp on the run element now (maybe I just >> missed it before), of type xs:dateTime. It's an optional attribute and it >> has an ambiguous meaning. Why isn't this expanded into a start and stop >> timestamp for the run? Also, why is it optional? > > The believe the intent was that the timestamp is the UT at the start of the run. We should clarify this. > > Is it useful to encode the stop timestamp? > > As for why optional, we imagined that in the real world this value might not be known properly. Imagine a scenario were someone is converting a legacy mzXML file to mzML. This information may not be available, sadly. It is certainly encouraged that modern converters/writers include it. > >> - Most every cvParam has a "cvLabel" attribute that is "MS" but the >> accession attribute of each cvParam seems to include the cvLabel in it >> ("MS:xxxxxxxx"). If that is just a coincidence, I think it should be >> changed so that it is required and the cvLabel can be eliminated. If it's >> not a coincidence, why is like that? If the parser needs to know which >> vocabulary an accession number is from, it can parse until the colon >> delimiter. Alternatively, keeping cvLabel and getting rid of the "MS:" in >> the accession attribute would allow somewhat more efficient parsing. In >> the alternative case, I suggest a required default cvLabel somewhere in >> the header, similar to setting the default XML namespace. > > cvLabel is really just an id to indicate which CV (as more completely defined within <cvList> above) the term comes from. It seems to be current best practice that life science CV accession numbers begin with an OBO namespace, :, and a number. But not all CVs will necessarily follow this convention as far as I'm aware. > >> - I see a TODO item is giving the binaryDataArray's "dataType" attribute a >> CV entry. I agree with this. But I think the values should be more >> machine-oriented, like "float32", "float64", "int32", "uint64", etc. > > You mean "float32" is preferred over "32-bit float" > >> - Parameter groups are good, especially since the spectrum headers seem to >> have ballooned to be more flexible. Anything that makes the file- >> dominating spectrum elements smaller and faster to parse is nice - >> indexing the shared parameters is a good way to do this. >> >> - I'd still like to see a clear definition of "run" relative to "sample" >> and "source file." Seems like these three are all tightly coupled. > > For LC-MSn ion trap data, this is relatively straightforward and is what is depicted in the examples. A run is a series of scans, usually counted consecutively by the instrument, obtained as a sample is injected into the instrument. A sample is the biomaterial that is injected into an instrument over a run. The source file is the one or more files from which the mzML was generated. It will usually be a single vendor-format raw file. It could be an mzXML file. It could (unfortunately) be a series of dta files. > > For MALDI or gel spot processing, however, this might be quite different. We had previously entertained "analyte identifier" or "MALDI spot identifier" CV terms to allow annotating each spot. This might make the run-based sample undefined. I think LC-MSn heavily colored our thinking during development. 
It would be extremely nice to have a detailed example of data where individual scan refer to different "analyte identifiers" or the like. Would someone contribute this? > > Regards, > Eric > >> >> -Matt Chambers >> >> Vanderbilt MSRC >> >> >> >> ________________________________ >> >> From: psi...@li... [mailto:psidev-ms-dev- >> bo...@li...] On Behalf Of Brian Pratt >> Sent: Thursday, August 02, 2007 2:47 PM >> To: psi...@li... >> Subject: Re: [Psidev-ms-dev] mzML 0.93 ready for first review >> >> >> >> (Note: I know I'm late to the party with this comment, but I think it's >> important) >> >> >> >> I noticed this in the todo file: >> >> " - Now that we're allowing multiple runs in a file, how will the index >> look to handle this?" >> >> >> >> Better question: what will software that uses such an index look like? >> >> >> >> Answer: it won't look much like anything that currently reads mzXML and >> mzData - including X!Tandem or anything using RAMP (TPP and others) or >> JRAP (CPAS and others). These programs easily deal with both mzData and >> mzXML in their various versions by using APIs which, as it happens, assume >> one file per run and one run per file. Breaking this one to one >> correspondence in mzML means you can't just slide mzML support in behind >> the API, and of course also violates a fundamental assumption which flows >> through the code that calls these APIs, right out to the user interface in >> most cases. This means extensive surgery to any program that wants to >> read mzML properly, and my guess is that means mzML is DOA. At a minimum >> it becomes a completely predictable trouble spot since you can now write >> legal mzML files that the majority of mzML readers will simply not know >> how to handle. They'll be OK with RunList::count == 1, but no more - so, >> why set ourselves up for trouble? >> >> >> >> Multiple runs per file are probably useful in some cases, but if the >> stated goal of mzML is to replace mzXML and mzData then I think this >> feature is actually scope creep which threatens the mission and should be >> dropped. Let those who really want this feature come up with a wrapper >> schema, but don't call it mzML lest you force the vast majority of mzML >> consuming software to be broken from the start. >> >> >> >> - Brian >> >> >> >> ________________________________ >> >> From: psi...@li... [mailto:psidev-ms-dev- >> bo...@li...] On Behalf Of Eric Deutsch >> Sent: Thursday, August 02, 2007 1:02 AM >> To: len...@eb...; Jimmy Eng; lu...@eb...; Puneet Souda; >> Joshua Tasman; Pierre-Alain Binz; Henning Hermjakob; Randy Julian; Andy >> Jones; David Creasy; Sean L Seymour; Angel Pizarro; David Fenyo; >> Jam...@wa...; Mike Coleman; Matthew Chambers; Helen Jenkins; >> Philip Jones; Shofstahl, Jim; Brian Pratt; Andreas Römpp; Kent Laursen; >> Martin Eisenacher; Fredrik Levander; Jayson Falkner; Pedrioli Patrick Gino >> Angelo; Hans Vissers; Eric Deutsch; cl...@br...; >> dav...@ag...; rb...@be...; psidev-ms- >> de...@li... >> Cc: Rolf Apweiler; Ruedi Aebersold >> Subject: [Psidev-ms-dev] mzML 0.93 ready for first review >> >> Hi everyone, after considerable hard work from many people, we have a >> prerelease of mzML (the union of mzData and mzXML) available for comment >> by you, a major stakeholder in mzML. 
>> >> You may download a kit of material to examine at: >> >> http://db.systemsbiology.net/projects/PSI/mzML/mzML_beta1R1.zip >> >> The general mzML development page is at: >> >> http://psidev.info/index.php?q=node/257 >> >> Please send feedback to: >> >> psi...@li... >> >> We ask that you respond by August 20. >> >> Additional releases with more information may be provided during the >> coming month. >> >> The current format has been guided by these principles: >> >> - Keep the format simple >> >> - Minimize alternate ways of encoding the same information >> >> - Allow some flexibility for encoding new important information >> >> - Support the features of mzData and mzXML but not a lot more >> >> - But do provide clear support for SRM data >> >> - Finish the format soon with the resources available >> >> There are many enhancements that have been suggested, but the small group >> of volunteers that have actively developed this format have opted to focus >> on the primary goal set before us: develop a single format that the >> vendors and current software can easily support and thereby obsolete >> mzData and mzXML. The enhancements not considered compatible with this >> goal will be entertained for mzML 2.0 >> >> We are committed to providing not just the format, but also a set of >> working implementations, converters and readers, as well as a format >> validator, all to ensure that mzML is a format that will be adopted >> quickly and implemented uniformly. Prior to submission to the PSI document >> process, the following software will implement mzML: >> >> - 2 or more converters from vendor formats to mzML >> >> - the popular reader library RAMP that currently supports mzData and mzXML >> >> - an mzML semantic validator that checks for correct implementation >> >> We hope to follow this schedule: >> >> 2007-08-02 Release of mzML beta1R1 to major stakeholders for comment >> >> 2007-08-20 Comments from major stakeholders received >> >> 2007-09-01 Revised mzML 1.0 submitted to PSI document process, beginning >> 30 days internal review >> >> 2007-10-01 Revised mzML 1.01 begins 60 days community review >> >> 2007-10-06 Formal announcement that feedback is sought at HUPO world >> congress >> >> 2007-12-01 Formal 60 days community review closes >> >> 2008-01-01 Revised mzML 1.02 officially released >> >> Thank you for your help! Feel free to forward this message to someone whom >> you think should review the format at this stage. >> >> Regards, >> >> Eric > > > ------------------------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. > Still grepping through log files to find problems? Stop. > Now Search log events and configuration files using AJAX and a browser. > Download your FREE copy of Splunk now >> http://get.splunk.com/ > _______________________________________________ > Psidev-ms-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-ms-dev > -- ~~~~~~~~~~~~~~~~~~~~~~~~ chr...@eb... http://mibbi.sf.net/ ~~~~~~~~~~~~~~~~~~~~~~~~ |
From: Eric D. <ede...@sy...> - 2007-08-03 06:59:40
|
> From: psi...@li... [mailto:psidev-ms-dev- > bo...@li...] On Behalf Of Matthew Chambers > > What's wrong with the schema supporting multiple runs per file and letting > implementers gradually add support for it? There are many features of > mzML that will require substantial rewrites of the existing parser APIs. > Parameter groups, multiple runs, multiple precursors, and compressed > binary data are all major "completely predictable trouble spots." As long > as the file readers develop faster than the file writers, there won't be a > problem. ;) I very much doubt that writers (e.g. ReAdW) will be writing > multiple instrument files into one mzML file any time soon (unless > somebody is itching to do this without saying so?). The parameter groups > and multiple precursors are more problematic, IMO, but still good > improvements. If I recall correctly the feature of multiple runs crept in at the end of the Seattle meeting last fall. Can anyone articulate a compelling use case for multiple runs per file? I seem to recall a scenario where at least one vendor encodes multiple runs in a single (wiff??) file, but I don't know about any of that for sure. Anyone have such a case? > I have a few comments: > > - There seems to be a timestamp on the run element now (maybe I just > missed it before), of type xs:dateTime. It's an optional attribute and it > has an ambiguous meaning. Why isn't this expanded into a start and stop > timestamp for the run? Also, why is it optional? I believe the intent was that the timestamp is the UT at the start of the run. We should clarify this. Is it useful to encode the stop timestamp? As for why optional, we imagined that in the real world this value might not be known properly. Imagine a scenario where someone is converting a legacy mzXML file to mzML. This information may not be available, sadly. It is certainly encouraged that modern converters/writers include it. > - Most every cvParam has a "cvLabel" attribute that is "MS" but the > accession attribute of each cvParam seems to include the cvLabel in it > ("MS:xxxxxxxx"). If that is just a coincidence, I think it should be > changed so that it is required and the cvLabel can be eliminated. If it's > not a coincidence, why is it like that? If the parser needs to know which > vocabulary an accession number is from, it can parse until the colon > delimiter. Alternatively, keeping cvLabel and getting rid of the "MS:" in > the accession attribute would allow somewhat more efficient parsing. In > the alternative case, I suggest a required default cvLabel somewhere in > the header, similar to setting the default XML namespace. cvLabel is really just an id to indicate which CV (as more completely defined within <cvList> above) the term comes from. It seems to be current best practice that life science CV accession numbers begin with an OBO namespace, :, and a number. But not all CVs will necessarily follow this convention as far as I'm aware. > - I see a TODO item is giving the binaryDataArray's "dataType" attribute a > CV entry. I agree with this. But I think the values should be more > machine-oriented, like "float32", "float64", "int32", "uint64", etc. You mean "float32" is preferred over "32-bit float"? > - Parameter groups are good, especially since the spectrum headers seem to > have ballooned to be more flexible. Anything that makes the file- > dominating spectrum elements smaller and faster to parse is nice - > indexing the shared parameters is a good way to do this. > > - I'd still like to see a clear definition of "run" relative to "sample" > and "source file." Seems like these three are all tightly coupled. For LC-MSn ion trap data, this is relatively straightforward and is what is depicted in the examples. A run is a series of scans, usually counted consecutively by the instrument, obtained as a sample is injected into the instrument. A sample is the biomaterial that is injected into an instrument over a run. The source file is the one or more files from which the mzML was generated. It will usually be a single vendor-format raw file. It could be an mzXML file. It could (unfortunately) be a series of dta files. For MALDI or gel spot processing, however, this might be quite different. We had previously entertained "analyte identifier" or "MALDI spot identifier" CV terms to allow annotating each spot. This might make the run-based sample undefined. I think LC-MSn heavily colored our thinking during development. It would be extremely nice to have a detailed example of data where individual scans refer to different "analyte identifiers" or the like. Would someone contribute this? Regards, Eric <snip> |
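On the cvLabel/accession point above: if every accession is written as cvLabel, colon, number (the "MS:xxxxxxxx" pattern), a parser can recover the label by splitting at the first colon, which is the redundancy Matt is pointing at. A small hedged C++ sketch; the function name is an assumption and is not part of any mzML tooling.

#include <string>
#include <utility>

// Split an accession of the form "<cvLabel>:<number>" into its label and
// local part. Returns an empty label when no colon is present, which is
// exactly the case where a separate cvLabel attribute would still be needed.
std::pair<std::string, std::string> splitAccession(const std::string& accession) {
    const std::string::size_type colon = accession.find(':');
    if (colon == std::string::npos) return { std::string(), accession };
    return { accession.substr(0, colon), accession.substr(colon + 1) };
}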
From: Brian P. <bri...@in...> - 2007-08-03 06:12:54
|
Mike - You're dead right, and AFAIK there isn't anything in the schema that's there just to serve RAMP or any other API. Well, you might say that about the (optional) index element, I suppose, which is actually where we started ("what does the index look like?")... I just don't want to see us adding stuff to the schema that breaks at least two popular mass spec reader APIs and gums up the internals of a lot of other code if we don't absolutely have to, and so far nobody has said why we absolutely have to (there may be an excellent reason, of course - I wasn't there when the decision was made - but it has the ring of "hey, wouldn't it be neat if related runs could travel in the same file?" [maybe it would - but that's scope creep and best handled by a wrapper schema, or just a tarball]). Brian -----Original Message----- From: Mike Coleman [mailto:tu...@gm...] Sent: Thursday, August 02, 2007 7:58 PM To: Brian Pratt Cc: psi...@li... Subject: Re: [Psidev-ms-dev] mzML 0.93 ready for first review <snip> I'm not sure what to think about the multiple runs idea, but I would like to say something about API's. I think that it's very important that this spec stand alone without any assumed API or library code. I'm glad that you are working on RAMP, which I'm sure has helped find problems with the spec and will be useful to many people. But, I think the planning must be for many implementations of readers and writers of the spec (I will probably end up writing both myself at our site). I wouldn't consider the spec a success unless many programmers feel like they understand it well enough to implement their own readers and writers. (Whether or not they *should* do so will, as always, depend on many factors.) Mike |
From: Mike C. <tu...@gm...> - 2007-08-03 02:57:52
|
On 8/2/07, Brian Pratt <bri...@in...> wrote: > OK, I see the disconnect - you aren't using an API for reading mass spec > data, you're using an API for reading XML (expat - an excellent choice). > You're speaking in terms of "the parser", but the APIs we're concerned with > (RAMP, JRAP) are front ends to multiple parsers and they abstract the mass > spec file format choice away from the logic that deals with mass spec data, > which keeps us from needing to change a couple dozen programs (along with > others we don't even know about, since RAMP and JRAP are open source) when a > new format pops up. I'm not sure what to think about the multiple runs idea, but I would like to say something about API's. I think that it's very important that this spec stand alone without any assumed API or library code. I'm glad that you are working on RAMP, which I'm sure has helped find problems with the spec and will be useful to many people. But, I think the planning must be for many implementations of readers and writers of the spec (I will probably end up writing both myself at our site). I wouldn't consider the spec a success unless many programmers feel like they understand it well enough to implement their own readers and writers. (Whether or not they *should* do so will, as always, depend on many factors.) Mike |