From: Fredrik L. <Fre...@im...> - 2007-11-23 14:08:14
Hi Henk, Randy and others,

I think that normally you will produce two separate mzML files from that workflow. The first one will represent all the MS spectra collected in the first run, and the second one will contain a mixture of MS scans and MS/MS scans from the run that is performed with an inclusion list (pick list). The second file would look similar to the file: http://psidev.cvs.sourceforge.net/*checkout*/psidev/psi/psi-ms/mzML/instanceFile/1min.mzML where some spectra are MS (level one) and some MS/MS (level two). The picked masses are found under 'precursor' for the MS/MS spectra. In addition, probably the complete inclusion list should be given as cvParams or userParams in a 'referencableParamGroup' to specify which peaks the instrument was programmed to look for.

One could imagine that you construct a third mzML file which is assembled from the first two files, but I'm not sure if that is allowed within the standard, since only one 'run' can be specified. What would be the preferred way to accomplish this? analysisXML or mzML? Has anyone created an mzML file from multiple runs?

Regards
Fredrik

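As a purely illustrative sketch of the arrangement Fredrik describes - the element names, ids, param names, and m/z values here are hypothetical and may not match the draft schema exactly - the second run's file might carry the inclusion list in a referencableParamGroup and record each picked mass under the precursor of the corresponding MS/MS spectrum:

    <!-- Hypothetical sketch only; names, ids and values are illustrative -->
    <referencableParamGroup id="inclusionList">
      <userParam name="inclusion list m/z" value="445.12"/>
      <userParam name="inclusion list m/z" value="512.30"/>
    </referencableParamGroup>

    <spectrum id="S102">
      <userParam name="ms level" value="2"/>
      <precursorList count="1">
        <precursor spectrumRef="S101">
          <!-- the picked mass that triggered this fragmentation scan -->
          <userParam name="selected ion m/z" value="445.12"/>
        </precursor>
      </precursorList>
      <!-- the m/z and intensity binary arrays of the fragment spectrum follow here -->
    </spectrum>

The first run's file would contain only the MS (level one) spectra and would not need the inclusion-list group.
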
From: Randy J. <rkj...@in...> - 2007-11-23 13:37:28
Hi Henk,

mzML was designed for the application you described. Take a look at the specification document: http://www.psidev.info/index.php?q=node/303

In this public comment release, the spectrum element allows multiple binary arrays to be stored. The main ones would be m/z and intensity. The thought was there could be others - like picked peaks. We have wrestled with allowing human-readable arrays and I think the group concluded they would be too confusing. There are many ways to do human-readable arrays, and that violates the goal of minimizing 'ways to represent the same thing' in the standard - a very good goal.

This means that you will either have to encode the peak list in binary, or use the cvParam or userParam elements. I would recommend that we adopt a standard nomenclature for picked peaks and represent this in cvParams for situations where there are not too many.

The fragmentation spectra can be stored directly and are best represented in the binaryDataArray - this is what it was meant for. If you have a large number of picked peaks, this binary array is also the best way to store this type of data.

As for 'fragments' of mzML, the spectrum element does have an ID attribute. In theory, this means that each spectrum is uniquely identified in the file and could be returned as part of a query (I'm thinking XQuery-style extraction from the document). While the spectrum element is not self-contained, it is 'identifiable', so it is a candidate for a return value from an XQuery or an LSID request - I don't think we have gotten that far yet - any suggestions?

Read through the specification and let us know if you think it's unclear how the standard could do what you want. We are at the point where external readers are needed.

Thanks,
Randy Julian

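To make the two options above concrete - this is a hypothetical fragment, with the array labels, id value, and base64 placeholders invented for illustration - a spectrum could carry the usual m/z and intensity arrays plus a third binaryDataArray holding the picked peaks, and its id attribute is what an XQuery-style request would match on:

    <!-- Hypothetical sketch; labels, id and base64 content are placeholders -->
    <spectrum id="S0057">
      <binaryDataArrayList count="3">
        <binaryDataArray>
          <userParam name="array name" value="m/z"/>
          <binary>AAAAAAAAAAA=</binary>
        </binaryDataArray>
        <binaryDataArray>
          <userParam name="array name" value="intensity"/>
          <binary>AAAAAAAAAAA=</binary>
        </binaryDataArray>
        <binaryDataArray>
          <userParam name="array name" value="picked peaks"/>
          <binary>AAAAAAAAAAA=</binary>
        </binaryDataArray>
      </binaryDataArrayList>
    </spectrum>

For a short pick list, the same information could instead be written as a handful of cvParam or userParam elements on the spectrum, as suggested above.
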
From: Toorn, H.W.P. v. d. (Henk) <H.W...@uu...> - 2007-11-22 15:23:45
Dear developers,

I have some questions concerning the mzML format. We have some collaborators who are forced to use MS-peak pick files in order to target peaks for MS-MS in a later run. To be more clear, the workflow would be: do an MS run, pick the peaks you are interested in, rerun the MS, use the list of picked peaks to do further fragmentation.

My questions are: would it be possible to store such picked peaks in a part of the mzML file, together with the original MS spectra and the resulting MS-MS fragmentations? Are there any obvious ways that fragments of the mzML files could be used as an intermediary file format?

Thanks in advance,
Henk van den Toorn

From: Brian P. <bri...@in...> - 2007-10-18 20:08:37
Hi Chris,

Quite right, with namespaces and all a combination of related schemas operates as one; it's just a bit more complex to deal with mentally. I just don't want any unwarranted complexity. If there are aspects of the MS raw data specification that are known to be volatile, then moving them to a child schema might make sense, but so far I'm not sure what these are. I do know I see things in the mapping file that are clearly not volatile, like scan window. This picture will no doubt become clearer as things get moved into the xsd.

I don't think anyone means to tie the xsd to a single CV; it's just that at the moment there's only one CV we're aware of that's useful in this context. The idea is that each element points at a CV entry, which doesn't have to be in the MS CV necessarily. When OBI is ready, we can update the xsd to point there as well. It won't destabilize any systems already using mzML, so I think it still meets your test of a proper standard. Third parties aren't really so interested in the assurance of a schema with an unchanging version number as they are in the assurance of not having to revisit their code every other week.

It's extremely useful to hear what you've heard elsewhere - thanks for sticking your neck out!

Brian

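As a rough sketch of the 'each element points at a CV entry' idea above - purely hypothetical, with the attribute names and the type name invented for illustration - the core xsd could give an element's type optional attributes naming the CV and the term it maps to, and a later revision could add a second pair pointing at OBI without breaking existing documents:

    <!-- Hypothetical sketch: optional CV-pointer attributes on a core-schema type;
         attribute and type names are illustrative only -->
    <xs:complexType name="ScanWindowType">
      <xs:sequence>
        <!-- existing content model left unchanged -->
      </xs:sequence>
      <xs:attribute name="cvRef" type="xs:string" use="optional" default="MS"/>
      <xs:attribute name="cvAccession" type="xs:string" use="optional"/>
    </xs:complexType>
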
From: Chris T. <chr...@eb...> - 2007-10-18 19:19:13
But with namespaces and all surely what is in physical reality two schemata can be operated as one anyway? So does it really make a huge difference? I'm actually asking rather than being rhetorical...

Despite all the arguments I perceive 'proper' standards as being completely static apart from through (infrequent) versions (XML Schema itself, for example). Maybe I have a biased notion of standards but should we not be making a core thing that is static and keeping the volatile stuff in the second one? And I do still see a tie to one CV as bundling for no reason -- it's a short term gain (a year or so, which means that just at the point that we have good implementations, it'll be change-o time).

I dunno. I'm as I said just throwing in opinions I've heard elsewhere mostly. On balance it really comes down to pragmatism versus kind/strength of assurance (to third parties). I'm gonna pull my head in now anyway :)

Cheers, Chris.

From: Brian P. <bri...@in...> - 2007-10-18 18:01:26
Hey All,

It's true that in practice most day-to-day consumers of mzML files will not bother with validation. The value of the detailed validation capability of a fully realized xsd is largely seen during the *development* of the readers and writers, not in their day-to-day operation. (Of course it's also seen in their day-to-day operation, because they work properly, having been written properly.)

Ideally we would test every conceivable combination of writer and reader, but since we can't expect to do that (we can't start until everybody finishes, and imagine the back and forth!) we instead have to make it possible for the writers to readily check their work in syntactic and semantic detail, and for the readers to not have to make a lot of guesses about what they're likely to see. The fully realized xsd helps on both counts - ready validation for the writers, and a clear spec for the readers. It also gives the possibility of automatically generated code as a jumping-off point for the programmers of both readers and writers, which can reduce defect rates.

Matt asks if I envision one schema or two. We need to go out the gate with one schema that expresses everything we know we want to say today (including any intelligence in the current mapping file, plus more detail). The anticipated need for vendors to extend the schema independent of the official schema release cycle (our "stability" goal) is then handled by schemas the vendors create, which inherit from and extend the standard schema. The proposed idea of a second schema from the get-go just to layer on the CV mappings is unwarranted complexity. These mappings belong in the core xsd as (optional) attributes of the various elements; when that one-time OBI event comes, we'll just update the core xsd to add attributes that indicate relationships from elements to the new CV as well. It's far enough away not to threaten the appearance of stability in the spec, and in any case it won't break backward compatibility.

The important point about hard-coding rules vs. expressing relationships and constraints in the xsd is one of economies of scale. It was asked whether hard coding was any more work than getting the schema right: the answer is yes, as it has to be done repeatedly, once per validating reader implementation (not everyone uses Java, or is even allowed to use open source code in their product). Why make everyone reinvent the wheel and probably get it wrong, when we have a nice, standard, language-independent means of expressing those constraints?

It just comes down to KISS: Keep It Simple, Stupid! (not calling names here, that's just the acronym as I learned it). We're here to deal with MS raw data transfer, not to design new data format description languages. More than once on this list I've seen snarky asides about coders who aren't up to muscling through these proposed convolutions, but a truly competent coder is professionally lazy (managers prefer "elegant"). Moreover, a standards effort is supposed to consolidate the efforts of the community so its individuals can get on with their real work - we shouldn't be blithely proposing things that create more individual work than they absolutely need to.

- Brian

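A minimal sketch of the vendor-extension route Brian describes, assuming the core schema exposes a complex type for spectrum (the namespaces, file name, and type names below are hypothetical, not taken from any released schema): a vendor schema imports the standard namespace unchanged and derives its own type by extension, so vendor additions live outside the official release cycle.

    <!-- Hypothetical vendor extension schema; all names are illustrative only -->
    <xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema"
               xmlns:mz="http://psidev.example.org/mzML"
               targetNamespace="http://vendor.example.com/mzML-extensions"
               elementFormDefault="qualified">

      <!-- pull in the official mzML schema unchanged -->
      <xs:import namespace="http://psidev.example.org/mzML"
                 schemaLocation="mzML.xsd"/>

      <!-- derive from the standard spectrum type and append vendor-specific content -->
      <xs:complexType name="VendorSpectrumType">
        <xs:complexContent>
          <xs:extension base="mz:SpectrumType">
            <xs:sequence>
              <xs:element name="vendorDiagnostics" type="xs:string" minOccurs="0"/>
            </xs:sequence>
          </xs:extension>
        </xs:complexContent>
      </xs:complexType>
    </xs:schema>

Because the derived type only adds to the base, the core schema itself is untouched and documents written against it remain valid.
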
From: Chris T. <chr...@eb...> - 2007-10-18 16:37:26
Hiya. Matthew Chambers wrote: > I'm glad we're getting good participation and discussion of this issue > now! Chris, your characterization is a reasonable one for the > two-schema approach I described. > > To respond to qualification of the current state of affairs, I'll quote > something you said the other day: >> Clearly we need the basic (and rilly rilly easy to do) syntactic >> validation provided by a fairly rich XML schema. > This is not clear to me. I do not see a clear advantage to validating > syntax and not validating semantics. In my experience, reading a file > with invalid semantics is as likely to result in a parser error as > reading a file with invalid syntax (although I admit that implementing > error handling for semantic errors tends to be more intuitive). The only thing I'd say here is that there is a minimum effort option available for implementers who cannot or choose not to validate content -- i.e. the 'core' schema is there to allow syntactic validation only, the extended schema you suggested would then allow the Brians and yourselves of this world to do more. Seems a neat solution. That said I don't contest your assertion that the more thorough the validation, the more likely one is to catch the subtle errors as well as the gross ones. >> But supporting >> the kinds of functionality discussed (which would mean the CV >> rapidly becoming a 'proper' ontology, which we don't have the >> person-hours to do right btw) is really just a nice to have at >> the moment. True semantic validation is just about feasible but >> _isn't_ practical imho. > I think you misunderstood the functionality I was suggesting to be added > to the CV. I was not suggesting significant logic changes in the CV, > only a simple instance_of relationship added to every controlled value > to link it to its parent category: "LTQ" is a controlled value, and it > should be an 'instance_of' an "instrument model", which is a controlled > category. In my view, the distinction between controlled values and > categories in the CV is crucial and it doesn't come close to making the > CV any more of a 'proper' ontology (i.e. that machines can use to gain > knowledge about the domain without human intervention). It would, > however, mean that a machine could auto-generate a schema from the CV, > which is what I was aiming for. :) I don't really agree with the idea > that the PSI MS CV should be a filler which gets replaced by the OBI CV > whenever it comes about, but if that's the consensus view then that > would be reason enough to give up the idea of using the CV to > auto-generate the schema. Thing here is that I heard several people assert (not on here) that defining terminating endpoints is storing up trouble and instances are therefore hostages to fortune; you'll just end up making a new class and deprecating the instance. Obviously there are clear endpoints (is there only one variant of an LTQ btw? is it a child or a sib to have an LTQ-FT?) but there are also going to be mistakes made -- rope to hang ourselves (overly dramatic phrase but nonetheless). Then there is the case where people _want_ to use a more generic parent (not sure how many there are in the CV tbh as it is quite flat iirc but still there are many ontologies in the world where the nodes are used as much as the leaves). A (simple-ish) example off the top of my head (not necessarily directly applicable, just for the principle) would be where someone has a machine not yet described and just wants to say something about it. 
>> Certainly for all but the most dedicated >> coders it is a pipe dream. All that can realistically be hoped >> for at the moment is correct usage (i.e. checking in an >> application of some sort that the term is appropriate given its >> usage), for which this wattage of CV is just fine.This is what >> the MIers have done -- a java app uses hard-coded rules to check >> usage (and in that simple scenario the intelligent use of >> class-superclass stuff can bring benefits). > It seems here you DO suggest validating semantics, but instead of doing > it with the CV/schema it must be implemented manually by hard-coding the > rules into a user application. Right now, there is no way (short of > parsing the ms-mapping file and adopting that format) to get that kind > of validation without the hard-coding you mention. Brian and I both > think that a proper specification should include a way to get this kind > of validation without hard-coding the rules, even if applications choose > not to use it. I think in the absence of an ontology to afford this sort of functionality (and with one expected), hard coding is not an awful solution (the workload for your suggestion wouldn't be orders of magnitude different would it, bearing in mind this is a temporary state of affairs so not subject to years of maintenance?). The MI group certainly went this route straight off the bat... At the risk of becoming dull, I'd restate that this is why I like the separable schemata you suggested, as we get the best of both worlds no? >> But what they're not >> doing is something like (for MS now) I have a Voyager so why on >> earth do I have ion trap data -- sound the klaxon; this can only >> come from something of the sophistication of OBI (or a _LOT_ of >> bespoke coding), which is in a flavour of OWL (a cruise liner to >> OBO's dinghy). > It's true, AFAIK, that validating (for example) the value of the "mass > analyzer" category based on the value provided for the "instrument > model" category is not possible with the current CV/schema. It is not > even possible after the extensions proposed by Brian or me. Such > functionality would require a much more interconnected CV (and the XSD > schema would be so confusing to maintain that it would almost certainly > have to be auto-generated from the CV). I don't think anybody > particularly expects this functionality either, so we needn't worry > about it. :) Well I'm kind of hoping we will ultimately be able to get this from OBI, which is being built in a very thorough and extensible (in terms of the richness of relations between classes) manner. Cheers, Chris. > -Matt > > > Chris Taylor wrote: >> Hiya. >> >> So your solution can, if I understand correctly, be >> characterised as formalising the mapping file info in an XSD >> that happens (for obvious reasons) to inherit from the main >> schema? If so, then as long as everyone likes it, I see that as >> a nice, neat, robust solution. >> >> Funnily enough I was chatting to a fellow PSIer yesterday about >> the mapping file(s) (this is cross-WG policy stuff you see) and >> enquired as to the current nature of the thing. I think if there >> is a clamour to formalise the map then hopefully there will be a >> response. To qualify the current state of affairs though, this >> was not meant to be a formal part of the standard -- more >> something akin to documentation (it didn't exist at all at one >> point -- bridging the gap was something done in the CV, which is >> not a great method for a number of reasons). 
>> >> Cheers, Chris. >> >> >> Matthew Chambers wrote: >> >>> If the consensus is that the CV should be left simple like it is now, >>> then I must agree with Brian. The current schema is incapable of doing >>> real validation, and the ms-mapping file is worse than a fleshed-out CV >>> or XSD (it's more confusing, it takes longer to maintain, and it's >>> non-standard). >>> >>> I still want Brian to clarify if he wants a one-schema spec or a >>> two-schema spec. I support the latter approach, where one schema is a >>> stable, syntactical version and the other inherits from the first one >>> and defines all the semantic restrictions as well. It would be up to >>> implementors which schema to use for validation, and of course only the >>> syntactical schema would be "stable" because the semantic restrictions >>> in the second schema would change to match the CV whenever it was updated. >>> >>> -Matt >>> >>> > > > ------------------------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. > Still grepping through log files to find problems? Stop. > Now Search log events and configuration files using AJAX and a browser. > Download your FREE copy of Splunk now >> http://get.splunk.com/ > _______________________________________________ > Psidev-ms-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-ms-dev > -- ~~~~~~~~~~~~~~~~~~~~~~~~ chr...@eb... http://mibbi.sf.net/ ~~~~~~~~~~~~~~~~~~~~~~~~ |
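A concrete (and purely illustrative) sketch of the two-schema layering discussed above, assuming hypothetical file names, an abridged cvParam attribute list, and xs:redefine as the inheritance mechanism so that instance documents need not change; a real generated layer would tighten each context with enumerations pulled from the CV rather than the simple accession pattern shown here:

    <!-- mzML-core.xsd (hypothetical name): stable, syntax-only layer -->
    <xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
      <xs:complexType name="cvParamType">
        <xs:attribute name="cvLabel"   type="xs:string" use="required"/>
        <xs:attribute name="accession" type="xs:string" use="required"/>
        <xs:attribute name="name"      type="xs:string" use="required"/>
        <xs:attribute name="value"     type="xs:string" use="optional"/>
      </xs:complexType>
    </xs:schema>

    <!-- mzML-semantic.xsd (hypothetical name): regenerated whenever the CV changes -->
    <xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
      <xs:redefine schemaLocation="mzML-core.xsd">
        <xs:complexType name="cvParamType">
          <xs:complexContent>
            <xs:restriction base="cvParamType">
              <xs:attribute name="accession" use="required">
                <xs:simpleType>
                  <xs:restriction base="xs:string">
                    <xs:pattern value="MS:[0-9]{7}"/>
                  </xs:restriction>
                </xs:simpleType>
              </xs:attribute>
            </xs:restriction>
          </xs:complexContent>
        </xs:complexType>
      </xs:redefine>
    </xs:schema>

Validating against the core file gives the minimum-effort syntactic check; validating against the generated file gives the stricter check, and no parser has to change either way.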
From: Matthew C. <mat...@va...> - 2007-10-18 16:14:41
|
I'm glad we're getting good participation and discussion of this issue now! Chris, your characterization is a reasonable one for the two-schema approach I described. To respond to qualification of the current state of affairs, I'll quote something you said the other day: > Clearly we need the basic (and rilly rilly easy to do) syntactic > validation provided by a fairly rich XML schema. This is not clear to me. I do not see a clear advantage to validating syntax and not validating semantics. In my experience, reading a file with invalid semantics is as likely to result in a parser error as reading a file with invalid syntax (although I admit that implementing error handling for semantic errors tends to be more intuitive). > But supporting > the kinds of functionality discussed (which would mean the CV > rapidly becoming a 'proper' ontology, which we don't have the > person-hours to do right btw) is really just a nice to have at > the moment. True semantic validation is just about feasible but > _isn't_ practical imho. I think you misunderstood the functionality I was suggesting to be added to the CV. I was not suggesting significant logic changes in the CV, only a simple instance_of relationship added to every controlled value to link it to its parent category: "LTQ" is a controlled value, and it should be an 'instance_of' an "instrument model", which is a controlled category. In my view, the distinction between controlled values and categories in the CV is crucial and it doesn't come close to making the CV any more of a 'proper' ontology (i.e. that machines can use to gain knowledge about the domain without human intervention). It would, however, mean that a machine could auto-generate a schema from the CV, which is what I was aiming for. :) I don't really agree with the idea that the PSI MS CV should be a filler which gets replaced by the OBI CV whenever it comes about, but if that's the consensus view then that would be reason enough to give up the idea of using the CV to auto-generate the schema. > Certainly for all but the most dedicated > coders it is a pipe dream. All that can realistically be hoped > for at the moment is correct usage (i.e. checking in an > application of some sort that the term is appropriate given its > usage), for which this wattage of CV is just fine.This is what > the MIers have done -- a java app uses hard-coded rules to check > usage (and in that simple scenario the intelligent use of > class-superclass stuff can bring benefits). It seems here you DO suggest validating semantics, but instead of doing it with the CV/schema it must be implemented manually by hard-coding the rules into a user application. Right now, there is no way (short of parsing the ms-mapping file and adopting that format) to get that kind of validation without the hard-coding you mention. Brian and I both think that a proper specification should include a way to get this kind of validation without hard-coding the rules, even if applications choose not to use it. > But what they're not > doing is something like (for MS now) I have a Voyager so why on > earth do I have ion trap data -- sound the klaxon; this can only > come from something of the sophistication of OBI (or a _LOT_ of > bespoke coding), which is in a flavour of OWL (a cruise liner to > OBO's dinghy). It's true, AFAIK, that validating (for example) the value of the "mass analyzer" category based on the value provided for the "instrument model" category is not possible with the current CV/schema. 
It is not even possible after the extensions proposed by Brian or me. Such functionality would require a much more interconnected CV (and the XSD schema would be so confusing to maintain that it would almost certainly have to be auto-generated from the CV). I don't think anybody particularly expects this functionality either, so we needn't worry about it. :) -Matt Chris Taylor wrote: > Hiya. > > So your solution can, if I understand correctly, be > characterised as formalising the mapping file info in an XSD > that happens (for obvious reasons) to inherit from the main > schema? If so, then as long as everyone likes it, I see that as > a nice, neat, robust solution. > > Funnily enough I was chatting to a fellow PSIer yesterday about > the mapping file(s) (this is cross-WG policy stuff you see) and > enquired as to the current nature of the thing. I think if there > is a clamour to formalise the map then hopefully there will be a > response. To qualify the current state of affairs though, this > was not meant to be a formal part of the standard -- more > something akin to documentation (it didn't exist at all at one > point -- bridging the gap was something done in the CV, which is > not a great method for a number of reasons). > > Cheers, Chris. > > > Matthew Chambers wrote: > >> If the consensus is that the CV should be left simple like it is now, >> then I must agree with Brian. The current schema is incapable of doing >> real validation, and the ms-mapping file is worse than a fleshed-out CV >> or XSD (it's more confusing, it takes longer to maintain, and it's >> non-standard). >> >> I still want Brian to clarify if he wants a one-schema spec or a >> two-schema spec. I support the latter approach, where one schema is a >> stable, syntactical version and the other inherits from the first one >> and defines all the semantic restrictions as well. It would be up to >> implementors which schema to use for validation, and of course only the >> syntactical schema would be "stable" because the semantic restrictions >> in the second schema would change to match the CV whenever it was updated. >> >> -Matt >> >> > |
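To make the syntax-versus-semantics point concrete: a fragment along the lines of the one below is well-formed and passes validation against a purely syntactic schema, because that schema only asks for a cvParam with some accession and name; nothing in it knows that MS:1000173 names an instrument model rather than a mass analyzer. (The enclosing element name is illustrative, not taken from the draft schema.)

    <analyzer>
      <cvParam cvLabel="MS" accession="MS:1000173" name="MAT900XP"/>
    </analyzer>

Catching this today means the ms-mapping.xml validator or hard-coded rules; with instance_of links in the CV, the same check could fall out of a schema generated from the CV instead.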
From: Chris T. <chr...@eb...> - 2007-10-18 15:39:46
|
Hiya. So your solution can, if I understand correctly, be characterised as formalising the mapping file info in an XSD that happens (for obvious reasons) to inherit from the main schema? If so, then as long as everyone likes it, I see that as a nice, neat, robust solution. Funnily enough I was chatting to a fellow PSIer yesterday about the mapping file(s) (this is cross-WG policy stuff you see) and enquired as to the current nature of the thing. I think if there is a clamour to formalise the map then hopefully there will be a response. To qualify the current state of affairs though, this was not meant to be a formal part of the standard -- more something akin to documentation (it didn't exist at all at one point -- bridging the gap was something done in the CV, which is not a great method for a number of reasons). Cheers, Chris. Matthew Chambers wrote: > If the consensus is that the CV should be left simple like it is now, > then I must agree with Brian. The current schema is incapable of doing > real validation, and the ms-mapping file is worse than a fleshed-out CV > or XSD (it's more confusing, it takes longer to maintain, and it's > non-standard). > > I still want Brian to clarify if he wants a one-schema spec or a > two-schema spec. I support the latter approach, where one schema is a > stable, syntactical version and the other inherits from the first one > and defines all the semantic restrictions as well. It would be up to > implementors which schema to use for validation, and of course only the > syntactical schema would be "stable" because the semantic restrictions > in the second schema would change to match the CV whenever it was updated. > > -Matt > > > Brian Pratt wrote: >> Hi Chris, >> >> Most helpful to have some more background, thanks. Especially in light of >> the idea that the PSI CVs as they stand are fillers to use while OBI gets >> done, your term "bad bundling" is appropriate. >> >> If we go with a fully realized xsd wherein each element definition has a CV >> reference, when OBI comes to fruition we just tweak the xsd. It's a small >> change to the "foo" element definition, which is already declared to have >> the meaning found at "MS:12345", to declare it as also having the meaning >> found at "OB:54321". The point is that it's still a foo element so all >> existing mzML files remain valid, and all those mzML parsers out there don't >> have to be changed. In the currently contemplated mzML you'd have to go >> through all parsers in existence and update them to understand that <cvParam >> accession="OB:54321"/> is the same as <cvParam accession="MS:12345"/>, and >> of course older systems just won't understand it at all. Bad bundling >> indeed! The xsd approach is in fact the more stable one. >> >> It's odd, to say the least, to have the "mortar" of this project (the >> mapping file) not be part of the official standard. It's the only artifact >> we have at the moment, as far as I can see, that attempts to define the >> detailed structure of an mzML file. It's the de facto standard, and "de >> facto" has been identified as a Bad Thing on this list. >> >> So, to recap this and previous posts, the current proposal employs an >> unnecessarily elaborate, nonstandard, inflexible, sneaky, and inadequate way >> to couple mzML to the CV. 
This is readily corrected by moving the mapping >> file content to the xsd which actually forms the standard, then adding >> detail so that, for example, it is clear that a scan window must have both a >> low mz and high mz but dwell time is optional. >> >> Using the CV to define terms is important, but mostly what both vendors and >> users really want from a data format standard is to not be forever tweaking >> readers and writers to adjust to "valid" but unexpected usages. This is >> only achieved by the standard being extremely clear on what "valid" means, >> something the current proposal largely flinches from doing. As currently >> proposed, mzML feels like a big step backwards. >> >> Brian >> >> >> -----Original Message----- >> From: psi...@li... >> [mailto:psi...@li...] On Behalf Of Chris >> Taylor >> Sent: Wednesday, October 17, 2007 2:27 AM >> To: Mass spectrometry standard development >> Cc: Daniel Schober >> Subject: Re: [Psidev-ms-dev] mzML 0.99.0 comments >> >> Hiya. >> >> Just a few points: >> >> The CV is deliberately as simple as possible -- just the >> barebones -- enough to find the term you need. In part this is a >> pragmatic outcome from the lack of person-hours, but not >> completely; it is also to avoid the complications of using the >> more complex relationships that are available (roles, for >> example, the benefit of which in this setting is unclear) and >> some of the less standard (=weird) ones. >> >> The CV and the schema should be separable entities imho. Mostly >> this is to allow the use of other CVs/ontologies as they become >> available. If either of these products depends too much on the >> other the result of removing that other would be crippling; this >> is 'bad' bundling, basically. Because they are separate, the >> mapping file for the use of that particular CV with the schema >> is provided. This is a convenience thing for developers, >> basically, which they would be able to figure out for themselves >> given a week, and is no part of any standard. If you recall a >> while ago, the MGED 'ontology' (MO, which is really a CV, hence >> the quotes) got a good kicking in the literature for being >> directly structured around a model/schema (MAGE); there were >> many criticisms voiced there (not all valid, especially the ones >> about process, but nonetheless -- who critiques the critics eh). >> >> On 'other' term sources, consider OBI (the successor to MO, >> inter alia), which is destined ultimately to replace the CVs >> generated by PSI and MGED with a proper ontology supporting all >> sorts of nice things. The OBI dev calls, especially the >> instrument track, would be a _great_ place to redirect this >> enthusiasm to ensure that all is well. Really the PSI CVs as >> they stand are fillers to use while that big job gets done. >> Please I implore you if you really do have major issues/needs, >> go to a few of the OBI calls. For instruments the guy to mail is >> Daniel Schober at EBI (CCed on here); incidentally he also >> handles the needs of the metabolomics community who have >> heee-uge overlaps with PSI (on MS for example) and who will most >> likely use mzML for their MS work also (I co-chair their formats >> WG and have been heavily promoting PSI products to them with an >> eye on the cross-domain integrative thing). Ah synergy. >> >> Clearly we need the basic (and rilly rilly easy to do) syntactic >> validation provided by a fairly rich XML schema. 
But supporting >> the kinds of functionality discussed (which would mean the CV >> rapidly becoming a 'proper' ontology, which we don't have the >> person-hours to do right btw) is really just a nice to have at >> the moment. True semantic validation is just about feasible but >> _isn't_ practical imho. Certainly for all but the most dedicated >> coders it is a pipe dream. All that can realistically be hoped >> for at the moment is correct usage (i.e. checking in an >> application of some sort that the term is appropriate given its >> usage), for which this wattage of CV is just fine. This is what >> the MIers have done -- a java app uses hard-coded rules to check >> usage (and in that simple scenario the intelligent use of >> class-superclass stuff can bring benefits). But what they're not >> doing is something like (for MS now) I have a Voyager so why on >> earth do I have ion trap data -- sound the klaxon; this can only >> come from something of the sophistication of OBI (or a _LOT_ of >> bespoke coding), which is in a flavour of OWL (a cruise liner to >> OBO's dinghy). >> >> Finally, again on where to draw the separating line; the more >> detail in the schema, the more labile that schema. So the schema >> should be as stable as possible (tend towards simpler). That >> schema should also remain as simple to dumb-validate as possible >> (so someone with barely the ability to run a simple validation >> check can wheel out a standard XSD tool and be done -- again >> tend towards simpler). The rest of the ~needed detail has then >> to be elsewhere in that scenario; in the CV (but that also has >> limits as discussed above) and the mapping file (the mortar >> between the bricks). The point is that although that makes work >> for those who really want to go for it on validation (to the >> point of reasoning in some sense), those developing simpler >> implementations will be able to keep things simple (e.g. person >> X uses a simple library to check for well-formedness and >> validity against the XSD, cares not-a-whole-hell-of-a-lot about >> the CV terms used as they know that most came direct from the >> instrument somehow with no user intervention, and just wants a >> coherent file with some metadata around the data to put in a >> database, which is where the CV matters most -- for retrieval). >> To truly go up a level on validation (excepting the halfway >> house of stating which terms [from a _particular_ source] can go >> where) is unrealistic and currently the benefits are minimal I >> would say (compare the effort of implementing to the benefit of >> the 0.1% of files in which you catch an error by that route, or >> the frequency of searches based on proteins/peptides, or on >> atomic terms (possibly AND/OR-ed), to that of searches truly >> exploiting the power of ontologies). >> >> Not that I'm against powerful ontology-based queries supported >> by systems that reason like a herd of ancient g(r)eeks; it'll >> truly rock when it comes and will be key to the provision of >> good integrated (i.e. cross-domain) resources down the line. But >> the time is not now -- we need OBI first. To forcibly mature the >> MS CV to support such functionality is a waste of effort better >> spent in making OBI all it can be. >> >> WHY can I not write a short email (that was rhetorical...) >> >> Cheers, Chris. >> >> > > > ------------------------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. > Still grepping through log files to find problems? Stop. 
> Now Search log events and configuration files using AJAX and a browser. > Download your FREE copy of Splunk now >> http://get.splunk.com/ > _______________________________________________ > Psidev-ms-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-ms-dev > -- ~~~~~~~~~~~~~~~~~~~~~~~~ chr...@eb... http://mibbi.sf.net/ ~~~~~~~~~~~~~~~~~~~~~~~~ |
From: Matthew C. <mat...@va...> - 2007-10-18 14:36:25
|
If the consensus is that the CV should be left simple like it is now, then I must agree with Brian. The current schema is incapable of doing real validation, and the ms-mapping file is worse than a fleshed-out CV or XSD (it's more confusing, it takes longer to maintain, and it's non-standard). I still want Brian to clarify if he wants a one-schema spec or a two-schema spec. I support the latter approach, where one schema is a stable, syntactical version and the other inherits from the first one and defines all the semantic restrictions as well. It would be up to implementors which schema to use for validation, and of course only the syntactical schema would be "stable" because the semantic restrictions in the second schema would change to match the CV whenever it was updated. -Matt Brian Pratt wrote: > Hi Chris, > > Most helpful to have some more background, thanks. Especially in light of > the idea that the PSI CVs as they stand are fillers to use while OBI gets > done, your term "bad bundling" is appropriate. > > If we go with a fully realized xsd wherein each element definition has a CV > reference, when OBI comes to fruition we just tweak the xsd. It's a small > change to the "foo" element definition, which is already declared to have > the meaning found at "MS:12345", to declare it as also having the meaning > found at "OB:54321". The point is that it's still a foo element so all > existing mzML files remain valid, and all those mzML parsers out there don't > have to be changed. In the currently contemplated mzML you'd have to go > through all parsers in existence and update them to understand that <cvParam > accession="OB:54321"/> is the same as <cvParam accession="MS:12345"/>, and > of course older systems just won't understand it at all. Bad bundling > indeed! The xsd approach is in fact the more stable one. > > It's odd, to say the least, to have the "mortar" of this project (the > mapping file) not be part of the official standard. It's the only artifact > we have at the moment, as far as I can see, that attempts to define the > detailed structure of an mzML file. It's the de facto standard, and "de > facto" has been identified as a Bad Thing on this list. > > So, to recap this and previous posts, the current proposal employs an > unnecessarily elaborate, nonstandard, inflexible, sneaky, and inadequate way > to couple mzML to the CV. This is readily corrected by moving the mapping > file content to the xsd which actually forms the standard, then adding > detail so that, for example, it is clear that a scan window must have both a > low mz and high mz but dwell time is optional. > > Using the CV to define terms is important, but mostly what both vendors and > users really want from a data format standard is to not be forever tweaking > readers and writers to adjust to "valid" but unexpected usages. This is > only achieved by the standard being extremely clear on what "valid" means, > something the current proposal largely flinches from doing. As currently > proposed, mzML feels like a big step backwards. > > Brian > > > -----Original Message----- > From: psi...@li... > [mailto:psi...@li...] On Behalf Of Chris > Taylor > Sent: Wednesday, October 17, 2007 2:27 AM > To: Mass spectrometry standard development > Cc: Daniel Schober > Subject: Re: [Psidev-ms-dev] mzML 0.99.0 comments > > Hiya. > > Just a few points: > > The CV is deliberately as simple as possible -- just the > barebones -- enough to find the term you need. 
In part this is a > pragmatic outcome from the lack of person-hours, but not > completely; it is also to avoid the complications of using the > more complex relationships that are available (roles, for > example, the benefit of which in this setting is unclear) and > some of the less standard (=weird) ones. > > The CV and the schema should be separable entities imho. Mostly > this is to allow the use of other CVs/ontologies as they become > available. If either of these products depends too much on the > other the result of removing that other would be crippling; this > is 'bad' bundling, basically. Because they are separate, the > mapping file for the use of that particular CV with the schema > is provided. This is a convenience thing for developers, > basically, which they would be able to figure out for themselves > given a week, and is no part of any standard. If you recall a > while ago, the MGED 'ontology' (MO, which is really a CV, hence > the quotes) got a good kicking in the literature for being > directly structured around a model/schema (MAGE); there were > many criticisms voiced there (not all valid, especially the ones > about process, but nonetheless -- who critiques the critics eh). > > On 'other' term sources, consider OBI (the successor to MO, > inter alia), which is destined ultimately to replace the CVs > generated by PSI and MGED with a proper ontology supporting all > sorts of nice things. The OBI dev calls, especially the > instrument track, would be a _great_ place to redirect this > enthusiasm to ensure that all is well. Really the PSI CVs as > they stand are fillers to use while that big job gets done. > Please I implore you if you really do have major issues/needs, > go to a few of the OBI calls. For instruments the guy to mail is > Daniel Schober at EBI (CCed on here); incidentally he also > handles the needs of the metabolomics community who have > heee-uge overlaps with PSI (on MS for example) and who will most > likely use mzML for their MS work also (I co-chair their formats > WG and have been heavily promoting PSI products to them with an > eye on the cross-domain integrative thing). Ah synergy. > > Clearly we need the basic (and rilly rilly easy to do) syntactic > validation provided by a fairly rich XML schema. But supporting > the kinds of functionality discussed (which would mean the CV > rapidly becoming a 'proper' ontology, which we don't have the > person-hours to do right btw) is really just a nice to have at > the moment. True semantic validation is just about feasible but > _isn't_ practical imho. Certainly for all but the most dedicated > coders it is a pipe dream. All that can realistically be hoped > for at the moment is correct usage (i.e. checking in an > application of some sort that the term is appropriate given its > usage), for which this wattage of CV is just fine. This is what > the MIers have done -- a java app uses hard-coded rules to check > usage (and in that simple scenario the intelligent use of > class-superclass stuff can bring benefits). But what they're not > doing is something like (for MS now) I have a Voyager so why on > earth do I have ion trap data -- sound the klaxon; this can only > come from something of the sophistication of OBI (or a _LOT_ of > bespoke coding), which is in a flavour of OWL (a cruise liner to > OBO's dinghy). > > Finally, again on where to draw the separating line; the more > detail in the schema, the more labile that schema. So the schema > should be as stable as possible (tend towards simpler). 
That > schema should also remain as simple to dumb-validate as possible > (so someone with barely the ability to run a simple validation > check can wheel out a standard XSD tool and be done -- again > tend towards simpler). The rest of the ~needed detail has then > to be elsewhere in that scenario; in the CV (but that also has > limits as discussed above) and the mapping file (the mortar > between the bricks). The point is that although that makes work > for those who really want to go for it on validation (to the > point of reasoning in some sense), those developing simpler > implementations will be able to keep things simple (e.g. person > X uses a simple library to check for well-formedness and > validity against the XSD, cares not-a-whole-hell-of-a-lot about > the CV terms used as they know that most came direct from the > instrument somehow with no user intervention, and just wants a > coherent file with some metadata around the data to put in a > database, which is where the CV matters most -- for retrieval). > To truly go up a level on validation (excepting the halfway > house of stating which terms [from a _particular_ source] can go > where) is unrealistic and currently the benefits are minimal I > would say (compare the effort of implementing to the benefit of > the 0.1% of files in which you catch an error by that route, or > the frequency of searches based on proteins/peptides, or on > atomic terms (possibly AND/OR-ed), to that of searches truly > exploiting the power of ontologies). > > Not that I'm against powerful ontology-based queries supported > by systems that reason like a herd of ancient g(r)eeks; it'll > truly rock when it comes and will be key to the provision of > good integrated (i.e. cross-domain) resources down the line. But > the time is not now -- we need OBI first. To forcibly mature the > MS CV to support such functionality is a waste of effort better > spent in making OBI all it can be. > > WHY can I not write a short email (that was rhetorical...) > > Cheers, Chris. > > |
From: sneumann <sne...@ip...> - 2007-10-18 11:31:38
|
On Wed, 2007-10-17 at 12:49 -0700, Brian Pratt wrote: ... > something the current proposal largely flinches from doing. As currently > proposed, mzML feels like a big step backwards. Hi, greetings from one of the "lurkers" on this list. We are operating a number of different MS. Currently, we have used Eclipse EMF to auto-generate Java classes from the mzData.xsd, and from there we connect to a database, using an auto-generated schema through an Object-Relational Mapping (ORM). The raw data is read by the RAMP parser inside the Bioconductor XCMS package. I have the feeling that a data model with very little structure and a well-structured ontology would put a lot of burden on tool and database developers. I expected mzML to be mainly a merger of mzXML and mzData, keeping the best of both worlds, and improving vendor and tools support for a merged standard. In that light I followed the Index, Binary and Wrapper Schema discussion, not responding because I saw that whatever way mzML settled, I'd be able to adopt it by ignoring those features or modifying our tools. At the beginning of the mzML (when it was called dataXML) discussion I also remembered the idea of having a place to store the chromatograms; I am not sure what happened to this. Starting with the CV discussion I felt that mzML is drifting away from its mz[Data|XML] parents. The rationale behind this discussion is to keep up with ever-changing requirements. But hey, mzData started in 2005, and will likely be applicable to the majority of use cases for another (at least?) 1-2 years. I am not sure whether those use cases not covered by mzData can easily be covered with mzML+complexCV, but for speedy adoption by vendors please keep simplicity in mind. Remember people will be writing mzML readers in Java, C++, C# and Mono, perl, Bioconductor, Python, ... and it might turn into a bad reputation for mzML if these implementations are buggy and/or incomplete merely because mzML tries to do too much and people end up hacking the parsers just for their own machine and use case. Yours, Steffen -- IPB Halle AG Massenspektrometrie & Bioinformatik Dr. Steffen Neumann http://www.IPB-Halle.DE Weinberg 3 http://msbi.bic-gh.de 06120 Halle Tel. +49 (0) 345 5582 - 1470 +49 (0) 345 5582 - 0 sneumann(at)IPB-Halle.DE Fax. +49 (0) 345 5582 - 1409 |
From: Brian P. <bri...@in...> - 2007-10-17 19:50:34
|
Hi Chris, Most helpful to have some more background, thanks. Especially in light of the idea that the PSI CVs as they stand are fillers to use while OBI gets done, your term "bad bundling" is appropriate. If we go with a fully realized xsd wherein each element definition has a CV reference, when OBI comes to fruition we just tweak the xsd. It's a small change to the "foo" element definition, which is already declared to have the meaning found at "MS:12345", to declare it as also having the meaning found at "OB:54321". The point is that it's still a foo element so all existing mzML files remain valid, and all those mzML parsers out there don't have to be changed. In the currently contemplated mzML you'd have to go through all parsers in existence and update them to understand that <cvParam accession="OB:54321"/> is the same as <cvParam accession="MS:12345"/>, and of course older systems just won't understand it at all. Bad bundling indeed! The xsd approach is in fact the more stable one. It's odd, to say the least, to have the "mortar" of this project (the mapping file) not be part of the official standard. It's the only artifact we have at the moment, as far as I can see, that attempts to define the detailed structure of an mzML file. It's the de facto standard, and "de facto" has been identified as a Bad Thing on this list. So, to recap this and previous posts, the current proposal employs an unnecessarily elaborate, nonstandard, inflexible, sneaky, and inadequate way to couple mzML to the CV. This is readily corrected by moving the mapping file content to the xsd which actually forms the standard, then adding detail so that, for example, it is clear that a scan window must have both a low mz and high mz but dwell time is optional. Using the CV to define terms is important, but mostly what both vendors and users really want from a data format standard is to not be forever tweaking readers and writers to adjust to "valid" but unexpected usages. This is only achieved by the standard being extremely clear on what "valid" means, something the current proposal largely flinches from doing. As currently proposed, mzML feels like a big step backwards. Brian -----Original Message----- From: psi...@li... [mailto:psi...@li...] On Behalf Of Chris Taylor Sent: Wednesday, October 17, 2007 2:27 AM To: Mass spectrometry standard development Cc: Daniel Schober Subject: Re: [Psidev-ms-dev] mzML 0.99.0 comments Hiya. Just a few points: The CV is deliberately as simple as possible -- just the barebones -- enough to find the term you need. In part this is a pragmatic outcome from the lack of person-hours, but not completely; it is also to avoid the complications of using the more complex relationships that are available (roles, for example, the benefit of which in this setting is unclear) and some of the less standard (=weird) ones. The CV and the schema should be separable entities imho. Mostly this is to allow the use of other CVs/ontologies as they become available. If either of these products depends too much on the other the result of removing that other would be crippling; this is 'bad' bundling, basically. Because they are separate, the mapping file for the use of that particular CV with the schema is provided. This is a convenience thing for developers, basically, which they would be able to figure out for themselves given a week, and is no part of any standard. 
If you recall a while ago, the MGED 'ontology' (MO, which is really a CV, hence the quotes) got a good kicking in the literature for being directly structured around a model/schema (MAGE); there were many criticisms voiced there (not all valid, especially the ones about process, but nonetheless -- who critiques the critics eh). On 'other' term sources, consider OBI (the successor to MO, inter alia), which is destined ultimately to replace the CVs generated by PSI and MGED with a proper ontology supporting all sorts of nice things. The OBI dev calls, especially the instrument track, would be a _great_ place to redirect this enthusiasm to ensure that all is well. Really the PSI CVs as they stand are fillers to use while that big job gets done. Please I implore you if you really do have major issues/needs, go to a few of the OBI calls. For instruments the guy to mail is Daniel Schober at EBI (CCed on here); incidentally he also handles the needs of the metabolomics community who have heee-uge overlaps with PSI (on MS for example) and who will most likely use mzML for their MS work also (I co-chair their formats WG and have been heavily promoting PSI products to them with an eye on the cross-domain integrative thing). Ah synergy. Clearly we need the basic (and rilly rilly easy to do) syntactic validation provided by a fairly rich XML schema. But supporting the kinds of functionality discussed (which would mean the CV rapidly becoming a 'proper' ontology, which we don't have the person-hours to do right btw) is really just a nice to have at the moment. True semantic validation is just about feasible but _isn't_ practical imho. Certainly for all but the most dedicated coders it is a pipe dream. All that can realistically be hoped for at the moment is correct usage (i.e. checking in an application of some sort that the term is appropriate given its usage), for which this wattage of CV is just fine. This is what the MIers have done -- a java app uses hard-coded rules to check usage (and in that simple scenario the intelligent use of class-superclass stuff can bring benefits). But what they're not doing is something like (for MS now) I have a Voyager so why on earth do I have ion trap data -- sound the klaxon; this can only come from something of the sophistication of OBI (or a _LOT_ of bespoke coding), which is in a flavour of OWL (a cruise liner to OBO's dinghy). Finally, again on where to draw the separating line; the more detail in the schema, the more labile that schema. So the schema should be as stable as possible (tend towards simpler). That schema should also remain as simple to dumb-validate as possible (so someone with barely the ability to run a simple validation check can wheel out a standard XSD tool and be done -- again tend towards simpler). The rest of the ~needed detail has then to be elsewhere in that scenario; in the CV (but that also has limits as discussed above) and the mapping file (the mortar between the bricks). The point is that although that makes work for those who really want to go for it on validation (to the point of reasoning in some sense), those developing simpler implementations will be able to keep things simple (e.g. 
person X uses a simple library to check for well-formedness and validity against the XSD, cares not-a-whole-hell-of-a-lot about the CV terms used as they know that most came direct from the instrument somehow with no user intervention, and just wants a coherent file with some metadata around the data to put in a database, which is where the CV matters most -- for retrieval). To truly go up a level on validation (excepting the halfway house of stating which terms [from a _particular_ source] can go where) is unrealistic and currently the benefits are minimal I would say (compare the effort of implementing to the benefit of the 0.1% of files in which you catch an error by that route, or the frequency of searches based on proteins/peptides, or on atomic terms (possibly AND/OR-ed), to that of searches truly exploiting the power of ontologies). Not that I'm against powerful ontology-based queries supported by systems that reason like a herd of ancient g(r)eeks; it'll truly rock when it comes and will be key to the provision of good integrated (i.e. cross-domain) resources down the line. But the time is not now -- we need OBI first. To forcibly mature the MS CV to support such functionality is a waste of effort better spent in making OBI all it can be. WHY can I not write a short email (that was rhetorical...) Cheers, Chris. ~~~~~~~~~~~~~~~~~~~~~~~~ chr...@eb... http://mibbi.sf.net/ ~~~~~~~~~~~~~~~~~~~~~~~~ ------------------------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ _______________________________________________ Psidev-ms-dev mailing list Psi...@li... https://lists.sourceforge.net/lists/listinfo/psidev-ms-dev |
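For the scan-window case Brian names, a "fully realized" element definition might look roughly like this; the element and attribute names are illustrative rather than copied from the draft schema, and the MS:12345-style accession is a placeholder in the same spirit as Brian's own example:

    <xs:element name="scanWindow">
      <xs:annotation>
        <xs:appinfo>defined-by: MS:12345 (placeholder accession)</xs:appinfo>
      </xs:annotation>
      <xs:complexType>
        <xs:attribute name="lowMz"     type="xs:double" use="required"/>
        <xs:attribute name="highMz"    type="xs:double" use="required"/>
        <xs:attribute name="dwellTime" type="xs:double" use="optional"/>
      </xs:complexType>
    </xs:element>

The structural rules (what is required, what type it carries) sit in the schema where any off-the-shelf validator enforces them, while the appinfo points at the CV term that defines what the element means; remapping MS:12345 to an OBI accession later is then a one-line schema edit rather than a change to every parser.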
From: Chris T. <chr...@eb...> - 2007-10-17 09:27:09
|
Hiya. Just a few points: The CV is deliberately as simple as possible -- just the barebones -- enough to find the term you need. In part this is a pragmatic outcome from the lack of person-hours, but not completely; it is also to avoid the complications of using the more complex relationships that are available (roles, for example, the benefit of which in this setting is unclear) and some of the less standard (=weird) ones. The CV and the schema should be separable entities imho. Mostly this is to allow the use of other CVs/ontologies as they become available. If either of these products depends too much on the other the result of removing that other would be crippling; this is 'bad' bundling, basically. Because they are separate, the mapping file for the use of that particular CV with the schema is provided. This is a convenience thing for developers, basically, which they would be able to figure out for themselves given a week, and is no part of any standard. If you recall a while ago, the MGED 'ontology' (MO, which is really a CV, hence the quotes) got a good kicking in the literature for being directly structured around a model/schema (MAGE); there were many criticisms voiced there (not all valid, especially the ones about process, but nonetheless -- who critiques the critics eh). On 'other' term sources, consider OBI (the successor to MO, inter alia), which is destined ultimately to replace the CVs generated by PSI and MGED with a proper ontology supporting all sorts of nice things. The OBI dev calls, especially the instrument track, would be a _great_ place to redirect this enthusiasm to ensure that all is well. Really the PSI CVs as they stand are fillers to use while that big job gets done. Please I implore you if you really do have major issues/needs, go to a few of the OBI calls. For instruments the guy to mail is Daniel Schober at EBI (CCed on here); incidentally he also handles the needs of the metabolomics community who have heee-uge overlaps with PSI (on MS for example) and who will most likely use mzML for their MS work also (I co-chair their formats WG and have been heavily promoting PSI products to them with an eye on the cross-domain integrative thing). Ah synergy. Clearly we need the basic (and rilly rilly easy to do) syntactic validation provided by a fairly rich XML schema. But supporting the kinds of functionality discussed (which would mean the CV rapidly becoming a 'proper' ontology, which we don't have the person-hours to do right btw) is really just a nice to have at the moment. True semantic validation is just about feasible but _isn't_ practical imho. Certainly for all but the most dedicated coders it is a pipe dream. All that can realistically be hoped for at the moment is correct usage (i.e. checking in an application of some sort that the term is appropriate given its usage), for which this wattage of CV is just fine. This is what the MIers have done -- a java app uses hard-coded rules to check usage (and in that simple scenario the intelligent use of class-superclass stuff can bring benefits). But what they're not doing is something like (for MS now) I have a Voyager so why on earth do I have ion trap data -- sound the klaxon; this can only come from something of the sophistication of OBI (or a _LOT_ of bespoke coding), which is in a flavour of OWL (a cruise liner to OBO's dinghy). Finally, again on where to draw the separating line; the more detail in the schema, the more labile that schema. 
So the schema should be as stable as possible (tend towards simpler). That schema should also remain as simple to dumb-validate as possible (so someone with barely the ability to run a simple validation check can wheel out a standard XSD tool and be done -- again tend towards simpler). The rest of the ~needed detail has then to be elsewhere in that scenario; in the CV (but that also has limits as discussed above) and the mapping file (the mortar between the bricks). The point is that although that makes work for those who really want to go for it on validation (to the point of reasoning in some sense), those developing simpler implementations will be able to keep things simple (e.g. person X uses a simple library to check for well-formedness and validity against the XSD, cares not-a-whole-hell-of-a-lot about the CV terms used as they know that most came direct from the instrument somehow with no user intervention, and just wants a coherent file with some metadata around the data to put in a database, which is where the CV matters most -- for retrieval). To truly go up a level on validation (excepting the halfway house of stating which terms [from a _particular_ source] can go where) is unrealistic and currently the benefits are minimal I would say (compare the effort of implementing to the benefit of the 0.1% of files in which you catch an error by that route, or the frequency of searches based on proteins/peptides, or on atomic terms (possibly AND/OR-ed), to that of searches truly exploiting the power of ontologies). Not that I'm against powerful ontology-based queries supported by systems that reason like a herd of ancient g(r)eeks; it'll truly rock when it comes and will be key to the provision of good integrated (i.e. cross-domain) resources down the line. But the time is not now -- we need OBI first. To forcibly mature the MS CV to support such functionality is a waste of effort better spent in making OBI all it can be. WHY can I not write a short email (that was rhetorical...) Cheers, Chris. ~~~~~~~~~~~~~~~~~~~~~~~~ chr...@eb... http://mibbi.sf.net/ ~~~~~~~~~~~~~~~~~~~~~~~~ |
From: Matthew C. <mat...@va...> - 2007-10-16 20:47:23
|
Fair points, Brian. But the XSD attributes for minOccurs, maxOccurs, and required can easily be added to the relevant terms in the CV via trailing modifiers. Whatever is necessary to autogenerate the XSD can be added to the CV without over complicating it (indeed, doing so would only serve to further disambiguate it). However, I'll grant that the more XSD functionality that the CV supports, the less difference there is between autogenerating the XSD from the hand-maintained CV and hand-maintaining the XSD itself. Half a dozen of one and 6 of another... -Matt Brian Pratt wrote: > Hi Matt, > > I can only speculate on the history of PSI CV as a subset of OBO, my guess > is they just wanted to keep it simple as it was never intended to provide > the kind of granularity we need for fully automated semantic validation. > > So, I disagree on your point of CV being nearly there as an XSD replacement. > It doesn't seem to have, for example, any means of saying whether an element > or attribute is required or not, or how many times it can occur, etc etc. > That's why that whole crazy xsd-like infrastructure that the java validator > uses was built up (the ms-mapping.xml schema file is attached, for those who > don't want to dig for it), and even that I have already shown to be > inadequate. I don't want to see us follow previous groups down that rabbit > hole. > > |
From: Brian P. <bri...@in...> - 2007-10-16 20:21:57
|
Hi Matt, Matt, I can only speculate on the history of PSI CV as a subset of OBO, my guess is they just wanted to keep it simple as it was never intended to provide the kind of granularity we need for fully automated semantic validation. So, I disagree on your point of CV being nearly there as an XSD replacement. It doesn't seem to have, for example, any means of saying whether an element or attribute is required or not, or how many times it can occur, etc etc. That's why that whole crazy xsd-like infrastructure that the java validator uses was built up (the ms-mapping.xml schema file is attached, for those who don't want to dig for it), and even that I have already shown to be inadequate. I don't want to see us follow previous groups down that rabbit hole. I also think that in practice nobody is going to be all that interested in messing with the CV beyond adding the occasional machine model etc. I think a one time determination of the XSD will prove quite durable, and it's already been largely done between the existing xsd and ms-mapping.xml. You're right, for the applications I'm personally looking at right now I think the CV isn't very important. But your use case of vendor DLLs using CV to disambiguate their APIs is a perfect example of how CV can improve things. I support its development and I think mzML should play well with it. Even though the existence of a system that would actually do anything with the CV info in an mzML file is currently theoretical, it's the right direction to be heading in and it's worth caring about and doing it right. - Brian -----Original Message----- From: psi...@li... [mailto:psi...@li...] On Behalf Of Matthew Chambers Sent: Tuesday, October 16, 2007 12:20 PM To: Mass spectrometry standard development Subject: Re: [Psidev-ms-dev] mzML 0.99.0 comments Brian Pratt wrote: > (First of all, thanks to Frank for shedding more light on the topic - heat, > we have already!) > > Heat and light are just different wavelengths on the same spectrum. ;) > Matt, > > You're right about OBO not limiting itself to is_a and part_of, but it > appears that PSI has explicitly chosen to do so. I doubt we have the > political heft to change that now, or that we should want to do so. Further > contortions to turn CV into something to rival the readily available power > of XSD are misguided, in my opinion. > If what you say is true, I at least want to see some rationale of why PSI would explicitly limit their CVs to 'is_a' and 'part_of' relationships. I agree that contorting a CV to make it work as an XSD is misguided, but it's already been done to a great extent and I just want to go that little bit further to finish it. I was suggesting that we should leverage the validation power of XSD by autogenerating an XSD from a properly done (contorted!) CV, where maintaining the CV is preferable to the XSD primarily because OBO CVs are ubiquitous in the life sciences while XSDs are not (AFAIK). Also, it means only having to maintain the CV instead of maintaining both the CV and the XSD (autogenerating the CV from the XSD is conceivable, but pointless because by then you are putting new accession number straight into the XSD along with all the baggage that needs to get passed to the CV but isn't really important to the XSD). > Frankly it seems to me that the CV doesn't really need to be all that > logically consistent: in its current bogus state it doesn't seem to have > bothered anyone, including the official validator. 
PSI clearly never meant > for CV to do things like datatyping and range limiting so we should stop > pushing on that rope and just allow CV to play its proper role in > disambiguating the terms we use in the XSD, by use of accession numbers in > the XSD. > I think you say this because, as things currently are, you don't plan to care much about the CV and frankly neither do I. And there is a legitimate reason to not care about a CV if it doesn't specify enough semantics of the format to truly and unambiguously define the the terms. The data type of a term is as much a part of its definition as the English description of it! Imagine different users of the CV trying to pass around instances of terms using different data types for the different instances! I don't think that constitutes an unambiguous controlled vocabulary. :) > The thing to do now is to transfer most of the intelligence in the > ms-mapping.xml schema file (for it is indeed a schema, albeit written in a > nonstandard format) to the XSD file then add the proper datatyping and range > checking. I was happy to see that this second schema contains the work I > thought we were going to have to generate from the CV itself, although I was > also somewhat surprised to learn of the existence of such a key artifact > this late in the discussion. Or maybe I just missed it somehow. > > As I've said before we should be braver than we have been so far. The > refusal to put useful content in the XSD file simply for fear of being wrong > about it is just deplorable and doesn't serve the purposes of the community. > And I'm appalled at the disingenuousness of claiming a "stable schema" when > many key parts of the spec are in fact expressed in a schema > (ms-mapping.xml) which is explicitly unstable. > I agree wholeheartedly. We only disagree about maintaining the fully specified XSD. I think it should be autogenerated from a fixed CV and a stable template schema, whereas you think it should be hand rolled. Let me get you to clear something up though: do you want there to be a single, ever-changing schema, or would you also accept a basic stable schema (without CV-related restrictions) which can be derived from in order to create the fully specified schema with the ever-changing restrictions? In the latter case, we can have a schema that is stable but doesn't serve for anything more than syntactical validation, and also a schema that can be used for full semantic validation, and which schema that a program uses is up to the program. > The charge has been leveled on this list that (paraphrasing here) some old > dogs are resisting learning new tricks when it comes to the use of CV. > That's always something to be mindful of, but after careful consideration I > really just don't see the advantage of a CV-centric approach, when all the > added complexity and reinvention still leaves us well short of where proper > use of XSD would get us. Fully realized XSD that references CV to define > its terms seems like the obvious choice for a system that wants to gain > widespread and rapid adoption. > Speaking of learning new tricks, when will the vendors' raw file reading libraries return CV accession numbers to describe terms instead of ambiguous strings? That would be nice. But if that never happens, each conversion program has to maintain its own vendor-to-CV mapping. And if a program wants to read both vendor-proprietary formats and the XML formats, your mapping problems have become nightmares. 
-Matt > - Brian > > -----Original Message----- > From: psi...@li... > [mailto:psi...@li...] On Behalf Of Matthew > Chambers > Sent: Tuesday, October 16, 2007 8:27 AM > To: Mass spectrometry standard development > Subject: Re: [Psidev-ms-dev] mzML 0.99.0 comments > > Hi Frank, I read the Guidelines you linked to and also the paper > describing the Relation Ontology (http://genomebiology.com/2005/6/5/R46) > which is referenced from the Guidelines. The Relation Ontology does not > in any way suggest that reliable OBO CVs should be limited to IS_A and > PART_OF relationships! Rather, it does a good job of defining when IS_A > and PART_OF should be used and what they really mean. I think if we > looked closely we could find quite a few cases in the CV where the use > of IS_A and PART_OF is bogus according to the Relation Ontology > definition, especially with regard to values being indistinct from > categories. > > Therefore, I take issue with the following text from the Guidelines > which has no corresponding rationale and which is currently biting us in > the arse: > > 11. Relations between RU's > As the PSI CV will be developed under the OBO umbrella [3], the > relations created between terms MUST ascribe to the definitions and > formal requirements provided in the OBO Relations Ontology (RO) paper > [7], as the relations 'is_a' and 'part_of'. > > It is not clear whether the Relation Ontology recommends or discourages > using OBO to typedef new relationship types into existence (my proposed > 'value of'), but that won't be necessary. I think we can accomplish the > same effect with the existing relationship, 'instance_of', which IS part > of the Relation Ontology. In fact, 'instance_of' is a primitive relation > in the Relation Ontology, whereas 'is_a' is not. Here is the Relation > Ontology definition for 'instance_of': > > p instance_of P - a primitive relation between a process instance and a > class which it instantiates holding independently of time > > That sounds like a pretty good way to distinguish between values > (instances) and categories (classes) to me! Further, the instance_of > relationship can be used in addition to the current part_of and is_a > relationships and it will serve to disambiguate a branch of the CV where > the actual category that a value belongs to is an ancestor instead of a > direct parent. For instance: > MS:1000173 "MAT900XP" > is a MS:1000493 "Finnigan MAT" > part of MS:1000483 "Thermo Fisher Scientific" > is a MS:1000031 "model by vendor" > part of MS:1000463 "instrument description" > part of MS:0000000 "MZ controlled vocabularies" > What category does the controlled value "MAT900XP" belong to, i.e. if we > used cvParam method B, would it look like: > <cvParam cvLabel="MS" categoryName="Finnigan MAT" > categoryAccession="MS:1000493" accession="MS:1000173" name="MAT900XP"/> > Or would it look like: > <cvParam cvLabel="MS" categoryName="model by vendor" > categoryAccession="MS:1000031" accession="MS:1000173" name="MAT900XP"/> > > Of course I think it should be the latter, but how would you derive that > from the CV? 
You can't, unless you add a new relationship or convention, > so I suggest: > MS:1000173 "MAT900XP" > instance of MS:1000031 "model by vendor" > is a MS:1000493 "Finnigan MAT" > part of MS:1000483 "Thermo Fisher Scientific" > is a MS:1000031 "model by vendor" > part of MS:1000463 "instrument description" > part of MS:0000000 "MZ controlled vocabularies" > It would also be good to get rid of the MS:1000483->MS:1000031 > relationship at that point because "Thermo Fisher Scientific" is NOT an > instrument model. > > I have to disagree with your assertion that OBO does not allow a CV to > model datatypes and cardinality. I think the trailing modifiers (which > may have been added since you last looked at the OBO language spec) > would serve to model those properties quite nicely. > > -Matt > ------------------------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ _______________________________________________ Psidev-ms-dev mailing list Psi...@li... https://lists.sourceforge.net/lists/listinfo/psidev-ms-dev |
From: Matthew C. <mat...@va...> - 2007-10-16 19:20:14
|
Brian Pratt wrote: > (First of all, thanks to Frank for shedding more light on the topic - heat, > we have already!) > > Heat and light are just different wavelengths on the same spectrum. ;) > Matt, > > You're right about OBO not limiting itself to is_a and part_of, but it > appears that PSI has explicitly chosen to do so. I doubt we have the > political heft to change that now, or that we should want to do so. Further > contortions to turn CV into something to rival the readily available power > of XSD are misguided, in my opinion. > If what you say is true, I at least want to see some rationale of why PSI would explicitly limit their CVs to 'is_a' and 'part_of' relationships. I agree that contorting a CV to make it work as an XSD is misguided, but it's already been done to a great extent and I just want to go that little bit further to finish it. I was suggesting that we should leverage the validation power of XSD by autogenerating an XSD from a properly done (contorted!) CV, where maintaining the CV is preferable to the XSD primarily because OBO CVs are ubiquitous in the life sciences while XSDs are not (AFAIK). Also, it means only having to maintain the CV instead of maintaining both the CV and the XSD (autogenerating the CV from the XSD is conceivable, but pointless because by then you are putting new accession number straight into the XSD along with all the baggage that needs to get passed to the CV but isn't really important to the XSD). > Frankly it seems to me that the CV doesn't really need to be all that > logically consistent: in its current bogus state it doesn't seem to have > bothered anyone, including the official validator. PSI clearly never meant > for CV to do things like datatyping and range limiting so we should stop > pushing on that rope and just allow CV to play its proper role in > disambiguating the terms we use in the XSD, by use of accession numbers in > the XSD. > I think you say this because, as things currently are, you don't plan to care much about the CV and frankly neither do I. And there is a legitimate reason to not care about a CV if it doesn't specify enough semantics of the format to truly and unambiguously define the the terms. The data type of a term is as much a part of its definition as the English description of it! Imagine different users of the CV trying to pass around instances of terms using different data types for the different instances! I don't think that constitutes an unambiguous controlled vocabulary. :) > The thing to do now is to transfer most of the intelligence in the > ms-mapping.xml schema file (for it is indeed a schema, albeit written in a > nonstandard format) to the XSD file then add the proper datatyping and range > checking. I was happy to see that this second schema contains the work I > thought we were going to have to generate from the CV itself, although I was > also somewhat surprised to learn of the existence of such a key artifact > this late in the discussion. Or maybe I just missed it somehow. > > As I've said before we should be braver than we have been so far. The > refusal to put useful content in the XSD file simply for fear of being wrong > about it is just deplorable and doesn't serve the purposes of the community. > And I'm appalled at the disingenuousness of claiming a "stable schema" when > many key parts of the spec are in fact expressed in a schema > (ms-mapping.xml) which is explicitly unstable. > I agree wholeheartedly. We only disagree about maintaining the fully specified XSD. 
I think it should be autogenerated from a fixed CV and a stable template schema, whereas you think it should be hand rolled. Let me get you to clear something up though: do you want there to be a single, ever-changing schema, or would you also accept a basic stable schema (without CV-related restrictions) which can be derived from in order to create the fully specified schema with the ever-changing restrictions? In the latter case, we can have a schema that is stable but doesn't serve for anything more than syntactical validation, and also a schema that can be used for full semantic validation, and which schema that a program uses is up to the program. > The charge has been leveled on this list that (paraphrasing here) some old > dogs are resisting learning new tricks when it comes to the use of CV. > That's always something to be mindful of, but after careful consideration I > really just don't see the advantage of a CV-centric approach, when all the > added complexity and reinvention still leaves us well short of where proper > use of XSD would get us. Fully realized XSD that references CV to define > its terms seems like the obvious choice for a system that wants to gain > widespread and rapid adoption. > Speaking of learning new tricks, when will the vendors' raw file reading libraries return CV accession numbers to describe terms instead of ambiguous strings? That would be nice. But if that never happens, each conversion program has to maintain its own vendor-to-CV mapping. And if a program wants to read both vendor-proprietary formats and the XML formats, your mapping problems have become nightmares. -Matt > - Brian > > -----Original Message----- > From: psi...@li... > [mailto:psi...@li...] On Behalf Of Matthew > Chambers > Sent: Tuesday, October 16, 2007 8:27 AM > To: Mass spectrometry standard development > Subject: Re: [Psidev-ms-dev] mzML 0.99.0 comments > > Hi Frank, I read the Guidelines you linked to and also the paper > describing the Relation Ontology (http://genomebiology.com/2005/6/5/R46) > which is referenced from the Guidelines. The Relation Ontology does not > in any way suggest that reliable OBO CVs should be limited to IS_A and > PART_OF relationships! Rather, it does a good job of defining when IS_A > and PART_OF should be used and what they really mean. I think if we > looked closely we could find quite a few cases in the CV where the use > of IS_A and PART_OF is bogus according to the Relation Ontology > definition, especially with regard to values being indistinct from > categories. > > Therefore, I take issue with the following text from the Guidelines > which has no corresponding rationale and which is currently biting us in > the arse: > > 11. Relations between RU's > As the PSI CV will be developed under the OBO umbrella [3], the > relations created between terms MUST ascribe to the definitions and > formal requirements provided in the OBO Relations Ontology (RO) paper > [7], as the relations 'is_a' and 'part_of'. > > It is not clear whether the Relation Ontology recommends or discourages > using OBO to typedef new relationship types into existence (my proposed > 'value of'), but that won't be necessary. I think we can accomplish the > same effect with the existing relationship, 'instance_of', which IS part > of the Relation Ontology. In fact, 'instance_of' is a primitive relation > in the Relation Ontology, whereas 'is_a' is not. 
Here is the Relation > Ontology definition for 'instance_of': > > p instance_of P - a primitive relation between a process instance and a > class which it instantiates holding independently of time > > That sounds like a pretty good way to distinguish between values > (instances) and categories (classes) to me! Further, the instance_of > relationship can be used in addition to the current part_of and is_a > relationships and it will serve to disambiguate a branch of the CV where > the actual category that a value belongs to is an ancestor instead of a > direct parent. For instance: > MS:1000173 "MAT900XP" > is a MS:1000493 "Finnigan MAT" > part of MS:1000483 "Thermo Fisher Scientific" > is a MS:1000031 "model by vendor" > part of MS:1000463 "instrument description" > part of MS:0000000 "MZ controlled vocabularies" > What category does the controlled value "MAT900XP" belong to, i.e. if we > used cvParam method B, would it look like: > <cvParam cvLabel="MS" categoryName="Finnigan MAT" > categoryAccession="MS:1000493" accession="MS:1000173" name="MAT900XP"/> > Or would it look like: > <cvParam cvLabel="MS" categoryName="model by vendor" > categoryAccession="MS:1000031" accession="MS:1000173" name="MAT900XP"/> > > Of course I think it should be the latter, but how would you derive that > from the CV? You can't, unless you add a new relationship or convention, > so I suggest: > MS:1000173 "MAT900XP" > instance of MS:1000031 "model by vendor" > is a MS:1000493 "Finnigan MAT" > part of MS:1000483 "Thermo Fisher Scientific" > is a MS:1000031 "model by vendor" > part of MS:1000463 "instrument description" > part of MS:0000000 "MZ controlled vocabularies" > It would also be good to get rid of the MS:1000483->MS:1000031 > relationship at that point because "Thermo Fisher Scientific" is NOT an > instrument model. > > I have to disagree with your assertion that OBO does not allow a CV to > model datatypes and cardinality. I think the trailing modifiers (which > may have been added since you last looked at the OBO language spec) > would serve to model those properties quite nicely. > > -Matt > |
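The 'instance_of' convention proposed in the message above can be sketched in a few lines of Python. This is illustrative only: the accessions, names, and the two cvParam "method B" renderings are taken from the message itself, while the in-memory dictionary (standing in for a parsed psi-ms.obo file) and the helper names are assumptions, not part of any published mzML 0.99.0 artifact.

# Hypothetical term table standing in for a parsed psi-ms.obo file.
CV = {
    "MS:1000173": {"name": "MAT900XP",
                   "is_a": ["MS:1000493"],
                   "instance_of": ["MS:1000031"]},      # the proposed new edge
    "MS:1000493": {"name": "Finnigan MAT", "part_of": ["MS:1000483"]},
    "MS:1000483": {"name": "Thermo Fisher Scientific", "part_of": ["MS:1000463"]},
    "MS:1000031": {"name": "model by vendor", "part_of": ["MS:1000463"]},
    "MS:1000463": {"name": "instrument description", "part_of": ["MS:0000000"]},
    "MS:0000000": {"name": "MZ controlled vocabularies"},
}

def category_of(accession):
    # The category is the class named by 'instance_of' if present; with only
    # is_a/part_of there is no reliable way to tell a value from a category.
    targets = CV[accession].get("instance_of", [])
    return targets[0] if targets else None

def cv_param_method_b(accession):
    # Render the cvParam the way "method B" in the message above would.
    cat = category_of(accession)
    return ('<cvParam cvLabel="MS" categoryName="%s" categoryAccession="%s" '
            'accession="%s" name="%s"/>' %
            (CV[cat]["name"], cat, accession, CV[accession]["name"]))

print(cv_param_method_b("MS:1000173"))
# -> categoryAccession="MS:1000031" ("model by vendor"), not "Finnigan MAT"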
From: Brian P. <bri...@in...> - 2007-10-16 18:27:44
|
(First of all, thanks to Frank for shedding more light on the topic - heat, we have already!) Matt, You're right about OBO not limiting itself to is_a and part_of, but it appears that PSI has explicitly chosen to do so. I doubt we have the political heft to change that now, or that we should want to do so. Further contortions to turn CV into something to rival the readily available power of XSD are misguided, in my opinion. Frankly it seems to me that the CV doesn't really need to be all that logically consistent: in its current bogus state it doesn't seem to have bothered anyone, including the official validator. PSI clearly never meant for CV to do things like datatyping and range limiting so we should stop pushing on that rope and just allow CV to play its proper role in disambiguating the terms we use in the XSD, by use of accession numbers in the XSD. The thing to do now is to transfer most of the intelligence in the ms-mapping.xml schema file (for it is indeed a schema, albeit written in a nonstandard format) to the XSD file then add the proper datatyping and range checking. I was happy to see that this second schema contains the work I thought we were going to have to generate from the CV itself, although I was also somewhat surprised to learn of the existence of such a key artifact this late in the discussion. Or maybe I just missed it somehow. As I've said before we should be braver than we have been so far. The refusal to put useful content in the XSD file simply for fear of being wrong about it is just deplorable and doesn't serve the purposes of the community. And I'm appalled at the disingenuousness of claiming a "stable schema" when many key parts of the spec are in fact expressed in a schema (ms-mapping.xml) which is explicitly unstable. The charge has been leveled on this list that (paraphrasing here) some old dogs are resisting learning new tricks when it comes to the use of CV. That's always something to be mindful of, but after careful consideration I really just don't see the advantage of a CV-centric approach, when all the added complexity and reinvention still leaves us well short of where proper use of XSD would get us. Fully realized XSD that references CV to define its terms seems like the obvious choice for a system that wants to gain widespread and rapid adoption. - Brian -----Original Message----- From: psi...@li... [mailto:psi...@li...] On Behalf Of Matthew Chambers Sent: Tuesday, October 16, 2007 8:27 AM To: Mass spectrometry standard development Subject: Re: [Psidev-ms-dev] mzML 0.99.0 comments Hi Frank, I read the Guidelines you linked to and also the paper describing the Relation Ontology (http://genomebiology.com/2005/6/5/R46) which is referenced from the Guidelines. The Relation Ontology does not in any way suggest that reliable OBO CVs should be limited to IS_A and PART_OF relationships! Rather, it does a good job of defining when IS_A and PART_OF should be used and what they really mean. I think if we looked closely we could find quite a few cases in the CV where the use of IS_A and PART_OF is bogus according to the Relation Ontology definition, especially with regard to values being indistinct from categories. Therefore, I take issue with the following text from the Guidelines which has no corresponding rationale and which is currently biting us in the arse: 11. 
Relations between RU's As the PSI CV will be developed under the OBO umbrella [3], the relations created between terms MUST ascribe to the definitions and formal requirements provided in the OBO Relations Ontology (RO) paper [7], as the relations 'is_a' and 'part_of'. It is not clear whether the Relation Ontology recommends or discourages using OBO to typedef new relationship types into existence (my proposed 'value of'), but that won't be necessary. I think we can accomplish the same effect with the existing relationship, 'instance_of', which IS part of the Relation Ontology. In fact, 'instance_of' is a primitive relation in the Relation Ontology, whereas 'is_a' is not. Here is the Relation Ontology definition for 'instance_of': p instance_of P - a primitive relation between a process instance and a class which it instantiates holding independently of time That sounds like a pretty good way to distinguish between values (instances) and categories (classes) to me! Further, the instance_of relationship can be used in addition to the current part_of and is_a relationships and it will serve to disambiguate a branch of the CV where the actual category that a value belongs to is an ancestor instead of a direct parent. For instance: MS:1000173 "MAT900XP" is a MS:1000493 "Finnigan MAT" part of MS:1000483 "Thermo Fisher Scientific" is a MS:1000031 "model by vendor" part of MS:1000463 "instrument description" part of MS:0000000 "MZ controlled vocabularies" What category does the controlled value "MAT900XP" belong to, i.e. if we used cvParam method B, would it look like: <cvParam cvLabel="MS" categoryName="Finnigan MAT" categoryAccession="MS:1000493" accession="MS:1000173" name="MAT900XP"/> Or would it look like: <cvParam cvLabel="MS" categoryName="model by vendor" categoryAccession="MS:1000031" accession="MS:1000173" name="MAT900XP"/> Of course I think it should be the latter, but how would you derive that from the CV? You can't, unless you add a new relationship or convention, so I suggest: MS:1000173 "MAT900XP" instance of MS:1000031 "model by vendor" is a MS:1000493 "Finnigan MAT" part of MS:1000483 "Thermo Fisher Scientific" is a MS:1000031 "model by vendor" part of MS:1000463 "instrument description" part of MS:0000000 "MZ controlled vocabularies" It would also be good to get rid of the MS:1000483->MS:1000031 relationship at that point because "Thermo Fisher Scientific" is NOT an instrument model. I have to disagree with your assertion that OBO does not allow a CV to model datatypes and cardinality. I think the trailing modifiers (which may have been added since you last looked at the OBO language spec) would serve to model those properties quite nicely. -Matt frank gibson wrote: > Hi > > I have been following this discussion and there seems to be some > confusion about the CV, its use, and development. Using the OBO > language this allows you to record "words" or strings. It does not > allow you to model what the words represent such as restrictions, > cardinality or datatypes for values (such as int, double and xml > datatypes). This is a limitation of the chosen language. > > The PSI have developed "Guidelines for the development of Controlled > Vocabularies" which is a final document and describes the > recommendation's and best practice in designing CVs for the PSI. It > includes and described several issues which have been raised on this > list such as what the relationships of is_a and part_of semanticaly > mean. 
In addition it includes how to normalise the natural language > definitions for each RA, the maintainance procedures, obselecsing tems > and the process for term addition. > > The Final document can be found at the following URL > http://psidev.info/index.php?q=node/258 > > > I hope these comments and the information contained within this > document is helpful in the development of the MS CV > > Cheers > > Frank ------------------------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Still grepping through log files to find problems? Stop. Now Search log events and configuration files using AJAX and a browser. Download your FREE copy of Splunk now >> http://get.splunk.com/ _______________________________________________ Psidev-ms-dev mailing list Psi...@li... https://lists.sourceforge.net/lists/listinfo/psidev-ms-dev |
From: Matthew C. <mat...@va...> - 2007-10-16 15:35:30
|
Oops. It killed my spaces. Let me try again. That sounds like a pretty good way to distinguish between values (instances) and categories (classes) to me! Further, the instance_of relationship can be used in addition to the current part_of and is_a relationships and it will serve to disambiguate a branch of the CV where the actual category that a value belongs to is an ancestor instead of a direct parent. For instance: MS:1000173 "MAT900XP" --is a MS:1000493 "Finnigan MAT" ----part of MS:1000483 "Thermo Fisher Scientific" ------is a MS:1000031 "model by vendor" --------part of MS:1000463 "instrument description" ----------part of MS:0000000 "MZ controlled vocabularies" What category does the controlled value "MAT900XP" belong to, i.e. if we used cvParam method B, would it look like: <cvParam cvLabel="MS" categoryName="Finnigan MAT" categoryAccession="MS:1000493" accession="MS:1000173" name="MAT900XP"/> Or would it look like: <cvParam cvLabel="MS" categoryName="model by vendor" categoryAccession="MS:1000031" accession="MS:1000173" name="MAT900XP"/> Of course I think it should be the latter, but how would you derive that from the CV? You can't, unless you add a new relationship or convention, so I suggest: MS:1000173 "MAT900XP" --instance of MS:1000031 "model by vendor" --is a MS:1000493 "Finnigan MAT" ----part of MS:1000483 "Thermo Fisher Scientific" ------is a MS:1000031 "model by vendor" --------part of MS:1000463 "instrument description" ----------part of MS:0000000 "MZ controlled vocabularies" It would also be good to get rid of the MS:1000483->MS:1000031 relationship at that point because "Thermo Fisher Scientific" is NOT an instrument model. -Matt Matthew Chambers wrote: > That sounds like a pretty good way to distinguish between values > (instances) and categories (classes) to me! Further, the instance_of > relationship can be used in addition to the current part_of and is_a > relationships and it will serve to disambiguate a branch of the CV > where the actual category that a value belongs to is an ancestor > instead of a direct parent. For instance: > MS:1000173 "MAT900XP" > is a MS:1000493 "Finnigan MAT" > part of MS:1000483 "Thermo Fisher Scientific" > is a MS:1000031 "model by vendor" > part of MS:1000463 "instrument description" > part of MS:0000000 "MZ controlled vocabularies" > What category does the controlled value "MAT900XP" belong to, i.e. if we > used cvParam method B, would it look like: > <cvParam cvLabel="MS" categoryName="Finnigan MAT" > categoryAccession="MS:1000493" accession="MS:1000173" name="MAT900XP"/> > Or would it look like: > <cvParam cvLabel="MS" categoryName="model by vendor" > categoryAccession="MS:1000031" accession="MS:1000173" name="MAT900XP"/> |
From: Matthew C. <mat...@va...> - 2007-10-16 15:26:50
|
Hi Frank, I read the Guidelines you linked to and also the paper describing the Relation Ontology (http://genomebiology.com/2005/6/5/R46) which is referenced from the Guidelines. The Relation Ontology does not in any way suggest that reliable OBO CVs should be limited to IS_A and PART_OF relationships! Rather, it does a good job of defining when IS_A and PART_OF should be used and what they really mean. I think if we looked closely we could find quite a few cases in the CV where the use of IS_A and PART_OF is bogus according to the Relation Ontology definition, especially with regard to values being indistinct from categories. Therefore, I take issue with the following text from the Guidelines which has no corresponding rationale and which is currently biting us in the arse: 11. Relations between RU’s As the PSI CV will be developed under the OBO umbrella [3], the relations created between terms MUST ascribe to the definitions and formal requirements provided in the OBO Relations Ontology (RO) paper [7], as the relations ‘is_a’ and ‘part_of’. It is not clear whether the Relation Ontology recommends or discourages using OBO to typedef new relationship types into existence (my proposed 'value of'), but that won't be necessary. I think we can accomplish the same effect with the existing relationship, 'instance_of', which IS part of the Relation Ontology. In fact, 'instance_of' is a primitive relation in the Relation Ontology, whereas 'is_a' is not. Here is the Relation Ontology definition for 'instance_of': p instance_of P - a primitive relation between a process instance and a class which it instantiates holding independently of time That sounds like a pretty good way to distinguish between values (instances) and categories (classes) to me! Further, the instance_of relationship can be used in addition to the current part_of and is_a relationships and it will serve to disambiguate a branch of the CV where the actual category that a value belongs to is an ancestor instead of a direct parent. For instance: MS:1000173 "MAT900XP" is a MS:1000493 "Finnigan MAT" part of MS:1000483 "Thermo Fisher Scientific" is a MS:1000031 "model by vendor" part of MS:1000463 "instrument description" part of MS:0000000 "MZ controlled vocabularies" What category does the controlled value "MAT900XP" belong to, i.e. if we used cvParam method B, would it look like: <cvParam cvLabel="MS" categoryName="Finnigan MAT" categoryAccession="MS:1000493" accession="MS:1000173" name="MAT900XP"/> Or would it look like: <cvParam cvLabel="MS" categoryName="model by vendor" categoryAccession="MS:1000031" accession="MS:1000173" name="MAT900XP"/> Of course I think it should be the latter, but how would you derive that from the CV? You can't, unless you add a new relationship or convention, so I suggest: MS:1000173 "MAT900XP" instance of MS:1000031 "model by vendor" is a MS:1000493 "Finnigan MAT" part of MS:1000483 "Thermo Fisher Scientific" is a MS:1000031 "model by vendor" part of MS:1000463 "instrument description" part of MS:0000000 "MZ controlled vocabularies" It would also be good to get rid of the MS:1000483->MS:1000031 relationship at that point because "Thermo Fisher Scientific" is NOT an instrument model. I have to disagree with your assertion that OBO does not allow a CV to model datatypes and cardinality. I think the trailing modifiers (which may have been added since you last looked at the OBO language spec) would serve to model those properties quite nicely. 
-Matt frank gibson wrote: > Hi > > I have been following this discussion and there seems to be some > confusion about the CV, its use, and development. Using the OBO > language this allows you to record "words" or strings. It does not > allow you to model what the words represent such as restrictions, > cardinality or datatypes for values (such as int, double and xml > datatypes). This is a limitation of the chosen language. > > The PSI have developed "Guidelines for the development of Controlled > Vocabularies" which is a final document and describes the > recommendation's and best practice in designing CVs for the PSI. It > includes and described several issues which have been raised on this > list such as what the relationships of is_a and part_of semanticaly > mean. In addition it includes how to normalise the natural language > definitions for each RA, the maintainance procedures, obselecsing tems > and the process for term addition. > > The Final document can be found at the following URL > http://psidev.info/index.php?q=node/258 > > > I hope these comments and the information contained within this > document is helpful in the development of the MS CV > > Cheers > > Frank |
From: frank g. <Fra...@nc...> - 2007-10-16 09:05:36
|
Hi I have been following this discussion and there seems to be some confusion about the CV, its use, and development. Using the OBO language allows you to record "words" or strings. It does not allow you to model what the words represent such as restrictions, cardinality or datatypes for values (such as int, double and xml datatypes). This is a limitation of the chosen language. The PSI have developed "Guidelines for the development of Controlled Vocabularies" which is a final document and describes the recommendations and best practice in designing CVs for the PSI. It includes and describes several issues which have been raised on this list such as what the relationships of is_a and part_of semantically mean. In addition it includes how to normalise the natural language definitions for each RA, the maintenance procedures, obsolescing terms and the process for term addition. The Final document can be found at the following URL http://psidev.info/index.php?q=node/258 I hope these comments and the information contained within this document are helpful in the development of the MS CV Cheers Frank On 10/15/07, Fredrik Levander <Fre...@im...> wrote: > > Hi, > > My comments on mzML0.99.0 after reading (most of) the posts on the > mailing list and trying to convert a peak list into the format are as > follows: > > The standard is composed of a schema with little control and a lot of > cvParams that are controlled by a separate file. Updates to the CV do > not require schema updates, and the CV rules file should also be stable. > For the validation of files it would, as pointed out by several people, > be straightforward to automatically generate an XSD which reflects the current > CV. Otherwise the semantic Java validator also does the job (and also > has other benefits when it comes to large files). For us it doesn't > matter which method is used, but the real issue is how to handle > versions of the CV. As long as nothing is deleted from the CV everything > should be fine from an implementation point of view though. > > A major problem would be if something is added to the CV which breaks > current parsers. A new compression type could be added to the CV without > notice, and if someone is using that compression type they're producing > standard compliant files, but parsers that are supposed to be standard > compliant would not be able to parse the file correctly. So, there are a > few places where I think the allowed values should be set under enum > constraints in the main standard schema, so that a new schema version is > enforced if these fields are changed. I have the feeling that CV version > will not be as controlled as the schema version. Fields that I propose > should be enums are (this is maybe one step back again...): > > In binaryDataArray: > > compressionType (no compression/zlib compression) > valueType (32-bit float, 64-bit float, 16-bit integer, 32-bit integer or > 64-bit integer) > > In spectrum: > > spectrumType (centroid, profile). > > these parameters could be attributes or cvParams (but under schema > control) if CV accession numbers are important. > > > Other comments: > > There is also an acquisitionList spectrumType attribute which probably > could be removed since we have spectrumDescription - > spectrumRepresentation (spectrumType). Only use would be if the > acquisitions were in profile mode but the peak picking algorithm that > worked on the spectra turned them into a centroid peak list and one > would like to specify this (?). 
> > If the spectrum is a combination of multiple scans (as specified using > acquistionList) one would normally not use the 'scan' element. The > question is then how to give the retention time? We did not succeed in > doing this in a valid way, see > > http://trac.thep.lu.se/trac/fp6-prodac/browser/trunk/mzML/FF_070504_MSMS_5B.mzML > > for a simple (but invalid way of doing it). More correct would be to put > the cvParam under the acquisition with the retention time, but this is > not allowed either. > > Why not allow softwareParam to be userParam or cvParam or must all > software that work on mzML be in the CV? > > How about having precursor m/z, intensity and charge state as > non-required attributes to ionSelection? These fields are really used in > every file. > > Final comment is though that all these things are really minor, and that > getting the standard released is what matters! > > Regards > > Fredrik > > ------------------------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. > Still grepping through log files to find problems? Stop. > Now Search log events and configuration files using AJAX and a browser. > Download your FREE copy of Splunk now >> http://get.splunk.com/ > _______________________________________________ > Psidev-ms-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-ms-dev > -- Frank Gibson Research Associate Room 2.19, Devonshire Building School of Computing Science, University of Newcastle upon Tyne, Newcastle upon Tyne, NE1 7RU United Kingdom Telephone: +44-191-246-4933 Fax: +44-191-246-4905 |
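Fredrik's point about compressionType and valueType can be illustrated by what a writer has to do when filling a binaryDataArray. The following Python fragment is a sketch under assumptions: the value-type and compression labels mirror the enumerations he proposes, and little-endian IEEE byte order with base64 text encoding is assumed here rather than quoted from the 0.99.0 specification.

import base64, struct, zlib

def encode_binary_array(values, value_type="64-bit float", compression="no compression"):
    # Sketch of producing the text content of an mzML <binary> element for the
    # valueType/compressionType combinations listed above. Byte order and the
    # exact term strings are assumptions for illustration.
    fmt = {"32-bit float": "f", "64-bit float": "d",
           "16-bit integer": "h", "32-bit integer": "i",
           "64-bit integer": "q"}[value_type]
    raw = struct.pack("<%d%s" % (len(values), fmt), *values)
    if compression == "zlib compression":
        raw = zlib.compress(raw)
    return base64.b64encode(raw).decode("ascii")

# A reader that does not recognise a newly added compression term would fail in
# exactly the way described above, hence the argument for enum constraints in
# the schema rather than CV-only control of these fields.
print(encode_binary_array([400.0, 401.2, 402.5], "64-bit float", "no compression"))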
From: Matt C. <mat...@va...> - 2007-10-16 01:49:49
|
Eric Deutsch wrote: > 6) Regarding your Observation Two: It is true that the standard relies on the maintenance of three artifacts: xsd, cv, mapping-ms.xml (not the doc as you had inferred; the doc is essentially autogenerated from the former) (and behind the scenes, the example instance documents also need to be maintained). This translates to the desired-stable schema, the evolving controlled vocabulary, and the evolving ruleset on how you may use the CV within the xsd. This is where we are led by the requirement that the schema be stable with provisions for flexibility in annotating many kinds of mass spec data.

From my perspective, it should be possible to hand-maintain only the CV and a templated schema which gets fleshed out by an autogenerator when the CV changes. The mapping file seems like a hack to compensate for two missing features in the CV: 1) the ability to distinguish between values and categories, and 2) to specify the types and ranges of the uncontrolled values. I think it's less contrived to extend the capabilities of the CV format that we use so that it has those features, or alternately, set conventions in the CV (like how to interpret IS_A and PART_OF relationships) which provide the illusion of those features in a well-defined way.

I just looked up the OBO format in more detail and it seems to me that we can very legitimately use convention to solve our CV problems:

1) Distinguish between value/category terms: We can use [Typedef] stanzas to define a new relationship type, "value_of", like:

[Typedef]
id: value_of
name: value_of
range: OBO:TERM_OR_TYPE ! there should be some way to say "not a term which has a value_of relationship" but I don't know how and it's not really necessary
domain: OBO:TERM_OR_TYPE
def: Indicates that the subject term is a controlled value of the object term (which implies the object is a category)

2) Add types, min and max properties: There are several ways to specify data type and min and max ranges, and OBO is even aware of XSD types in some contexts. But because I'm not exactly sure what those contexts are, it would be just as easy to add type, min, and max properties to our category terms (which don't have any value_of relationships pointing to them) as "trailing modifiers", like:

[Term]
id: MS:1000016
name: scan time {type="decimal", min="0"} ! a missing min or max implies no limit (or the limit defined by the type); "xsd:" prefix is implied for the type
def: "The time taken for an acquisition by scanning analyzers." [PSI:MS]
is_a: MS:1000503 ! scan attribute

With such a CV, combined with a stable, templated schema, we can auto-generate a full-fledged semantic validating schema and avoid using any non-standard approaches.

-Matt |
From: Brian P. <bri...@in...> - 2007-10-16 00:54:59
|
Hi Eric, Sorry if I missed anything obvious on the open source nature of the code. Glad to hear it, obviously! It allows me to answer a lot of questions for myself. The existence of the mapping-ms.xml file was lost on me before now, sorry. I see where it gets us a good deal of the way to where pure xsd would, but not actually all the way. For example, the validator accepts the addition of a dwell time to a selectionWindow: <cvParam cvLabel="MS" accession="MS:1000502" name="dwell time" value="1800.000000"/> although I think it's probably nonsensical since it lacks units etc. The validator also happily accepts two copies of that line, in place of the 1000500 and 1000501 lines - all it cares about is seeing two cvParams of the proper inheritance type. The semantic constraints which can be expressed by the combination of the CV and mappings-ms.xml files with the custom java validation code are pretty crude compared to the capabilities of perfectly standard and language independent XSD. This all seems terribly convoluted, approximate, and error prone... such are the wages of reinventing the wheel. Brian _____ From: psi...@li... [mailto:psi...@li...] On Behalf Of Eric Deutsch Sent: Monday, October 15, 2007 4:37 PM To: Mass spectrometry standard development Subject: Re: [Psidev-ms-dev] mzML validator experiences Hi Brian, thank you for your continued input and effort. I'm sorry I've been slow to respond on many of your posts, I have a bunch of other pots boiling over here. However, I think I can answer your questions here and promote further testing. 1) Regarding 2min.mzML, we'll fix it, thanks. 2) Regarding how does the validator know that MS:1000528 is invalid, please download: http://tools.proteomecenter.org/software/mzMLKit/mzML_0.99.0_large.zip (this is hyperlinked from the main development page http://www.psidev.info/index.php?q=node/257) In it, you will find the semantic validator software. One of the files in the distro is ms-mapping.xml. It is this file that encodes these rules and is what is used by the semantic validator. This file should be more prominently posted and will be. 3) The semantic validator is FOSS, please see the PSI SVN repository and contribute! https://psidev.svn.sourceforge.net/svnroot/psidev/psi/mzml/ (this is hyperlinked from the main development page http://www.psidev.info/index.php?q=node/257) 4) So, it turns out that the semantic validator is using an XML file to enforce the semantic rules, it is NOT reading the doc. It should be noted that this software and the mapping mechanism was developed originally for the PSI molecular interactions schema. That format uses the same built-in flexibility with semantic validation. We are borrowing that mechanism and software for mzML. 5) Further, in the doc, the cvParams section for each element is meant to represent "Some examples of allowed cvParams (not necessarily complete)". I will clarify that in the doc. Further, one of the things I realized that we need to do, is include in the doc the rules set forth in the ms-mapping.xml file. These rules are NOT currently in the doc, but they should be and will be. The doc is actually autogenerated from the other files, so I just need to include some code that parses this ms-mapping file and includes that information in the doc. This will be done for 0.99.1. Thanks! 
6) Regarding your Observation Two: It is true that the standard relies on the maintenance of three artifacts: xsd, cv, mapping-ms.xml (not the doc as you had inferred; the doc is essentially autogenerated from the former) (and behind the scenes, the example instance documents also need to be maintained). This translates to the desired-stable schema, the evolving controlled vocabulary, and the evolving ruleset on how you may use the CV within the xsd. This is where we are led by the requirement that the schema be stable with provisions for flexibility in annotating many kinds of mass spec data. Thanks! Eric _____ From: psi...@li... [mailto:psi...@li...] On Behalf Of Brian Pratt Sent: Monday, October 15, 2007 3:19 PM To: 'Mass spectrometry standard development' Subject: [Psidev-ms-dev] mzML validator experiences Hello All, I decided to fool around with the validator at http://eddie.thep.lu.se/prodac_validator/validator.pl to see how well that can be done in the presence of an inadequately specified file format. My plan was to take a valid file, mess with it, and see if the validator would notice. A little hiccup at first - I gave it the automatically generated file http://psidev.cvs.sourceforge.net/*checkout*/psidev/psi/psi-ms/mzML/instance File/2min.mzML - it doesn't actually validate, claiming a missing index element. Somebody might want to check that out. Then I gave it the handrolled http://psidev.cvs.sourceforge.net/*checkout*/psidev/psi/psi-ms/mzML/instance File/tiny4_LTQ-FT.mzML0.99.0.mzML - this validates fine. So, let the mayhem begin. I tried removing the selectionWindow element surrounding the cvParams declaring the upper and lower bounds of the selection window, but the validator is XSD aware so it caught that easily. Then I tried changing the accession numbers in the selection window for others that might be honestly conceptually mistaken by an incautious output module author: accession="MS:1000501" name="scan m/z lower limit" changed to accession="MS:1000528" name="lowest m/z value" the validator caught this as well, flagging the use of accession numbers that were incorrect for that context. But the knowledge behind this doesn't seem to come from the XSD or the CV file. So, how does the validator know? Observation one: the validator doesn't appear to be open source (or if it is, a prominent link to the source should be provided). The use of a closed source tool like this in a standards effort isn't a good idea, since it's hard to answer questions like the one above. Apparently the author of the validator made excellent use of the documentation at http://psidev.cvs.sourceforge.net/*checkout*/psidev/psi/psi-ms/mzML/document /mzML0.99.0_specificationDocument.doc which stipulates in English that the only valid cvParams in that context are: <cvParam cvLabel="MS" accession="MS:1000501" name="scan m/z lower limit" value="400.000000"/> <cvParam cvLabel="MS" accession="MS:1000500" name="scan m/z upper limit" value="1800.000000"/> Ignore for the moment that this appears to be an example rather than a spec. Do note though that there's nothing to say that one of each has to be present. Of course a reasonable human would probably infer this, but words like "reasonable human" and "infer" are not really what you want to hear when discussing a machine readable data format standard. 
Observation two: I'm not at all keen on the idea of a data format that relies on the understanding and simultaneous maintenance of three different artifacts (xsd, cv, doc), one of which (.doc) is not really machine readable. I think (but I can't be 100% sure without seeing the code) that the author has done a very good job under the circumstances, but probably had a harder time then was necessary given the bizarre construction of the spec. He or she probably would have appreciated more xsd content to do the heavy lifting, and certainly had to make a few fairly safe guesses along the way like the "must have one of each of MS:1000501 and MS:1000500 " thing. - Brian |
From: Eric D. <ede...@sy...> - 2007-10-15 23:37:04
|
Hi Brian, thank you for your continued input and effort. I'm sorry I've been slow to respond to many of your posts; I have a bunch of other pots boiling over here. However, I think I can answer your questions here and promote further testing.

1) Regarding 2min.mzML, we'll fix it, thanks.

2) Regarding how the validator knows that MS:1000528 is invalid, please download: http://tools.proteomecenter.org/software/mzMLKit/mzML_0.99.0_large.zip (this is hyperlinked from the main development page http://www.psidev.info/index.php?q=node/257). In it, you will find the semantic validator software. One of the files in the distro is ms-mapping.xml. It is this file that encodes these rules and is what is used by the semantic validator. This file should be more prominently posted and will be.

3) The semantic validator is FOSS; please see the PSI SVN repository and contribute! https://psidev.svn.sourceforge.net/svnroot/psidev/psi/mzml/ (this is hyperlinked from the main development page http://www.psidev.info/index.php?q=node/257)

4) So, it turns out that the semantic validator is using an XML file to enforce the semantic rules; it is NOT reading the doc. It should be noted that this software and the mapping mechanism were developed originally for the PSI molecular interactions schema. That format uses the same built-in flexibility with semantic validation. We are borrowing that mechanism and software for mzML.

5) Further, in the doc, the cvParams section for each element is meant to represent "Some examples of allowed cvParams (not necessarily complete)". I will clarify that in the doc. Also, one of the things I realized we need to do is include in the doc the rules set forth in the ms-mapping.xml file. These rules are NOT currently in the doc, but they should be and will be. The doc is actually autogenerated from the other files, so I just need to include some code that parses this ms-mapping file and includes that information in the doc. This will be done for 0.99.1. Thanks!

6) Regarding your Observation Two: it is true that the standard relies on the maintenance of three artifacts: xsd, cv, and ms-mapping.xml (not the doc, as you had inferred; the doc is essentially autogenerated from the former), and behind the scenes the example instance documents also need to be maintained. This translates to the desired-stable schema, the evolving controlled vocabulary, and the evolving ruleset on how you may use the CV within the xsd. This is where we are led by the requirement that the schema be stable, with provisions for flexibility in annotating many kinds of mass spec data.

Thanks!
Eric

________________________________
From: psi...@li... [mailto:psi...@li...] On Behalf Of Brian Pratt
Sent: Monday, October 15, 2007 3:19 PM
To: 'Mass spectrometry standard development'
Subject: [Psidev-ms-dev] mzML validator experiences

Hello All,

I decided to fool around with the validator at http://eddie.thep.lu.se/prodac_validator/validator.pl to see how well that can be done in the presence of an inadequately specified file format. My plan was to take a valid file, mess with it, and see if the validator would notice.

A little hiccup at first - I gave it the automatically generated file http://psidev.cvs.sourceforge.net/*checkout*/psidev/psi/psi-ms/mzML/instanceFile/2min.mzML - it doesn't actually validate, claiming a missing index element. Somebody might want to check that out.

Then I gave it the handrolled http://psidev.cvs.sourceforge.net/*checkout*/psidev/psi/psi-ms/mzML/instanceFile/tiny4_LTQ-FT.mzML0.99.0.mzML - this validates fine. So, let the mayhem begin.

I tried removing the selectionWindow element surrounding the cvParams declaring the upper and lower bounds of the selection window, but the validator is XSD-aware, so it caught that easily.

Then I tried changing the accession numbers in the selection window for others that might be honestly conceptually mistaken by an incautious output module author: accession="MS:1000501" name="scan m/z lower limit" changed to accession="MS:1000528" name="lowest m/z value". The validator caught this as well, flagging the use of accession numbers that were incorrect for that context. But the knowledge behind this doesn't seem to come from the XSD or the CV file. So, how does the validator know?

Observation one: the validator doesn't appear to be open source (or if it is, a prominent link to the source should be provided). The use of a closed-source tool like this in a standards effort isn't a good idea, since it's hard to answer questions like the one above.

Apparently the author of the validator made excellent use of the documentation at http://psidev.cvs.sourceforge.net/*checkout*/psidev/psi/psi-ms/mzML/document/mzML0.99.0_specificationDocument.doc which stipulates in English that the only valid cvParams in that context are:
<cvParam cvLabel="MS" accession="MS:1000501" name="scan m/z lower limit" value="400.000000"/>
<cvParam cvLabel="MS" accession="MS:1000500" name="scan m/z upper limit" value="1800.000000"/>
Ignore for the moment that this appears to be an example rather than a spec. Do note, though, that there's nothing to say that one of each has to be present. Of course a reasonable human would probably infer this, but words like "reasonable human" and "infer" are not really what you want to hear when discussing a machine-readable data format standard.

Observation two: I'm not at all keen on the idea of a data format that relies on the understanding and simultaneous maintenance of three different artifacts (xsd, cv, doc), one of which (.doc) is not really machine readable. I think (but I can't be 100% sure without seeing the code) that the author has done a very good job under the circumstances, but probably had a harder time than was necessary given the bizarre construction of the spec. He or she probably would have appreciated more xsd content to do the heavy lifting, and certainly had to make a few fairly safe guesses along the way, like the "must have one of each of MS:1000501 and MS:1000500" thing.

- Brian |
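To make the mapping-file mechanism concrete, here is a minimal sketch, in Python, of the kind of context-specific rule a file like ms-mapping.xml encodes. It is not the actual validator code and does not use the real ms-mapping.xml syntax; the rule table, the function names, and the assumption that a selectionWindow requires exactly one MS:1000501 and one MS:1000500 cvParam are illustrative assumptions drawn from this thread.

    # Illustrative sketch only -- not the real validator and not the
    # ms-mapping.xml syntax; the rule table below is a made-up stand-in
    # for the kind of context-specific constraint that file encodes.
    import xml.etree.ElementTree as ET

    # Assumed rule (inferred from this thread): a selectionWindow must
    # contain exactly one of each of these cvParam accessions.
    SELECTION_WINDOW_RULE = {
        "MS:1000501": 1,  # scan m/z lower limit
        "MS:1000500": 1,  # scan m/z upper limit
    }

    def local_name(tag):
        # Drop any XML namespace, e.g. '{http://...}cvParam' -> 'cvParam'.
        return tag.rsplit("}", 1)[-1]

    def check_selection_window(window):
        """Return a list of rule violations for one selectionWindow element."""
        counts = {}
        for child in window:
            if local_name(child.tag) == "cvParam":
                acc = child.get("accession")
                counts[acc] = counts.get(acc, 0) + 1
        problems = []
        for acc, expected in SELECTION_WINDOW_RULE.items():
            found = counts.get(acc, 0)
            if found != expected:
                problems.append("expected %d cvParam(s) %s, found %d" % (expected, acc, found))
        for acc in counts:
            if acc not in SELECTION_WINDOW_RULE:
                problems.append("accession %s is not allowed inside selectionWindow" % acc)
        return problems

    # Example: flags the MS:1000528 substitution discussed in the thread.
    snippet = """<selectionWindow>
      <cvParam cvLabel="MS" accession="MS:1000528" name="lowest m/z value" value="400.0"/>
      <cvParam cvLabel="MS" accession="MS:1000500" name="scan m/z upper limit" value="1800.0"/>
    </selectionWindow>"""
    print(check_selection_window(ET.fromstring(snippet)))

Keeping rules of this kind in a separate, machine-readable mapping file is what allows the XSD to stay stable while the controlled vocabulary and its usage rules continue to evolve.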
From: Brian P. <bri...@in...> - 2007-10-15 22:20:10
|
Hello All,

I decided to fool around with the validator at http://eddie.thep.lu.se/prodac_validator/validator.pl to see how well that can be done in the presence of an inadequately specified file format. My plan was to take a valid file, mess with it, and see if the validator would notice.

A little hiccup at first - I gave it the automatically generated file http://psidev.cvs.sourceforge.net/*checkout*/psidev/psi/psi-ms/mzML/instanceFile/2min.mzML - it doesn't actually validate, claiming a missing index element. Somebody might want to check that out.

Then I gave it the handrolled http://psidev.cvs.sourceforge.net/*checkout*/psidev/psi/psi-ms/mzML/instanceFile/tiny4_LTQ-FT.mzML0.99.0.mzML - this validates fine. So, let the mayhem begin.

I tried removing the selectionWindow element surrounding the cvParams declaring the upper and lower bounds of the selection window, but the validator is XSD-aware, so it caught that easily.

Then I tried changing the accession numbers in the selection window for others that might be honestly conceptually mistaken by an incautious output module author: accession="MS:1000501" name="scan m/z lower limit" changed to accession="MS:1000528" name="lowest m/z value". The validator caught this as well, flagging the use of accession numbers that were incorrect for that context. But the knowledge behind this doesn't seem to come from the XSD or the CV file. So, how does the validator know?

Observation one: the validator doesn't appear to be open source (or if it is, a prominent link to the source should be provided). The use of a closed-source tool like this in a standards effort isn't a good idea, since it's hard to answer questions like the one above.

Apparently the author of the validator made excellent use of the documentation at http://psidev.cvs.sourceforge.net/*checkout*/psidev/psi/psi-ms/mzML/document/mzML0.99.0_specificationDocument.doc which stipulates in English that the only valid cvParams in that context are:
<cvParam cvLabel="MS" accession="MS:1000501" name="scan m/z lower limit" value="400.000000"/>
<cvParam cvLabel="MS" accession="MS:1000500" name="scan m/z upper limit" value="1800.000000"/>
Ignore for the moment that this appears to be an example rather than a spec. Do note, though, that there's nothing to say that one of each has to be present. Of course a reasonable human would probably infer this, but words like "reasonable human" and "infer" are not really what you want to hear when discussing a machine-readable data format standard.

Observation two: I'm not at all keen on the idea of a data format that relies on the understanding and simultaneous maintenance of three different artifacts (xsd, cv, doc), one of which (.doc) is not really machine readable. I think (but I can't be 100% sure without seeing the code) that the author has done a very good job under the circumstances, but probably had a harder time than was necessary given the bizarre construction of the spec. He or she probably would have appreciated more xsd content to do the heavy lifting, and certainly had to make a few fairly safe guesses along the way, like the "must have one of each of MS:1000501 and MS:1000500" thing.

- Brian |
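For anyone who wants to repeat the mutation test described above, the following is a minimal sketch that performs the same accession swap on a local copy of the example file. The file names and the plain-text substitution approach are assumptions; the original edit was presumably done by hand in a text editor.

    # A minimal sketch of the accession swap described above, done as a
    # plain-text substitution on a local copy of the example file.
    # The file names are assumptions; adjust the paths as needed.
    SRC = "tiny4_LTQ-FT.mzML0.99.0.mzML"
    DST = "tiny4_mutated.mzML"

    with open(SRC, encoding="utf-8") as handle:
        text = handle.read()

    # Swap 'scan m/z lower limit' (MS:1000501) for 'lowest m/z value'
    # (MS:1000528), the conceptual slip described in the message.
    mutated = text.replace(
        'accession="MS:1000501" name="scan m/z lower limit"',
        'accession="MS:1000528" name="lowest m/z value"',
    )

    with open(DST, "w", encoding="utf-8") as handle:
        handle.write(mutated)

    print("wrote", DST, "- submit it to the validator to see whether the semantic check fires")

If the validator behaves as described in the thread, the mutated file will still pass the XSD check but should be flagged by the semantic (CV mapping) check for using an accession that is out of context.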
From: Matthew C. <mat...@va...> - 2007-10-15 19:21:14
|
With the exception of the CV label (PSI in mzData vs. MS in mzML), will the CV accession IDs be the same between the two formats? -Matt |