From: Angel P. <an...@ma...> - 2006-10-05 17:36:47
The PSI Fall 2006 working group meeting in Washington, D.C. was a rousing success for the Mass Spec and Proteomics Informatics working groups. First and foremost, the working group chairs would like to thank everyone in attendance, as all put forth an unprecedented effort across the numerous activities. For those of you who were not able to attend, here are some highlights:

The general philosophy was "get things done," and to that end the attendees were split into multiple smaller groups with specific deliverables. These included a thorough review and sign-off of the mass spec search engine spreadsheet, work on the much-discussed merge of the mzData and mzXML formats, the analysisXML UML model, and the beginnings of the ontology for use with analysisXML. Fantastic progress was made on all of these fronts:

mzData/mzXML
-----------------------------------
The document describing the differences between the formats, produced and presented by Kent at the last meeting, was used to begin merging the schemas. The data arrays were worked out for the most part, as were the generalization of the instrument, protocol and parameter annotations. The work is slated to be finished by the end of the year, mostly by email and a satellite meeting in Seattle. Join the next conference call for further details.

Use cases for instrument modes (MRM, LC-MS, LC-MALDI MS/MS, neutral loss scans, etc.) were also submitted to be mapped onto the model.

The ability to computationally validate MS interchange data for MIAPE compliance was also discussed. The working group is considering encoding MIAPE concepts and terms into a CV or ontology, which could be used for future software validation of XML instance documents.

AnalysisXML UML model
-----------------------------------
Modeling was started using FuGE as a basis. A provisional model was created and turned into an XML schema using the AndroMDA tools.
Since time was a limiting factor, the development effort did not pay careful attention to documentation and diagram formatting, so the model is undergoing some cleanup before release to the rest of the WG.

AnalysisXML content and CV
-----------------------------------
Based on the search engine spreadsheet generated this summer, the group did a rigorous review of the content that AnalysisXML should carry. It mapped the current search engine outputs to the MIAPE requirements and the MCP guidelines for reporting mass spec search engine parameters, even adding a few that were missing. A first proposal for CV terms has been generated. The group was able to agree on a large subset of the parameter names and meanings, which are being added to the PSI ontology. Vendors will be asked for vendor-specific terms. Quantitative parameters were not covered. Jim Shofstal is currently cleaning up the document before sending it to the rest of the WG.

A discussion was raised about the possibility of homogenizing the accession codes used in the various engines. A main difficulty comes from the way the various tools interpret FASTA header lines. One proposal was to study the FASTA format generated by Phenyx, which is structured so that it clearly labels the different information types such as AC, description, taxonomy, PTMs, etc. Stay tuned for details regarding this.

Once again, thanks to everyone for attending and for all of the hard work that allowed such an amazing amount of progress in the short time we had together.

Cheers, from the PSI-MS and PSI-PI working group chairs and secretary:

Pierre-Alain Binz
David Creasy
Phil Jones
Randy Julian
Angel Pizarro