You can subscribe to this list here.
2016 |
Jan
|
Feb
|
Mar
|
Apr
|
May
(11) |
Jun
(14) |
Jul
|
Aug
(11) |
Sep
|
Oct
(4) |
Nov
(15) |
Dec
(1) |
---|---|---|---|---|---|---|---|---|---|---|---|---|
2017 |
Jan
(1) |
Feb
(9) |
Mar
(2) |
Apr
(2) |
May
(6) |
Jun
(7) |
Jul
(10) |
Aug
(4) |
Sep
(15) |
Oct
(9) |
Nov
(5) |
Dec
(7) |
2018 |
Jan
(5) |
Feb
(7) |
Mar
(3) |
Apr
(15) |
May
(5) |
Jun
(4) |
Jul
(4) |
Aug
(2) |
Sep
(4) |
Oct
(8) |
Nov
(6) |
Dec
(7) |
2019 |
Jan
(7) |
Feb
(2) |
Mar
(4) |
Apr
(7) |
May
(5) |
Jun
(6) |
Jul
(11) |
Aug
(5) |
Sep
(4) |
Oct
(7) |
Nov
(3) |
Dec
(4) |
2020 |
Jan
(5) |
Feb
(5) |
Mar
(8) |
Apr
(3) |
May
(5) |
Jun
(3) |
Jul
(9) |
Aug
(2) |
Sep
(2) |
Oct
(3) |
Nov
(5) |
Dec
(4) |
2021 |
Jan
(3) |
Feb
(4) |
Mar
(2) |
Apr
(4) |
May
(5) |
Jun
(9) |
Jul
(4) |
Aug
(8) |
Sep
(5) |
Oct
(3) |
Nov
(2) |
Dec
(2) |
2022 |
Jan
(2) |
Feb
(2) |
Mar
(3) |
Apr
(3) |
May
(4) |
Jun
(5) |
Jul
(5) |
Aug
(4) |
Sep
(6) |
Oct
(2) |
Nov
(5) |
Dec
|
2023 |
Jan
(7) |
Feb
(5) |
Mar
(5) |
Apr
(2) |
May
(11) |
Jun
(3) |
Jul
(1) |
Aug
(2) |
Sep
(1) |
Oct
(1) |
Nov
(2) |
Dec
(2) |
2024 |
Jan
(2) |
Feb
(3) |
Mar
|
Apr
(2) |
May
(2) |
Jun
(5) |
Jul
(6) |
Aug
(2) |
Sep
(1) |
Oct
(7) |
Nov
(3) |
Dec
(1) |
2025 |
Jan
(3) |
Feb
(2) |
Mar
(1) |
Apr
(2) |
May
(2) |
Jun
(1) |
Jul
(4) |
Aug
(2) |
Sep
(2) |
Oct
(1) |
Nov
|
Dec
|
From: David T. <dt...@su...> - 2018-04-12 05:04:04
|
Well done, Wout! I like that you were able to show instrument readings this way; I remember your working on capturing these readings in the iMonDB database, so it makes sense that we'd want to represent them in qcML. As we move to the outlier.qcml, I see that you're using RawFile to point to a source qcML document (I had earlier used RawFile to point to an mzML rather than a RAW). We may want to work out just what RawFile is supposed to represent or create other terms to represent later products in the pipeline. As Wout notes, though, the RawFile section allows us to specify the file type. The "Meta-analysis settings" set thresholds for variability; as I understand it, though, these thresholds would be applied on individual metrics rather than a dimensionality-reduced set (e.g. via PCA). When we dip down to the comment below below the metaDataParameters section, though, it appears that Wout is giving weights to show how to combine scores, perhaps in a linear combination of metrics to optimize the amount of variability explained? A PCA would really only be feasible for a defined set of metrics from a bunch of input files. This same transform, however, might then be applied to new qcML files that were not part of the set used to conduct PCA. The transform itself does not tell us that a qcML is an outlier or not. Instead, it accepts the quality metrics as an input and outputs the coordinates of this qcML in the transformed space; we can then compute distances in PC space between pairs of qcML files, and an abnormally large distance in PC space implies an outlier. Wout, thank you for trying the multi-file analysis for us! Yes, I think this highlights some areas where we have been vague to date. Merci, Dave On 4/11/2018 10:48 PM, Bittremieux Wout wrote: > Dear colleagues, > > I have also prepared two handcrafted example files: > - one from the iMonDB containing instrument parameters as opposed to > ID-free/ID-based spectral metrics > - one from a meta-analysis to detect low-quality experiments > > The files include some annotations about things to discuss. As you can > see especially for the meta-analysis I'm currently not sure how to > correctly store this information in a qcML file. > > Best, > Wout > > > > On 10 Apr 2018, at 10:15, Bittremieux Wout > <wou...@ua...> wrote: > > > > Hi Dave, > > > > Sorry for the delayed answer to your questions. As far as I'm aware > (please someone correct me if I'm wrong): > > > > - OBO and OWL are two alternative file formats to specify controlled > vocabularies and ontologies. We use the OBO format for our CV. This is > indeed a simpler format than OWL and can be viewed relatively easily > in a simple text editor. Alternatively, Martin has previously > recommended OBO-Edit to visualize the relationships between the > various terms. > > > > - In our previous discussions we have indeed said that in principle > every tool gets its own range of CV accessions. This would enable a > new tool to easily start producing compliant qcML files without having > to check dependencies on other tools. It might make sense to reuse > some trivial definitions though for metrics that don't involve any > computations, such the number of MS/MS scans. On the other hand, how > will downstream tools handle conflicting metrics coming from different > tools? Although in that case maybe the tool authors should be the main > persons responsible to worry about this rather than us. > > In any case, it would be useful to explicitly document how tools can > get CV accessions. > > > > - To report the unit for a single QC metric you can use the > unitAccession/unitName/unitCvRef attributes for any XML element that > is derived from the CVParamType, which qualityParameter elements are. > The CVParamType is specified in our XML schema. > > > > - This XML schema can also be used for simple syntactic validation > and some semantic validation. Because we have a mix between XML and > JSON unfortunately some external semantic validation will have to be > explicitly coded as well though. > > Any decent XML editor should have built-in functionality to validate > XML files against a schema, and otherwise there are various > command-line tools and linters you can use to do that as well. > However, the previously linked v0.0.10 XML schema on GitHub does not > seem to be fully up to date anymore at the moment, and I think the > most recent XML schema is available in Mathias' ongoing pull request. > > > > - As to metrics numbering I don't think we have a process for that > yet. I guess now it's just first-come-first-served. > > > > Best, > > Wout > > > >> On 09 Apr 2018, at 23:56, David Tabb <dt...@su...> wrote: > >> > >> Hi, all. > >> > >> I have not yet received an answer to the three questions below. I > have, > >> however, uploaded it to qcdev, where you can find it at this URL: > >> > https://github.com/HUPO-PSI/qcML-development/blob/master/20180403-1091_Pool_start_v0.8.qc.xml. > >> > >> Merci! > >> Dave > >> > >> On 4/5/2018 11:29 AM, David Tabb wrote: > >>> Hi, all. > >>> > >>> At long last, I have completed my "homework" for Heidelberg! I have > >>> created a draft XML to represent the qcML output for computing > QuaMeter > >>> IDFree metrics for a single input mzML file (see text inline > below). I > >>> would greatly appreciate answers to the following questions to > complete > >>> this project: > >>> > >>> 1) If I am reporting only a single value for a metric (such as > >>> "XIC-WideFrac"), how do I report the unit for the metric? > >>> > >>> 2) Do we have an easy way to validate a draft file like this, at least > >>> to determine that I have matching end tags for each one I open? I've > >>> been using Emacs, which tries to help, but a dedicated XML editor > might > >>> be preferable. > >>> > >>> 3) Who officially designates what each metric will be numbered in > the CV? > >>> > >>> Thanks, > >>> Dave > >>> -------------------------------------------------------- > >>> <?xml version="1.0" encoding="UTF-8"?> > >>> <qcML xmlns="http://www.prime-xs.eu/ms/qcml" > >>> xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" > >>> xsi:schemaLocation="http://www.prime-xs.eu/ms/qcml > >>> file:/home/walzer/psi/qcML-development/schema/v0_0_10/qcML_0_0_10.xsd" > >>> version="0.0.10"> > >>> <runQuality ID="ID001"> > >>> <metaDataParameters ID="fileprovenance001" cvRef="?" > >>> accession="?" name="?" description="do we need cv for toplevel" > >>> value="all cv attributes are optional anyway"> > >>> <InputFiles> > >>> <RawFile > >>> location="C:\Research\20171124-Lizex-Chia\1091_Pool_start.mzML" > >>> id="ID001" name="1091_Pool_start.mzML"> > >>> <FileFormat> > >>> <cvParam cvRef="PSI-MS" accession="MS:1000584" > >>> name="mzML format"/> > >>> </FileFormat> > >>> <!-- In the following line, I computed the md5sum for the > >>> peak-picked mzML, not the RAW! --> > >>> <cvParam cvRef="PSI-MS" accession="MS:1000568" name="MD5" > >>> value="b583f6d2a91b4749d5a75885330f6e5d" /> > >>> <cvParam cvRef="PSI-MS" accession="MS:1000747" > >>> name="completion time" value="2017-12-08-T15:38:57Z" /> > >>> </RawFile> > >>> </InputFiles> > >>> </metaDataParameters> > >>> <!-- Question to consider: how should I link a metric below to the > >>> concept of "liquid chromatography" or "electrospray ionization?" Is > >>> this appropriate--> > >>> <!-- Units I employ below: --> > >>> <!-- UO:0000191 "fraction" --> > >>> <!-- UO:0000010 "second" --> > >>> <!-- UO:0010006 "ratio" --> > >>> <!-- UO:0000189 "count" --> > >>> <!-- UO:0000106 "hertz" --> > >>> <!-- When a line gives a single metric, where do I indicate the > >>> unit type? UO:0000191 "fraction" --> > >>> <qualityParameter ID="XIC-WideFrac" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Fraction of > precursor ions > >>> accounting for the top half of all peak width" value="0.206807"/> > >>> <qualityParameter ID="XIC-FWHM" cvRef="PSI-QC-CV" accession="QC:" > >>> name="QuaMeter IDFree Metric- Distribution of peak widths for the wide > >>> XICs"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" > >>> > value="3">{'UO:0000010':[12.5377,14.2244,16.9234]}</content></qualityParameter> > >>> > >>> <!-- UO:0010006 is "ratio" rather than "log ratio"--> > >>> <qualityParameter ID="XIC-Height" cvRef="PSI-QC-CV" accession="QC:" > >>> name="QuaMeter IDFree Metric- Distribution of peak log ratio > heights for > >>> the wide XICs"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" > >>> > value="3">{'UO:0010006':[0.776393,0.93114,6.6283]}</content></qualityParameter> > >>> > >>> <!-- In the following, where do I indicate the unit type? > >>> UO:0000010 "second" --> > >>> <qualityParameter ID="RT-Duration" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Highest scan time > observed > >>> minus the lowest scan time observed" value="4920.17"/> > >>> <qualityParameter ID="RT-TIC" cvRef="PSI-QC-CV" accession="QC:" > >>> name="QuaMeter IDFree Metric- Distribution of TIC accumulation as > >>> fraction of RT-Duration"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" > >>> > value="4">{'UO:0000191':[0.301236,0.13286,0.174576,0.391328]}</content></qualityParameter> > >>> > >>> <qualityParameter ID="RT-MS" cvRef="PSI-QC-CV" accession="QC:" > >>> name="QuaMeter IDFree Metric- Distribution of MS1 acquisition as > >>> fraction of RT-Duration"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" > >>> > value="4">{'UO:0000191':[0.217794,0.272976,0.275845,0.233385]}</content></qualityParameter> > >>> > >>> <qualityParameter ID="RT-MSMS" cvRef="PSI-QC-CV" accession="QC:" > >>> name="QuaMeter IDFree Metric- Distribution of MS2 acquisition as > >>> fraction of RT-Duration"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" > >>> > value="4">{'UO:0000191':[0.268157,0.233516,0.236373,0.261954]}</content></qualityParameter> > >>> > >>> <qualityParameter ID="MS1-TIC-Change" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Distribution of log > ratios > >>> of MS1 scan-to-scan TIC changes"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" > >>> > value="3">{'UO:0010006':[0.870774,0.900585,4.66521]}</content></qualityParameter> > >>> > >>> <qualityParameter ID="MS1-TIC" cvRef="PSI-QC-CV" accession="QC:" > >>> name="QuaMeter IDFree Metric- Log ratios of MS1 scan TICs"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" > >>> > value="3">{'UO:0010006':[0.568866,0.815636,1.18124]}</content></qualityParameter> > >>> > >>> <!-- In the following, where do I indicate the unit type? > >>> UO:0000189 "count" --> > >>> <qualityParameter ID="MS1-Count" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Number of MS1 scans > >>> acquired" value="7832"/> > >>> <!-- In the following, where do I indicate the unit type? > >>> UO:0000106 "hertz" --> > >>> <qualityParameter ID="MS1-Freq-Max" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Maximum frequency > for MS1 > >>> scan acquisition" value="2.41814"/> > >>> <qualityParameter ID="MS1-Density" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak > >>> counts for MS1 scans"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" > >>> value="3">{'UO:0000189:[693,1205,1424]}</content></qualityParameter> > >>> <!-- In the following, where do I indicate the unit type? > >>> UO:0000189 "count" --> > >>> <qualityParameter ID="MS2-Count" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Number of MS2 scans > >>> acquired" value="33495"/> > >>> <!-- In the following, where do I indicate the unit type? > >>> UO:0000106 "hertz" --> > >>> <qualityParameter ID="MS2-Freq-Max" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Maximum frequency > for MS2 > >>> scan acquisition" value="7.33107"/> > >>> <qualityParameter ID="MS2-Density" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak > >>> counts for MS2 scans"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" > >>> value="3">{'UO:0000189:[27,44,70]}</content></qualityParameter> > >>> <!-- How do we specify that the values of a vector sum to 1? How > >>> about this case, where two vectors together sume to 1?--> > >>> <qualityParameter ID="MS2-PrecZ-Known" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Fraction of known > >>> precursor charges for +1, +2, ..., n, more than n"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" > >>> > value="4">{'UO:0000191':[0,0.15047,0.0687565,0.00877743,0.000507538,0.000268697]}</content></qualityParameter> > >>> > >>> <!-- For the final metric, I used a reporting type where the number > >>> of elements in vector is fixed: QC:3000007--> > >>> <qualityParameter ID="MS2-PrecZ-Unknown" cvRef="PSI-QC-CV" > >>> accession="QC:" name="QuaMeter IDFree Metric- Fraction of unknown > >>> precursor charges for +1, more than +1"> > >>> <content cvRef="PSI-QC-CV" accession="QC:3000007" > >>> > value="2">{'UO:0000191':[0.54235,0.22887]}</content></qualityParameter> > >>> </runQuality> > >>> <cvList> > >>> <cv fullName="The HUPO-PSI QC WG metrics ontology in obo > >>> format" uri="http://www.github.com/HUPO-PSI/.../" ID="PSI-QC-CV"/> > >>> </cvList> > >>> </qcML> > >>> -------------------------------------------------------- > >>> > >>> On 4/5/2018 9:47 AM, David Tabb wrote: > >>>> Hi, all. > >>>> > >>>> In creating my hand-crafted example of qcML from QuaMeter IDFree, > I've > >>>> installed Protege, an ontology viewer. It's useful for perusing the > >>>> HUPO-PSI MS ontology > (http://purl.obolibrary.org/obo/ms/4.1.2/ms.owl). > >>>> > >>>> I'm a bit confused, though, in how to review the qcML ontology. At > >>>> present, I can find an OBO to download > >>>> > (https://github.com/HUPO-PSI/qcML-development/raw/master/cv/v0_0_10/qc-cv.obo), > >>>> > >>>> > >>>> but I do not see an OWL equivalent. Helpfully, I can simply > review the > >>>> OBO in a text editor. > >>>> > >>>> In particular, I am trying to determine which QC accessions each > of my > >>>> IDFree metrics will represent. Does each metric generator get a > >>>> particular series of numbers that relate to that software's outputs? > >>>> Can I reuse a metric accession from other software if my tool > generates > >>>> the same values (such as the number of MS/MS scans)? > >>>> > >>>> Jinmeng Jia, will you be able to share your paragraph that meets the > >>>> draft MIAPE QC standard with Weimin Zhu for him to present at > >>>> Heidelberg? I was sorry to hear you won't be able to attend > yourself. > >>>> > >>>> Thanks, > >>>> Dave > >>>> > >>>> > [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx > <http://cdn.sun.ac.za/100/ProductionFooter.jpg]%3Chttp://www.sun.ac.za/english/Pages/Water-crisis.aspx>> > >>>> > >>>> > >>>> > >>>> The integrity and confidentiality of this email is governed by these > >>>> terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> > >>>> Die integriteit en vertroulikheid van hierdie e-pos word deur die > >>>> volgende bepalings gereël. > >>>> Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> > >>>> > >>>> > ------------------------------------------------------------------------------ > >>>> > >>>> > >>>> Check out the vibrant tech community on one of the world's most > >>>> engaging tech sites, Slashdot.org! http://sdm.link/slashdot > >>>> _______________________________________________ > >>>> Psidev-qc-dev mailing list > >>>> Psi...@li... > >>>> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev > >>> > >>> > >>> > [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx > <http://cdn.sun.ac.za/100/ProductionFooter.jpg]%3Chttp://www.sun.ac.za/english/Pages/Water-crisis.aspx>> > >>> > >>> > >>> The integrity and confidentiality of this email is governed by these > >>> terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> > >>> Die integriteit en vertroulikheid van hierdie e-pos word deur die > >>> volgende bepalings gereël. > >>> Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> > >>> > >>> > ------------------------------------------------------------------------------ > >>> > >>> Check out the vibrant tech community on one of the world's most > >>> engaging tech sites, Slashdot.org! http://sdm.link/slashdot > >>> _______________________________________________ > >>> Psidev-qc-dev mailing list > >>> Psi...@li... > >>> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev > >> > >> > >> > [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx > <http://cdn.sun.ac.za/100/ProductionFooter.jpg]%3Chttp://www.sun.ac.za/english/Pages/Water-crisis.aspx>> > >> > >> The integrity and confidentiality of this email is governed by > these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> > >> Die integriteit en vertroulikheid van hierdie e-pos word deur die > volgende bepalings gereël. > Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> > >> > >> > ------------------------------------------------------------------------------ > >> Check out the vibrant tech community on one of the world's most > >> engaging tech sites, Slashdot.org! http://sdm.link/slashdot > >> _______________________________________________ > >> Psidev-qc-dev mailing list > >> Psi...@li... > >> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev > > > > > ------------------------------------------------------------------------------ > > Check out the vibrant tech community on one of the world's most > > engaging tech sites, Slashdot.org! > http://sdm.link/slashdot_______________________________________________ > > Psidev-qc-dev mailing list > > Psi...@li... > > https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev > > > > ------------------------------------------------------------------------------ > Check out the vibrant tech community on one of the world's most > engaging tech sites, Slashdot.org! http://sdm.link/slashdot > > > _______________________________________________ > Psidev-qc-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev |
From: Bittremieux W. <wou...@ua...> - 2018-04-11 20:48:21
|
Dear colleagues, I have also prepared two handcrafted example files: - one from the iMonDB containing instrument parameters as opposed to ID-free/ID-based spectral metrics - one from a meta-analysis to detect low-quality experiments The files include some annotations about things to discuss. As you can see especially for the meta-analysis I'm currently not sure how to correctly store this information in a qcML file. Best, Wout > On 10 Apr 2018, at 10:15, Bittremieux Wout <wou...@ua...> wrote: > > Hi Dave, > > Sorry for the delayed answer to your questions. As far as I'm aware (please someone correct me if I'm wrong): > > - OBO and OWL are two alternative file formats to specify controlled vocabularies and ontologies. We use the OBO format for our CV. This is indeed a simpler format than OWL and can be viewed relatively easily in a simple text editor. Alternatively, Martin has previously recommended OBO-Edit to visualize the relationships between the various terms. > > - In our previous discussions we have indeed said that in principle every tool gets its own range of CV accessions. This would enable a new tool to easily start producing compliant qcML files without having to check dependencies on other tools. It might make sense to reuse some trivial definitions though for metrics that don't involve any computations, such the number of MS/MS scans. On the other hand, how will downstream tools handle conflicting metrics coming from different tools? Although in that case maybe the tool authors should be the main persons responsible to worry about this rather than us. > In any case, it would be useful to explicitly document how tools can get CV accessions. > > - To report the unit for a single QC metric you can use the unitAccession/unitName/unitCvRef attributes for any XML element that is derived from the CVParamType, which qualityParameter elements are. The CVParamType is specified in our XML schema. > > - This XML schema can also be used for simple syntactic validation and some semantic validation. Because we have a mix between XML and JSON unfortunately some external semantic validation will have to be explicitly coded as well though. > Any decent XML editor should have built-in functionality to validate XML files against a schema, and otherwise there are various command-line tools and linters you can use to do that as well. However, the previously linked v0.0.10 XML schema on GitHub does not seem to be fully up to date anymore at the moment, and I think the most recent XML schema is available in Mathias' ongoing pull request. > > - As to metrics numbering I don't think we have a process for that yet. I guess now it's just first-come-first-served. > > Best, > Wout > >> On 09 Apr 2018, at 23:56, David Tabb <dt...@su...> wrote: >> >> Hi, all. >> >> I have not yet received an answer to the three questions below. I have, >> however, uploaded it to qcdev, where you can find it at this URL: >> https://github.com/HUPO-PSI/qcML-development/blob/master/20180403-1091_Pool_start_v0.8.qc.xml. >> >> Merci! >> Dave >> >> On 4/5/2018 11:29 AM, David Tabb wrote: >>> Hi, all. >>> >>> At long last, I have completed my "homework" for Heidelberg! I have >>> created a draft XML to represent the qcML output for computing QuaMeter >>> IDFree metrics for a single input mzML file (see text inline below). I >>> would greatly appreciate answers to the following questions to complete >>> this project: >>> >>> 1) If I am reporting only a single value for a metric (such as >>> "XIC-WideFrac"), how do I report the unit for the metric? >>> >>> 2) Do we have an easy way to validate a draft file like this, at least >>> to determine that I have matching end tags for each one I open? I've >>> been using Emacs, which tries to help, but a dedicated XML editor might >>> be preferable. >>> >>> 3) Who officially designates what each metric will be numbered in the CV? >>> >>> Thanks, >>> Dave >>> -------------------------------------------------------- >>> <?xml version="1.0" encoding="UTF-8"?> >>> <qcML xmlns="http://www.prime-xs.eu/ms/qcml" >>> xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" >>> xsi:schemaLocation="http://www.prime-xs.eu/ms/qcml >>> file:/home/walzer/psi/qcML-development/schema/v0_0_10/qcML_0_0_10.xsd" >>> version="0.0.10"> >>> <runQuality ID="ID001"> >>> <metaDataParameters ID="fileprovenance001" cvRef="?" >>> accession="?" name="?" description="do we need cv for toplevel" >>> value="all cv attributes are optional anyway"> >>> <InputFiles> >>> <RawFile >>> location="C:\Research\20171124-Lizex-Chia\1091_Pool_start.mzML" >>> id="ID001" name="1091_Pool_start.mzML"> >>> <FileFormat> >>> <cvParam cvRef="PSI-MS" accession="MS:1000584" >>> name="mzML format"/> >>> </FileFormat> >>> <!-- In the following line, I computed the md5sum for the >>> peak-picked mzML, not the RAW! --> >>> <cvParam cvRef="PSI-MS" accession="MS:1000568" name="MD5" >>> value="b583f6d2a91b4749d5a75885330f6e5d" /> >>> <cvParam cvRef="PSI-MS" accession="MS:1000747" >>> name="completion time" value="2017-12-08-T15:38:57Z" /> >>> </RawFile> >>> </InputFiles> >>> </metaDataParameters> >>> <!-- Question to consider: how should I link a metric below to the >>> concept of "liquid chromatography" or "electrospray ionization?" Is >>> this appropriate--> >>> <!-- Units I employ below: --> >>> <!-- UO:0000191 "fraction" --> >>> <!-- UO:0000010 "second" --> >>> <!-- UO:0010006 "ratio" --> >>> <!-- UO:0000189 "count" --> >>> <!-- UO:0000106 "hertz" --> >>> <!-- When a line gives a single metric, where do I indicate the >>> unit type? UO:0000191 "fraction" --> >>> <qualityParameter ID="XIC-WideFrac" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Fraction of precursor ions >>> accounting for the top half of all peak width" value="0.206807"/> >>> <qualityParameter ID="XIC-FWHM" cvRef="PSI-QC-CV" accession="QC:" >>> name="QuaMeter IDFree Metric- Distribution of peak widths for the wide >>> XICs"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" >>> value="3">{'UO:0000010':[12.5377,14.2244,16.9234]}</content></qualityParameter> >>> >>> <!-- UO:0010006 is "ratio" rather than "log ratio"--> >>> <qualityParameter ID="XIC-Height" cvRef="PSI-QC-CV" accession="QC:" >>> name="QuaMeter IDFree Metric- Distribution of peak log ratio heights for >>> the wide XICs"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" >>> value="3">{'UO:0010006':[0.776393,0.93114,6.6283]}</content></qualityParameter> >>> >>> <!-- In the following, where do I indicate the unit type? >>> UO:0000010 "second" --> >>> <qualityParameter ID="RT-Duration" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Highest scan time observed >>> minus the lowest scan time observed" value="4920.17"/> >>> <qualityParameter ID="RT-TIC" cvRef="PSI-QC-CV" accession="QC:" >>> name="QuaMeter IDFree Metric- Distribution of TIC accumulation as >>> fraction of RT-Duration"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" >>> value="4">{'UO:0000191':[0.301236,0.13286,0.174576,0.391328]}</content></qualityParameter> >>> >>> <qualityParameter ID="RT-MS" cvRef="PSI-QC-CV" accession="QC:" >>> name="QuaMeter IDFree Metric- Distribution of MS1 acquisition as >>> fraction of RT-Duration"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" >>> value="4">{'UO:0000191':[0.217794,0.272976,0.275845,0.233385]}</content></qualityParameter> >>> >>> <qualityParameter ID="RT-MSMS" cvRef="PSI-QC-CV" accession="QC:" >>> name="QuaMeter IDFree Metric- Distribution of MS2 acquisition as >>> fraction of RT-Duration"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" >>> value="4">{'UO:0000191':[0.268157,0.233516,0.236373,0.261954]}</content></qualityParameter> >>> >>> <qualityParameter ID="MS1-TIC-Change" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Distribution of log ratios >>> of MS1 scan-to-scan TIC changes"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" >>> value="3">{'UO:0010006':[0.870774,0.900585,4.66521]}</content></qualityParameter> >>> >>> <qualityParameter ID="MS1-TIC" cvRef="PSI-QC-CV" accession="QC:" >>> name="QuaMeter IDFree Metric- Log ratios of MS1 scan TICs"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" >>> value="3">{'UO:0010006':[0.568866,0.815636,1.18124]}</content></qualityParameter> >>> >>> <!-- In the following, where do I indicate the unit type? >>> UO:0000189 "count" --> >>> <qualityParameter ID="MS1-Count" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Number of MS1 scans >>> acquired" value="7832"/> >>> <!-- In the following, where do I indicate the unit type? >>> UO:0000106 "hertz" --> >>> <qualityParameter ID="MS1-Freq-Max" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Maximum frequency for MS1 >>> scan acquisition" value="2.41814"/> >>> <qualityParameter ID="MS1-Density" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak >>> counts for MS1 scans"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" >>> value="3">{'UO:0000189:[693,1205,1424]}</content></qualityParameter> >>> <!-- In the following, where do I indicate the unit type? >>> UO:0000189 "count" --> >>> <qualityParameter ID="MS2-Count" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Number of MS2 scans >>> acquired" value="33495"/> >>> <!-- In the following, where do I indicate the unit type? >>> UO:0000106 "hertz" --> >>> <qualityParameter ID="MS2-Freq-Max" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Maximum frequency for MS2 >>> scan acquisition" value="7.33107"/> >>> <qualityParameter ID="MS2-Density" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak >>> counts for MS2 scans"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" >>> value="3">{'UO:0000189:[27,44,70]}</content></qualityParameter> >>> <!-- How do we specify that the values of a vector sum to 1? How >>> about this case, where two vectors together sume to 1?--> >>> <qualityParameter ID="MS2-PrecZ-Known" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Fraction of known >>> precursor charges for +1, +2, ..., n, more than n"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000008" >>> value="4">{'UO:0000191':[0,0.15047,0.0687565,0.00877743,0.000507538,0.000268697]}</content></qualityParameter> >>> >>> <!-- For the final metric, I used a reporting type where the number >>> of elements in vector is fixed: QC:3000007--> >>> <qualityParameter ID="MS2-PrecZ-Unknown" cvRef="PSI-QC-CV" >>> accession="QC:" name="QuaMeter IDFree Metric- Fraction of unknown >>> precursor charges for +1, more than +1"> >>> <content cvRef="PSI-QC-CV" accession="QC:3000007" >>> value="2">{'UO:0000191':[0.54235,0.22887]}</content></qualityParameter> >>> </runQuality> >>> <cvList> >>> <cv fullName="The HUPO-PSI QC WG metrics ontology in obo >>> format" uri="http://www.github.com/HUPO-PSI/.../" ID="PSI-QC-CV"/> >>> </cvList> >>> </qcML> >>> -------------------------------------------------------- >>> >>> On 4/5/2018 9:47 AM, David Tabb wrote: >>>> Hi, all. >>>> >>>> In creating my hand-crafted example of qcML from QuaMeter IDFree, I've >>>> installed Protege, an ontology viewer. It's useful for perusing the >>>> HUPO-PSI MS ontology (http://purl.obolibrary.org/obo/ms/4.1.2/ms.owl). >>>> >>>> I'm a bit confused, though, in how to review the qcML ontology. At >>>> present, I can find an OBO to download >>>> (https://github.com/HUPO-PSI/qcML-development/raw/master/cv/v0_0_10/qc-cv.obo), >>>> >>>> >>>> but I do not see an OWL equivalent. Helpfully, I can simply review the >>>> OBO in a text editor. >>>> >>>> In particular, I am trying to determine which QC accessions each of my >>>> IDFree metrics will represent. Does each metric generator get a >>>> particular series of numbers that relate to that software's outputs? >>>> Can I reuse a metric accession from other software if my tool generates >>>> the same values (such as the number of MS/MS scans)? >>>> >>>> Jinmeng Jia, will you be able to share your paragraph that meets the >>>> draft MIAPE QC standard with Weimin Zhu for him to present at >>>> Heidelberg? I was sorry to hear you won't be able to attend yourself. >>>> >>>> Thanks, >>>> Dave >>>> >>>> [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> >>>> >>>> >>>> >>>> The integrity and confidentiality of this email is governed by these >>>> terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> >>>> Die integriteit en vertroulikheid van hierdie e-pos word deur die >>>> volgende bepalings gereël. >>>> Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> >>>> >>>> ------------------------------------------------------------------------------ >>>> >>>> >>>> Check out the vibrant tech community on one of the world's most >>>> engaging tech sites, Slashdot.org! http://sdm.link/slashdot >>>> _______________________________________________ >>>> Psidev-qc-dev mailing list >>>> Psi...@li... >>>> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev >>> >>> >>> [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> >>> >>> >>> The integrity and confidentiality of this email is governed by these >>> terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> >>> Die integriteit en vertroulikheid van hierdie e-pos word deur die >>> volgende bepalings gereël. >>> Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> >>> >>> ------------------------------------------------------------------------------ >>> >>> Check out the vibrant tech community on one of the world's most >>> engaging tech sites, Slashdot.org! http://sdm.link/slashdot >>> _______________________________________________ >>> Psidev-qc-dev mailing list >>> Psi...@li... >>> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev >> >> >> [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> >> >> The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> >> Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> >> >> ------------------------------------------------------------------------------ >> Check out the vibrant tech community on one of the world's most >> engaging tech sites, Slashdot.org! http://sdm.link/slashdot >> _______________________________________________ >> Psidev-qc-dev mailing list >> Psi...@li... >> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev > > ------------------------------------------------------------------------------ > Check out the vibrant tech community on one of the world's most > engaging tech sites, Slashdot.org! http://sdm.link/slashdot_______________________________________________ > Psidev-qc-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev |
From: Hugo L. F. <hlf...@uv...> - 2018-04-11 13:56:18
|
Fine, I will try to send it by April 17th. Thank you for including my picture! Regards, Hugo. El 11/04/18 a las 10:52, David Tabb escribió: > I really appreciate it, sir! > > Our meeting will take place from April 18-20th. If we can discuss it > during the 19th, that would be ideal! > > I have included your image in the update I will be providing at the > start of the meeting (see attached). > > Merci, > Dave > > On 4/11/2018 11:40 AM, Hugo López Fernández wrote: >> Hi David, >> >> sure, I would be pleased to provide such draft. Please, let me know >> the deadline for sending you the document. I will have a meeting >> tomorrow or the day after tomorrow with my Mass-Up collaborators to >> address this. >> >> Regards, >> >> Hugo. >> >> El 11/04/18 a las 09:41, David Tabb escribió: >>> Hi, Hugo. >>> >>> Would you be willing to assemble a draft qcML document that might be >>> created by your Mass-Up software? We will be looking at such examples >>> at our annual HUPO-PSI meeting in a week. I am providing a pointer >>> to a >>> draft qcML example of the quality metrics that are produced within the >>> QuaMeter "IDFree" mode. >>> https://github.com/HUPO-PSI/qcML-development/blob/master/20180403-1091_Pool_start_v0.8.qc.xml >>> >>> >>> >>> Thanks, >>> Dave >>> >>> On 1/20/2017 1:31 PM, Hugo López Fernández wrote: >>>> Hello David, >>>> >>>> I am Hugo, we met last week in Semmering. I hope this email finds you >>>> well and that you had a good trip to back home. >>>> >>>> As we talked in the EuBIC, I am writing you to let you know more about >>>> the quality control analysis that we have included in Mass-Up >>>> (http://sing-group.org/mass-up/). This quality control is intended to >>>> work with peak lists. We would like to incorporate quality control for >>>> raw data, specially to detect batch effects as I also commented you. >>>> >>>> Basically, the quality control (which is explained with most details >>>> in the paper http://doi.org/10.1186/s12859-015-0752-4) can be done at >>>> two levels: at the replicates leve and at the samples level, which >>>> includes additional information from the intra-sample m/z matching >>>> process and consensus spectrum creation (this is because our >>>> collaborators usually want to reduce replicates spectra to a unique >>>> sample "consensus" spectrum). You can find attached the quality >>>> control image included in the paper. >>>> >>>> At the replicates level, the user can check basic information about >>>> each individual spectrum (i.e. peak count, m/z range, intensity >>>> ranges, etc.) and compare all spectra in the dataset. At the samples >>>> level, the user can check the performance of the intra-sample peak >>>> matching process, by comparing the percentages of presence (POP) >>>> counts (i.e.: the counts of peaks that are present in, for example, >>>> 60%, 80% or 100% of replicates) and the POPs of each sample. >>>> >>>> In spite of being a very simple quality control it allowed us to >>>> detect some problems with datasets and we encourage our collaborators >>>> to have a quick look at this quality control metrics before any other >>>> analysis. Unfortunately they usually don't but we must encourage good >>>> practices, which is the reason why I am developing this other software >>>> (http://www.sing-group.org/s2p/), also presented in other poster at >>>> the EuBIC. Basically it is a software to manage, process and integrate >>>> different data sources (Mascot identifications, MALDI plates, 2D-gel >>>> spots). It probably will not revolutionize bioinformatics but it is >>>> allowing the research group to process data efficiently and in a >>>> reproducible way, a totally different scenario than wen I came here >>>> six months ago. >>>> >>>> As I mentioned previously we also would like to include quality >>>> control metrics for MALDI-TOF raw data, with special focus in batch >>>> effect detection (which seems to a common problem here). Regarding >>>> batch effect, I would like to apply this statistic >>>> (http://dx.doi.org/10.1093/bioinformatics/btt480) based on guided >>>> principal component analysis to detect batch effects in MALDI-TOF data >>>> (some people applied it to LC-MS metabolomic data >>>> [http://dx.doi.org/10.1016/j.talanta.2014.07.031]). I would like to >>>> develop this work this year if I get public MALDI-TOF datasets where >>>> batch effect presence has been publicly reported (I found a few >>>> reported but I could not get the data to analyze it yet). >>>> >>>> I will be happy to answer any question you may have or to receive any >>>> feedback from you. Looking forward to see you again, in other >>>> conference or wherever. >>>> >>>> Best regards, >>>> >>>> Hugo. >>>> >>> >>> [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> >>> >>> >>> >>> The integrity and confidentiality of this email is governed by these >>> terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> >>> Die integriteit en vertroulikheid van hierdie e-pos word deur die >>> volgende bepalings gereël. >>> Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> >>> >> > > [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> > > > The integrity and confidentiality of this email is governed by these > terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> > Die integriteit en vertroulikheid van hierdie e-pos word deur die > volgende bepalings gereël. > Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> -- --------------------------------------------------------------------------- Hugo López-Fernández, PhD Email: hlf...@uv... Web: http://www.sing-group.org/~hlfernandez/ --------------------------------------------------------------------------- SING Research Group http://www.sing-group.org ESEI: Escuela Superior de Ingeniería Informática "Politécnico" Building, Room 306 "As Lagoas" Campus 32004 - Ourense - Spain --------------------------------------------------------------------------- CINBIO: Centro de Investigaciones Biomédicas http://cinbio.es/en/si4-next-generation-computer-systems-group/ --------------------------------------------------------------------------- IISGS: Instituto de Investigación Sanitaria Galicia Sur http://www.iisgaliciasur.es/sistemas-informaticos-de-nueva-generacion-sing/ --------------------------------------------------------------------------- The information in this e-mail and in any attachments is confidential and intended exclusively for the named addressee(s). Any use of this information not in accordance with its purpose, any dissemination or disclosure, either whole or partial, is prohibited except if formally approved. |
From: David T. <dt...@su...> - 2018-04-11 10:08:06
|
I really appreciate it, sir! Our meeting will take place from April 18-20th. If we can discuss it during the 19th, that would be ideal! I have included your image in the update I will be providing at the start of the meeting (see attached). Merci, Dave On 4/11/2018 11:40 AM, Hugo López Fernández wrote: > Hi David, > > sure, I would be pleased to provide such draft. Please, let me know > the deadline for sending you the document. I will have a meeting > tomorrow or the day after tomorrow with my Mass-Up collaborators to > address this. > > Regards, > > Hugo. > > El 11/04/18 a las 09:41, David Tabb escribió: >> Hi, Hugo. >> >> Would you be willing to assemble a draft qcML document that might be >> created by your Mass-Up software? We will be looking at such examples >> at our annual HUPO-PSI meeting in a week. I am providing a pointer to a >> draft qcML example of the quality metrics that are produced within the >> QuaMeter "IDFree" mode. >> https://github.com/HUPO-PSI/qcML-development/blob/master/20180403-1091_Pool_start_v0.8.qc.xml >> >> >> Thanks, >> Dave >> >> On 1/20/2017 1:31 PM, Hugo López Fernández wrote: >>> Hello David, >>> >>> I am Hugo, we met last week in Semmering. I hope this email finds you >>> well and that you had a good trip to back home. >>> >>> As we talked in the EuBIC, I am writing you to let you know more about >>> the quality control analysis that we have included in Mass-Up >>> (http://sing-group.org/mass-up/). This quality control is intended to >>> work with peak lists. We would like to incorporate quality control for >>> raw data, specially to detect batch effects as I also commented you. >>> >>> Basically, the quality control (which is explained with most details >>> in the paper http://doi.org/10.1186/s12859-015-0752-4) can be done at >>> two levels: at the replicates leve and at the samples level, which >>> includes additional information from the intra-sample m/z matching >>> process and consensus spectrum creation (this is because our >>> collaborators usually want to reduce replicates spectra to a unique >>> sample "consensus" spectrum). You can find attached the quality >>> control image included in the paper. >>> >>> At the replicates level, the user can check basic information about >>> each individual spectrum (i.e. peak count, m/z range, intensity >>> ranges, etc.) and compare all spectra in the dataset. At the samples >>> level, the user can check the performance of the intra-sample peak >>> matching process, by comparing the percentages of presence (POP) >>> counts (i.e.: the counts of peaks that are present in, for example, >>> 60%, 80% or 100% of replicates) and the POPs of each sample. >>> >>> In spite of being a very simple quality control it allowed us to >>> detect some problems with datasets and we encourage our collaborators >>> to have a quick look at this quality control metrics before any other >>> analysis. Unfortunately they usually don't but we must encourage good >>> practices, which is the reason why I am developing this other software >>> (http://www.sing-group.org/s2p/), also presented in other poster at >>> the EuBIC. Basically it is a software to manage, process and integrate >>> different data sources (Mascot identifications, MALDI plates, 2D-gel >>> spots). It probably will not revolutionize bioinformatics but it is >>> allowing the research group to process data efficiently and in a >>> reproducible way, a totally different scenario than wen I came here >>> six months ago. >>> >>> As I mentioned previously we also would like to include quality >>> control metrics for MALDI-TOF raw data, with special focus in batch >>> effect detection (which seems to a common problem here). Regarding >>> batch effect, I would like to apply this statistic >>> (http://dx.doi.org/10.1093/bioinformatics/btt480) based on guided >>> principal component analysis to detect batch effects in MALDI-TOF data >>> (some people applied it to LC-MS metabolomic data >>> [http://dx.doi.org/10.1016/j.talanta.2014.07.031]). I would like to >>> develop this work this year if I get public MALDI-TOF datasets where >>> batch effect presence has been publicly reported (I found a few >>> reported but I could not get the data to analyze it yet). >>> >>> I will be happy to answer any question you may have or to receive any >>> feedback from you. Looking forward to see you again, in other >>> conference or wherever. >>> >>> Best regards, >>> >>> Hugo. >>> >> >> [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> >> >> >> The integrity and confidentiality of this email is governed by these >> terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> >> Die integriteit en vertroulikheid van hierdie e-pos word deur die >> volgende bepalings gereël. >> Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> >> > [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> |
From: Hugo L. F. <hlf...@uv...> - 2018-04-11 09:54:13
|
Hi David, sure, I would be pleased to provide such draft. Please, let me know the deadline for sending you the document. I will have a meeting tomorrow or the day after tomorrow with my Mass-Up collaborators to address this. Regards, Hugo. El 11/04/18 a las 09:41, David Tabb escribió: > Hi, Hugo. > > Would you be willing to assemble a draft qcML document that might be > created by your Mass-Up software? We will be looking at such examples > at our annual HUPO-PSI meeting in a week. I am providing a pointer to a > draft qcML example of the quality metrics that are produced within the > QuaMeter "IDFree" mode. > https://github.com/HUPO-PSI/qcML-development/blob/master/20180403-1091_Pool_start_v0.8.qc.xml > > > Thanks, > Dave > > On 1/20/2017 1:31 PM, Hugo López Fernández wrote: >> Hello David, >> >> I am Hugo, we met last week in Semmering. I hope this email finds you >> well and that you had a good trip to back home. >> >> As we talked in the EuBIC, I am writing you to let you know more about >> the quality control analysis that we have included in Mass-Up >> (http://sing-group.org/mass-up/). This quality control is intended to >> work with peak lists. We would like to incorporate quality control for >> raw data, specially to detect batch effects as I also commented you. >> >> Basically, the quality control (which is explained with most details >> in the paper http://doi.org/10.1186/s12859-015-0752-4) can be done at >> two levels: at the replicates leve and at the samples level, which >> includes additional information from the intra-sample m/z matching >> process and consensus spectrum creation (this is because our >> collaborators usually want to reduce replicates spectra to a unique >> sample "consensus" spectrum). You can find attached the quality >> control image included in the paper. >> >> At the replicates level, the user can check basic information about >> each individual spectrum (i.e. peak count, m/z range, intensity >> ranges, etc.) and compare all spectra in the dataset. At the samples >> level, the user can check the performance of the intra-sample peak >> matching process, by comparing the percentages of presence (POP) >> counts (i.e.: the counts of peaks that are present in, for example, >> 60%, 80% or 100% of replicates) and the POPs of each sample. >> >> In spite of being a very simple quality control it allowed us to >> detect some problems with datasets and we encourage our collaborators >> to have a quick look at this quality control metrics before any other >> analysis. Unfortunately they usually don't but we must encourage good >> practices, which is the reason why I am developing this other software >> (http://www.sing-group.org/s2p/), also presented in other poster at >> the EuBIC. Basically it is a software to manage, process and integrate >> different data sources (Mascot identifications, MALDI plates, 2D-gel >> spots). It probably will not revolutionize bioinformatics but it is >> allowing the research group to process data efficiently and in a >> reproducible way, a totally different scenario than wen I came here >> six months ago. >> >> As I mentioned previously we also would like to include quality >> control metrics for MALDI-TOF raw data, with special focus in batch >> effect detection (which seems to a common problem here). Regarding >> batch effect, I would like to apply this statistic >> (http://dx.doi.org/10.1093/bioinformatics/btt480) based on guided >> principal component analysis to detect batch effects in MALDI-TOF data >> (some people applied it to LC-MS metabolomic data >> [http://dx.doi.org/10.1016/j.talanta.2014.07.031]). I would like to >> develop this work this year if I get public MALDI-TOF datasets where >> batch effect presence has been publicly reported (I found a few >> reported but I could not get the data to analyze it yet). >> >> I will be happy to answer any question you may have or to receive any >> feedback from you. Looking forward to see you again, in other >> conference or wherever. >> >> Best regards, >> >> Hugo. >> > > [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> > > > The integrity and confidentiality of this email is governed by these > terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> > Die integriteit en vertroulikheid van hierdie e-pos word deur die > volgende bepalings gereël. > Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> > -- --------------------------------------------------------------------------- Hugo López-Fernández, PhD Email: hlf...@uv... Web: http://www.sing-group.org/~hlfernandez/ --------------------------------------------------------------------------- SING Research Group http://www.sing-group.org ESEI: Escuela Superior de Ingeniería Informática "Politécnico" Building, Room 306 "As Lagoas" Campus 32004 - Ourense - Spain --------------------------------------------------------------------------- CINBIO: Centro de Investigaciones Biomédicas http://cinbio.es/en/si4-next-generation-computer-systems-group/ --------------------------------------------------------------------------- IISGS: Instituto de Investigación Sanitaria Galicia Sur http://www.iisgaliciasur.es/sistemas-informaticos-de-nueva-generacion-sing/ --------------------------------------------------------------------------- The information in this e-mail and in any attachments is confidential and intended exclusively for the named addressee(s). Any use of this information not in accordance with its purpose, any dissemination or disclosure, either whole or partial, is prohibited except if formally approved. |
From: David T. <dt...@su...> - 2018-04-11 08:57:00
|
Hi, Hugo. Would you be willing to assemble a draft qcML document that might be created by your Mass-Up software? We will be looking at such examples at our annual HUPO-PSI meeting in a week. I am providing a pointer to a draft qcML example of the quality metrics that are produced within the QuaMeter "IDFree" mode. https://github.com/HUPO-PSI/qcML-development/blob/master/20180403-1091_Pool_start_v0.8.qc.xml Thanks, Dave On 1/20/2017 1:31 PM, Hugo López Fernández wrote: > Hello David, > > I am Hugo, we met last week in Semmering. I hope this email finds you > well and that you had a good trip to back home. > > As we talked in the EuBIC, I am writing you to let you know more about > the quality control analysis that we have included in Mass-Up > (http://sing-group.org/mass-up/). This quality control is intended to > work with peak lists. We would like to incorporate quality control for > raw data, specially to detect batch effects as I also commented you. > > Basically, the quality control (which is explained with most details > in the paper http://doi.org/10.1186/s12859-015-0752-4) can be done at > two levels: at the replicates leve and at the samples level, which > includes additional information from the intra-sample m/z matching > process and consensus spectrum creation (this is because our > collaborators usually want to reduce replicates spectra to a unique > sample "consensus" spectrum). You can find attached the quality > control image included in the paper. > > At the replicates level, the user can check basic information about > each individual spectrum (i.e. peak count, m/z range, intensity > ranges, etc.) and compare all spectra in the dataset. At the samples > level, the user can check the performance of the intra-sample peak > matching process, by comparing the percentages of presence (POP) > counts (i.e.: the counts of peaks that are present in, for example, > 60%, 80% or 100% of replicates) and the POPs of each sample. > > In spite of being a very simple quality control it allowed us to > detect some problems with datasets and we encourage our collaborators > to have a quick look at this quality control metrics before any other > analysis. Unfortunately they usually don't but we must encourage good > practices, which is the reason why I am developing this other software > (http://www.sing-group.org/s2p/), also presented in other poster at > the EuBIC. Basically it is a software to manage, process and integrate > different data sources (Mascot identifications, MALDI plates, 2D-gel > spots). It probably will not revolutionize bioinformatics but it is > allowing the research group to process data efficiently and in a > reproducible way, a totally different scenario than wen I came here > six months ago. > > As I mentioned previously we also would like to include quality > control metrics for MALDI-TOF raw data, with special focus in batch > effect detection (which seems to a common problem here). Regarding > batch effect, I would like to apply this statistic > (http://dx.doi.org/10.1093/bioinformatics/btt480) based on guided > principal component analysis to detect batch effects in MALDI-TOF data > (some people applied it to LC-MS metabolomic data > [http://dx.doi.org/10.1016/j.talanta.2014.07.031]). I would like to > develop this work this year if I get public MALDI-TOF datasets where > batch effect presence has been publicly reported (I found a few > reported but I could not get the data to analyze it yet). > > I will be happy to answer any question you may have or to receive any > feedback from you. Looking forward to see you again, in other > conference or wherever. > > Best regards, > > Hugo. > [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> |
From: Bittremieux W. <wou...@ua...> - 2018-04-10 17:30:29
|
Hi Dave, Sorry for the delayed answer to your questions. As far as I'm aware (please someone correct me if I'm wrong): - OBO and OWL are two alternative file formats to specify controlled vocabularies and ontologies. We use the OBO format for our CV. This is indeed a simpler format than OWL and can be viewed relatively easily in a simple text editor. Alternatively, Martin has previously recommended OBO-Edit<http://oboedit.org/> to visualize the relationships between the various terms. - In our previous discussions we have indeed said that in principle every tool gets its own range of CV accessions. This would enable a new tool to easily start producing compliant qcML files without having to check dependencies on other tools. It might make sense to reuse some trivial definitions though for metrics that don't involve any computations, such the number of MS/MS scans. On the other hand, how will downstream tools handle conflicting metrics coming from different tools? Although in that case maybe the tool authors should be the main persons responsible to worry about this rather than us. In any case, it would be useful to explicitly document how tools can get CV accessions. - To report the unit for a single QC metric you can use the unitAccession/unitName/unitCvRef attributes for any XML element that is derived from the CVParamType, which qualityParameter elements are. The CVParamType is specified in our XML schema<https://github.com/HUPO-PSI/qcML-development/blob/master/schema/v0_0_10/qcML_0_0_10.xsd>. - This XML schema can also be used for simple syntactic validation and some semantic validation. Because we have a mix between XML and JSON unfortunately some external semantic validation will have to be explicitly coded as well though. Any decent XML editor should have built-in functionality to validate XML files against a schema, and otherwise there are various command-line tools and linters you can use to do that as well. However, the previously linked v0.0.10 XML schema on GitHub does not seem to be fully up to date anymore at the moment, and I think the most recent XML schema is available in Mathias' ongoing pull request<https://github.com/HUPO-PSI/qcML-development/pull/37>. - As to metrics numbering I don't think we have a process for that yet. I guess now it's just first-come-first-served. Best, Wout On 09 Apr 2018, at 23:56, David Tabb <dt...@su...<mailto:dt...@su...>> wrote: Hi, all. I have not yet received an answer to the three questions below. I have, however, uploaded it to qcdev, where you can find it at this URL: https://github.com/HUPO-PSI/qcML-development/blob/master/20180403-1091_Pool_start_v0.8.qc.xml. Merci! Dave On 4/5/2018 11:29 AM, David Tabb wrote: Hi, all. At long last, I have completed my "homework" for Heidelberg! I have created a draft XML to represent the qcML output for computing QuaMeter IDFree metrics for a single input mzML file (see text inline below). I would greatly appreciate answers to the following questions to complete this project: 1) If I am reporting only a single value for a metric (such as "XIC-WideFrac"), how do I report the unit for the metric? 2) Do we have an easy way to validate a draft file like this, at least to determine that I have matching end tags for each one I open? I've been using Emacs, which tries to help, but a dedicated XML editor might be preferable. 3) Who officially designates what each metric will be numbered in the CV? Thanks, Dave -------------------------------------------------------- <?xml version="1.0" encoding="UTF-8"?> <qcML xmlns="http://www.prime-xs.eu/ms/qcml" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.prime-xs.eu/ms/qcml file:/home/walzer/psi/qcML-development/schema/v0_0_10/qcML_0_0_10.xsd" version="0.0.10"> <runQuality ID="ID001"> <metaDataParameters ID="fileprovenance001" cvRef="?" accession="?" name="?" description="do we need cv for toplevel" value="all cv attributes are optional anyway"> <InputFiles> <RawFile location="C:\Research\20171124-Lizex-Chia\1091_Pool_start.mzML" id="ID001" name="1091_Pool_start.mzML"> <FileFormat> <cvParam cvRef="PSI-MS" accession="MS:1000584" name="mzML format"/> </FileFormat> <!-- In the following line, I computed the md5sum for the peak-picked mzML, not the RAW! --> <cvParam cvRef="PSI-MS" accession="MS:1000568" name="MD5" value="b583f6d2a91b4749d5a75885330f6e5d" /> <cvParam cvRef="PSI-MS" accession="MS:1000747" name="completion time" value="2017-12-08-T15:38:57Z" /> </RawFile> </InputFiles> </metaDataParameters> <!-- Question to consider: how should I link a metric below to the concept of "liquid chromatography" or "electrospray ionization?" Is this appropriate--> <!-- Units I employ below: --> <!-- UO:0000191 "fraction" --> <!-- UO:0000010 "second" --> <!-- UO:0010006 "ratio" --> <!-- UO:0000189 "count" --> <!-- UO:0000106 "hertz" --> <!-- When a line gives a single metric, where do I indicate the unit type? UO:0000191 "fraction" --> <qualityParameter ID="XIC-WideFrac" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Fraction of precursor ions accounting for the top half of all peak width" value="0.206807"/> <qualityParameter ID="XIC-FWHM" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak widths for the wide XICs"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0000010':[12.5377,14.2244,16.9234]}</content></qualityParameter> <!-- UO:0010006 is "ratio" rather than "log ratio"--> <qualityParameter ID="XIC-Height" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak log ratio heights for the wide XICs"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0010006':[0.776393,0.93114,6.6283]}</content></qualityParameter> <!-- In the following, where do I indicate the unit type? UO:0000010 "second" --> <qualityParameter ID="RT-Duration" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Highest scan time observed minus the lowest scan time observed" value="4920.17"/> <qualityParameter ID="RT-TIC" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of TIC accumulation as fraction of RT-Duration"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="4">{'UO:0000191':[0.301236,0.13286,0.174576,0.391328]}</content></qualityParameter> <qualityParameter ID="RT-MS" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of MS1 acquisition as fraction of RT-Duration"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="4">{'UO:0000191':[0.217794,0.272976,0.275845,0.233385]}</content></qualityParameter> <qualityParameter ID="RT-MSMS" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of MS2 acquisition as fraction of RT-Duration"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="4">{'UO:0000191':[0.268157,0.233516,0.236373,0.261954]}</content></qualityParameter> <qualityParameter ID="MS1-TIC-Change" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of log ratios of MS1 scan-to-scan TIC changes"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0010006':[0.870774,0.900585,4.66521]}</content></qualityParameter> <qualityParameter ID="MS1-TIC" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Log ratios of MS1 scan TICs"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0010006':[0.568866,0.815636,1.18124]}</content></qualityParameter> <!-- In the following, where do I indicate the unit type? UO:0000189 "count" --> <qualityParameter ID="MS1-Count" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Number of MS1 scans acquired" value="7832"/> <!-- In the following, where do I indicate the unit type? UO:0000106 "hertz" --> <qualityParameter ID="MS1-Freq-Max" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Maximum frequency for MS1 scan acquisition" value="2.41814"/> <qualityParameter ID="MS1-Density" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak counts for MS1 scans"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0000189:[693,1205,1424]}</content></qualityParameter> <!-- In the following, where do I indicate the unit type? UO:0000189 "count" --> <qualityParameter ID="MS2-Count" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Number of MS2 scans acquired" value="33495"/> <!-- In the following, where do I indicate the unit type? UO:0000106 "hertz" --> <qualityParameter ID="MS2-Freq-Max" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Maximum frequency for MS2 scan acquisition" value="7.33107"/> <qualityParameter ID="MS2-Density" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak counts for MS2 scans"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0000189:[27,44,70]}</content></qualityParameter> <!-- How do we specify that the values of a vector sum to 1? How about this case, where two vectors together sume to 1?--> <qualityParameter ID="MS2-PrecZ-Known" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Fraction of known precursor charges for +1, +2, ..., n, more than n"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="4">{'UO:0000191':[0,0.15047,0.0687565,0.00877743,0.000507538,0.000268697]}</content></qualityParameter> <!-- For the final metric, I used a reporting type where the number of elements in vector is fixed: QC:3000007--> <qualityParameter ID="MS2-PrecZ-Unknown" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Fraction of unknown precursor charges for +1, more than +1"> <content cvRef="PSI-QC-CV" accession="QC:3000007" value="2">{'UO:0000191':[0.54235,0.22887]}</content></qualityParameter> </runQuality> <cvList> <cv fullName="The HUPO-PSI QC WG metrics ontology in obo format" uri="http://www.github.com/HUPO-PSI/.../" ID="PSI-QC-CV"/> </cvList> </qcML> -------------------------------------------------------- On 4/5/2018 9:47 AM, David Tabb wrote: Hi, all. In creating my hand-crafted example of qcML from QuaMeter IDFree, I've installed Protege, an ontology viewer. It's useful for perusing the HUPO-PSI MS ontology (http://purl.obolibrary.org/obo/ms/4.1.2/ms.owl). I'm a bit confused, though, in how to review the qcML ontology. At present, I can find an OBO to download (https://github.com/HUPO-PSI/qcML-development/raw/master/cv/v0_0_10/qc-cv.obo), but I do not see an OWL equivalent. Helpfully, I can simply review the OBO in a text editor. In particular, I am trying to determine which QC accessions each of my IDFree metrics will represent. Does each metric generator get a particular series of numbers that relate to that software's outputs? Can I reuse a metric accession from other software if my tool generates the same values (such as the number of MS/MS scans)? Jinmeng Jia, will you be able to share your paragraph that meets the draft MIAPE QC standard with Weimin Zhu for him to present at Heidelberg? I was sorry to hear you won't be able to attend yourself. Thanks, Dave [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> ------------------------------------------------------------------------------ Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org<http://slashdot.org>! http://sdm.link/slashdot _______________________________________________ Psidev-qc-dev mailing list Psi...@li...<mailto:Psi...@li...> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> ------------------------------------------------------------------------------ Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org<http://slashdot.org>! http://sdm.link/slashdot _______________________________________________ Psidev-qc-dev mailing list Psi...@li...<mailto:Psi...@li...> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> ------------------------------------------------------------------------------ Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org<http://slashdot.org>! http://sdm.link/slashdot _______________________________________________ Psidev-qc-dev mailing list Psi...@li...<mailto:Psi...@li...> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev |
From: David T. <dt...@su...> - 2018-04-10 06:56:39
|
Hi, all. I have not yet received an answer to the three questions below. I have, however, uploaded it to qcdev, where you can find it at this URL: https://github.com/HUPO-PSI/qcML-development/blob/master/20180403-1091_Pool_start_v0.8.qc.xml. Merci! Dave On 4/5/2018 11:29 AM, David Tabb wrote: > Hi, all. > > At long last, I have completed my "homework" for Heidelberg! I have > created a draft XML to represent the qcML output for computing QuaMeter > IDFree metrics for a single input mzML file (see text inline below). I > would greatly appreciate answers to the following questions to complete > this project: > > 1) If I am reporting only a single value for a metric (such as > "XIC-WideFrac"), how do I report the unit for the metric? > > 2) Do we have an easy way to validate a draft file like this, at least > to determine that I have matching end tags for each one I open? I've > been using Emacs, which tries to help, but a dedicated XML editor might > be preferable. > > 3) Who officially designates what each metric will be numbered in the CV? > > Thanks, > Dave > -------------------------------------------------------- > <?xml version="1.0" encoding="UTF-8"?> > <qcML xmlns="http://www.prime-xs.eu/ms/qcml" > xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" > xsi:schemaLocation="http://www.prime-xs.eu/ms/qcml > file:/home/walzer/psi/qcML-development/schema/v0_0_10/qcML_0_0_10.xsd" > version="0.0.10"> > <runQuality ID="ID001"> > <metaDataParameters ID="fileprovenance001" cvRef="?" > accession="?" name="?" description="do we need cv for toplevel" > value="all cv attributes are optional anyway"> > <InputFiles> > <RawFile > location="C:\Research\20171124-Lizex-Chia\1091_Pool_start.mzML" > id="ID001" name="1091_Pool_start.mzML"> > <FileFormat> > <cvParam cvRef="PSI-MS" accession="MS:1000584" > name="mzML format"/> > </FileFormat> > <!-- In the following line, I computed the md5sum for the > peak-picked mzML, not the RAW! --> > <cvParam cvRef="PSI-MS" accession="MS:1000568" name="MD5" > value="b583f6d2a91b4749d5a75885330f6e5d" /> > <cvParam cvRef="PSI-MS" accession="MS:1000747" > name="completion time" value="2017-12-08-T15:38:57Z" /> > </RawFile> > </InputFiles> > </metaDataParameters> > <!-- Question to consider: how should I link a metric below to the > concept of "liquid chromatography" or "electrospray ionization?" Is > this appropriate--> > <!-- Units I employ below: --> > <!-- UO:0000191 "fraction" --> > <!-- UO:0000010 "second" --> > <!-- UO:0010006 "ratio" --> > <!-- UO:0000189 "count" --> > <!-- UO:0000106 "hertz" --> > <!-- When a line gives a single metric, where do I indicate the > unit type? UO:0000191 "fraction" --> > <qualityParameter ID="XIC-WideFrac" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Fraction of precursor ions > accounting for the top half of all peak width" value="0.206807"/> > <qualityParameter ID="XIC-FWHM" cvRef="PSI-QC-CV" accession="QC:" > name="QuaMeter IDFree Metric- Distribution of peak widths for the wide > XICs"> > <content cvRef="PSI-QC-CV" accession="QC:3000008" > value="3">{'UO:0000010':[12.5377,14.2244,16.9234]}</content></qualityParameter> > > <!-- UO:0010006 is "ratio" rather than "log ratio"--> > <qualityParameter ID="XIC-Height" cvRef="PSI-QC-CV" accession="QC:" > name="QuaMeter IDFree Metric- Distribution of peak log ratio heights for > the wide XICs"> > <content cvRef="PSI-QC-CV" accession="QC:3000008" > value="3">{'UO:0010006':[0.776393,0.93114,6.6283]}</content></qualityParameter> > > <!-- In the following, where do I indicate the unit type? > UO:0000010 "second" --> > <qualityParameter ID="RT-Duration" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Highest scan time observed > minus the lowest scan time observed" value="4920.17"/> > <qualityParameter ID="RT-TIC" cvRef="PSI-QC-CV" accession="QC:" > name="QuaMeter IDFree Metric- Distribution of TIC accumulation as > fraction of RT-Duration"> > <content cvRef="PSI-QC-CV" accession="QC:3000008" > value="4">{'UO:0000191':[0.301236,0.13286,0.174576,0.391328]}</content></qualityParameter> > > <qualityParameter ID="RT-MS" cvRef="PSI-QC-CV" accession="QC:" > name="QuaMeter IDFree Metric- Distribution of MS1 acquisition as > fraction of RT-Duration"> > <content cvRef="PSI-QC-CV" accession="QC:3000008" > value="4">{'UO:0000191':[0.217794,0.272976,0.275845,0.233385]}</content></qualityParameter> > > <qualityParameter ID="RT-MSMS" cvRef="PSI-QC-CV" accession="QC:" > name="QuaMeter IDFree Metric- Distribution of MS2 acquisition as > fraction of RT-Duration"> > <content cvRef="PSI-QC-CV" accession="QC:3000008" > value="4">{'UO:0000191':[0.268157,0.233516,0.236373,0.261954]}</content></qualityParameter> > > <qualityParameter ID="MS1-TIC-Change" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Distribution of log ratios > of MS1 scan-to-scan TIC changes"> > <content cvRef="PSI-QC-CV" accession="QC:3000008" > value="3">{'UO:0010006':[0.870774,0.900585,4.66521]}</content></qualityParameter> > > <qualityParameter ID="MS1-TIC" cvRef="PSI-QC-CV" accession="QC:" > name="QuaMeter IDFree Metric- Log ratios of MS1 scan TICs"> > <content cvRef="PSI-QC-CV" accession="QC:3000008" > value="3">{'UO:0010006':[0.568866,0.815636,1.18124]}</content></qualityParameter> > > <!-- In the following, where do I indicate the unit type? > UO:0000189 "count" --> > <qualityParameter ID="MS1-Count" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Number of MS1 scans > acquired" value="7832"/> > <!-- In the following, where do I indicate the unit type? > UO:0000106 "hertz" --> > <qualityParameter ID="MS1-Freq-Max" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Maximum frequency for MS1 > scan acquisition" value="2.41814"/> > <qualityParameter ID="MS1-Density" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak > counts for MS1 scans"> > <content cvRef="PSI-QC-CV" accession="QC:3000008" > value="3">{'UO:0000189:[693,1205,1424]}</content></qualityParameter> > <!-- In the following, where do I indicate the unit type? > UO:0000189 "count" --> > <qualityParameter ID="MS2-Count" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Number of MS2 scans > acquired" value="33495"/> > <!-- In the following, where do I indicate the unit type? > UO:0000106 "hertz" --> > <qualityParameter ID="MS2-Freq-Max" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Maximum frequency for MS2 > scan acquisition" value="7.33107"/> > <qualityParameter ID="MS2-Density" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak > counts for MS2 scans"> > <content cvRef="PSI-QC-CV" accession="QC:3000008" > value="3">{'UO:0000189:[27,44,70]}</content></qualityParameter> > <!-- How do we specify that the values of a vector sum to 1? How > about this case, where two vectors together sume to 1?--> > <qualityParameter ID="MS2-PrecZ-Known" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Fraction of known > precursor charges for +1, +2, ..., n, more than n"> > <content cvRef="PSI-QC-CV" accession="QC:3000008" > value="4">{'UO:0000191':[0,0.15047,0.0687565,0.00877743,0.000507538,0.000268697]}</content></qualityParameter> > > <!-- For the final metric, I used a reporting type where the number > of elements in vector is fixed: QC:3000007--> > <qualityParameter ID="MS2-PrecZ-Unknown" cvRef="PSI-QC-CV" > accession="QC:" name="QuaMeter IDFree Metric- Fraction of unknown > precursor charges for +1, more than +1"> > <content cvRef="PSI-QC-CV" accession="QC:3000007" > value="2">{'UO:0000191':[0.54235,0.22887]}</content></qualityParameter> > </runQuality> > <cvList> > <cv fullName="The HUPO-PSI QC WG metrics ontology in obo > format" uri="http://www.github.com/HUPO-PSI/.../" ID="PSI-QC-CV"/> > </cvList> > </qcML> > -------------------------------------------------------- > > On 4/5/2018 9:47 AM, David Tabb wrote: >> Hi, all. >> >> In creating my hand-crafted example of qcML from QuaMeter IDFree, I've >> installed Protege, an ontology viewer. It's useful for perusing the >> HUPO-PSI MS ontology (http://purl.obolibrary.org/obo/ms/4.1.2/ms.owl). >> >> I'm a bit confused, though, in how to review the qcML ontology. At >> present, I can find an OBO to download >> (https://github.com/HUPO-PSI/qcML-development/raw/master/cv/v0_0_10/qc-cv.obo), >> >> >> but I do not see an OWL equivalent. Helpfully, I can simply review the >> OBO in a text editor. >> >> In particular, I am trying to determine which QC accessions each of my >> IDFree metrics will represent. Does each metric generator get a >> particular series of numbers that relate to that software's outputs? >> Can I reuse a metric accession from other software if my tool generates >> the same values (such as the number of MS/MS scans)? >> >> Jinmeng Jia, will you be able to share your paragraph that meets the >> draft MIAPE QC standard with Weimin Zhu for him to present at >> Heidelberg? I was sorry to hear you won't be able to attend yourself. >> >> Thanks, >> Dave >> >> [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> >> >> >> >> The integrity and confidentiality of this email is governed by these >> terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> >> Die integriteit en vertroulikheid van hierdie e-pos word deur die >> volgende bepalings gereël. >> Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> >> >> ------------------------------------------------------------------------------ >> >> >> Check out the vibrant tech community on one of the world's most >> engaging tech sites, Slashdot.org! http://sdm.link/slashdot >> _______________________________________________ >> Psidev-qc-dev mailing list >> Psi...@li... >> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev > > > [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> > > > The integrity and confidentiality of this email is governed by these > terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> > Die integriteit en vertroulikheid van hierdie e-pos word deur die > volgende bepalings gereël. > Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> > > ------------------------------------------------------------------------------ > > Check out the vibrant tech community on one of the world's most > engaging tech sites, Slashdot.org! http://sdm.link/slashdot > _______________________________________________ > Psidev-qc-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> |
From: Jinmeng J. <jia...@16...> - 2018-04-06 17:40:36
|
Hi, David and all. Sure. I have shared the MIAPE QC paragraph to Weimin and we have arranged a short meeting in Shanghai before he go to Heidelberg to make sure everything is prepared. I am sorry for not being able to attend the spring meeting. Best, Jinmeng At 2018-04-05 15:47:24, "David Tabb" <dt...@su...> wrote: >Hi, all. > >In creating my hand-crafted example of qcML from QuaMeter IDFree, I've >installed Protege, an ontology viewer. It's useful for perusing the >HUPO-PSI MS ontology (http://purl.obolibrary.org/obo/ms/4.1.2/ms.owl). > >I'm a bit confused, though, in how to review the qcML ontology. At >present, I can find an OBO to download >(https://github.com/HUPO-PSI/qcML-development/raw/master/cv/v0_0_10/qc-cv.obo), >but I do not see an OWL equivalent. Helpfully, I can simply review the >OBO in a text editor. > >In particular, I am trying to determine which QC accessions each of my >IDFree metrics will represent. Does each metric generator get a >particular series of numbers that relate to that software's outputs? >Can I reuse a metric accession from other software if my tool generates >the same values (such as the number of MS/MS scans)? > >Jinmeng Jia, will you be able to share your paragraph that meets the >draft MIAPE QC standard with Weimin Zhu for him to present at >Heidelberg? I was sorry to hear you won't be able to attend yourself. > >Thanks, >Dave > >[http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> > >The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> >Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> > >------------------------------------------------------------------------------ >Check out the vibrant tech community on one of the world's most >engaging tech sites, Slashdot.org! http://sdm.link/slashdot >_______________________________________________ >Psidev-qc-dev mailing list >Psi...@li... >https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev -- Jinmeng, Jia Ph.D. candidate,Tieliu Shi's Lab Center for Bioinformatics and Computational Biology Shanghai Key Laboratory of Regulatory Biology Institute of Biomedical Sciences and School of Life Sciences East China Normal University Minhang Campus: 500 Dongchuan RD., Shanghai 200241 |
From: David T. <dt...@su...> - 2018-04-05 09:29:33
|
Hi, all. At long last, I have completed my "homework" for Heidelberg! I have created a draft XML to represent the qcML output for computing QuaMeter IDFree metrics for a single input mzML file (see text inline below). I would greatly appreciate answers to the following questions to complete this project: 1) If I am reporting only a single value for a metric (such as "XIC-WideFrac"), how do I report the unit for the metric? 2) Do we have an easy way to validate a draft file like this, at least to determine that I have matching end tags for each one I open? I've been using Emacs, which tries to help, but a dedicated XML editor might be preferable. 3) Who officially designates what each metric will be numbered in the CV? Thanks, Dave -------------------------------------------------------- <?xml version="1.0" encoding="UTF-8"?> <qcML xmlns="http://www.prime-xs.eu/ms/qcml" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.prime-xs.eu/ms/qcml file:/home/walzer/psi/qcML-development/schema/v0_0_10/qcML_0_0_10.xsd" version="0.0.10"> <runQuality ID="ID001"> <metaDataParameters ID="fileprovenance001" cvRef="?" accession="?" name="?" description="do we need cv for toplevel" value="all cv attributes are optional anyway"> <InputFiles> <RawFile location="C:\Research\20171124-Lizex-Chia\1091_Pool_start.mzML" id="ID001" name="1091_Pool_start.mzML"> <FileFormat> <cvParam cvRef="PSI-MS" accession="MS:1000584" name="mzML format"/> </FileFormat> <!-- In the following line, I computed the md5sum for the peak-picked mzML, not the RAW! --> <cvParam cvRef="PSI-MS" accession="MS:1000568" name="MD5" value="b583f6d2a91b4749d5a75885330f6e5d" /> <cvParam cvRef="PSI-MS" accession="MS:1000747" name="completion time" value="2017-12-08-T15:38:57Z" /> </RawFile> </InputFiles> </metaDataParameters> <!-- Question to consider: how should I link a metric below to the concept of "liquid chromatography" or "electrospray ionization?" Is this appropriate--> <!-- Units I employ below: --> <!-- UO:0000191 "fraction" --> <!-- UO:0000010 "second" --> <!-- UO:0010006 "ratio" --> <!-- UO:0000189 "count" --> <!-- UO:0000106 "hertz" --> <!-- When a line gives a single metric, where do I indicate the unit type? UO:0000191 "fraction" --> <qualityParameter ID="XIC-WideFrac" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Fraction of precursor ions accounting for the top half of all peak width" value="0.206807"/> <qualityParameter ID="XIC-FWHM" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak widths for the wide XICs"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0000010':[12.5377,14.2244,16.9234]}</content></qualityParameter> <!-- UO:0010006 is "ratio" rather than "log ratio"--> <qualityParameter ID="XIC-Height" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak log ratio heights for the wide XICs"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0010006':[0.776393,0.93114,6.6283]}</content></qualityParameter> <!-- In the following, where do I indicate the unit type? UO:0000010 "second" --> <qualityParameter ID="RT-Duration" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Highest scan time observed minus the lowest scan time observed" value="4920.17"/> <qualityParameter ID="RT-TIC" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of TIC accumulation as fraction of RT-Duration"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="4">{'UO:0000191':[0.301236,0.13286,0.174576,0.391328]}</content></qualityParameter> <qualityParameter ID="RT-MS" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of MS1 acquisition as fraction of RT-Duration"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="4">{'UO:0000191':[0.217794,0.272976,0.275845,0.233385]}</content></qualityParameter> <qualityParameter ID="RT-MSMS" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of MS2 acquisition as fraction of RT-Duration"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="4">{'UO:0000191':[0.268157,0.233516,0.236373,0.261954]}</content></qualityParameter> <qualityParameter ID="MS1-TIC-Change" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of log ratios of MS1 scan-to-scan TIC changes"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0010006':[0.870774,0.900585,4.66521]}</content></qualityParameter> <qualityParameter ID="MS1-TIC" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Log ratios of MS1 scan TICs"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0010006':[0.568866,0.815636,1.18124]}</content></qualityParameter> <!-- In the following, where do I indicate the unit type? UO:0000189 "count" --> <qualityParameter ID="MS1-Count" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Number of MS1 scans acquired" value="7832"/> <!-- In the following, where do I indicate the unit type? UO:0000106 "hertz" --> <qualityParameter ID="MS1-Freq-Max" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Maximum frequency for MS1 scan acquisition" value="2.41814"/> <qualityParameter ID="MS1-Density" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak counts for MS1 scans"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0000189:[693,1205,1424]}</content></qualityParameter> <!-- In the following, where do I indicate the unit type? UO:0000189 "count" --> <qualityParameter ID="MS2-Count" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Number of MS2 scans acquired" value="33495"/> <!-- In the following, where do I indicate the unit type? UO:0000106 "hertz" --> <qualityParameter ID="MS2-Freq-Max" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Maximum frequency for MS2 scan acquisition" value="7.33107"/> <qualityParameter ID="MS2-Density" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Distribution of peak counts for MS2 scans"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="3">{'UO:0000189:[27,44,70]}</content></qualityParameter> <!-- How do we specify that the values of a vector sum to 1? How about this case, where two vectors together sume to 1?--> <qualityParameter ID="MS2-PrecZ-Known" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Fraction of known precursor charges for +1, +2, ..., n, more than n"> <content cvRef="PSI-QC-CV" accession="QC:3000008" value="4">{'UO:0000191':[0,0.15047,0.0687565,0.00877743,0.000507538,0.000268697]}</content></qualityParameter> <!-- For the final metric, I used a reporting type where the number of elements in vector is fixed: QC:3000007--> <qualityParameter ID="MS2-PrecZ-Unknown" cvRef="PSI-QC-CV" accession="QC:" name="QuaMeter IDFree Metric- Fraction of unknown precursor charges for +1, more than +1"> <content cvRef="PSI-QC-CV" accession="QC:3000007" value="2">{'UO:0000191':[0.54235,0.22887]}</content></qualityParameter> </runQuality> <cvList> <cv fullName="The HUPO-PSI QC WG metrics ontology in obo format" uri="http://www.github.com/HUPO-PSI/.../" ID="PSI-QC-CV"/> </cvList> </qcML> -------------------------------------------------------- On 4/5/2018 9:47 AM, David Tabb wrote: > Hi, all. > > In creating my hand-crafted example of qcML from QuaMeter IDFree, I've > installed Protege, an ontology viewer. It's useful for perusing the > HUPO-PSI MS ontology (http://purl.obolibrary.org/obo/ms/4.1.2/ms.owl). > > I'm a bit confused, though, in how to review the qcML ontology. At > present, I can find an OBO to download > (https://github.com/HUPO-PSI/qcML-development/raw/master/cv/v0_0_10/qc-cv.obo), > > but I do not see an OWL equivalent. Helpfully, I can simply review the > OBO in a text editor. > > In particular, I am trying to determine which QC accessions each of my > IDFree metrics will represent. Does each metric generator get a > particular series of numbers that relate to that software's outputs? > Can I reuse a metric accession from other software if my tool generates > the same values (such as the number of MS/MS scans)? > > Jinmeng Jia, will you be able to share your paragraph that meets the > draft MIAPE QC standard with Weimin Zhu for him to present at > Heidelberg? I was sorry to hear you won't be able to attend yourself. > > Thanks, > Dave > > [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> > > > The integrity and confidentiality of this email is governed by these > terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> > Die integriteit en vertroulikheid van hierdie e-pos word deur die > volgende bepalings gereël. > Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> > > ------------------------------------------------------------------------------ > > Check out the vibrant tech community on one of the world's most > engaging tech sites, Slashdot.org! http://sdm.link/slashdot > _______________________________________________ > Psidev-qc-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> |
From: David T. <dt...@su...> - 2018-04-05 08:22:10
|
Hi, all. In creating my hand-crafted example of qcML from QuaMeter IDFree, I've installed Protege, an ontology viewer. It's useful for perusing the HUPO-PSI MS ontology (http://purl.obolibrary.org/obo/ms/4.1.2/ms.owl). I'm a bit confused, though, in how to review the qcML ontology. At present, I can find an OBO to download (https://github.com/HUPO-PSI/qcML-development/raw/master/cv/v0_0_10/qc-cv.obo), but I do not see an OWL equivalent. Helpfully, I can simply review the OBO in a text editor. In particular, I am trying to determine which QC accessions each of my IDFree metrics will represent. Does each metric generator get a particular series of numbers that relate to that software's outputs? Can I reuse a metric accession from other software if my tool generates the same values (such as the number of MS/MS scans)? Jinmeng Jia, will you be able to share your paragraph that meets the draft MIAPE QC standard with Weimin Zhu for him to present at Heidelberg? I was sorry to hear you won't be able to attend yourself. Thanks, Dave [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> |
From: Walzer <wa...@eb...> - 2018-03-15 12:20:21
|
Hi all, @dave sure. Do you have specific questions, hard to understand parts, or interests? best, mths On 15/03/18 08:19, Tabb, David, Prof <dt...@su...> wrote: > Hi, all. > > I definitely look forward to our conversation today! > > Mathias, could you present whatever you have for a current mock-up of qcML? I am trying to do my own homework for the Heidelberg meeting (show what a qcML for QuaMeter IDFree might look like), but my familiarity with the schema is pretty limited! > > Jinmeng Jia has let me know that she'll be presenting a paragraph at Heidelberg that might be found in a manuscript to satisfy a MIAPE-QC rule; she and Weimin will both be attending, happily! > > Thanks, > Dave > > On 3/12/2018 6:40 PM, Wout Bittremieux wrote: > Dear colleagues, > > This is a reminder that our next teleconference is scheduled for Thursday, March 15th, at 15h00 GMT (16h00 Western Europe, 17h00 Cape Town, 8h00 Seattle (daylight savings time!), 23h00 Beijing). > > You can connect to our teleconference on Google Hangouts through the following link: https://hangouts.google.com/group/taNzRy6ELMDK9wbk2 > [https://www.gstatic.com/images/icons/material/product/2x/hangouts_64dp.png]<https://hangouts.google.com/group/taNzRy6ELMDK9wbk2> > > Google Hangouts<https://hangouts.google.com/group/taNzRy6ELMDK9wbk2> > hangouts.google.com > Click to join the conversation. > > > > > I hope to talk to all of you then so we can prepare for the PSI spring meeting in Heidelberg next month. > > Thanks, > Wout > > > > ------------------------------------------------------------------------------ > Check out the vibrant tech community on one of the world's most > engaging tech sites, Slashdot.org! http://sdm.link/slashdot > > > > _______________________________________________ > Psidev-qc-dev mailing list > Psi...@li...<mailto:Psi...@li...> > https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev > > > > [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> > > The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> > Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> > > > > ------------------------------------------------------------------------------ > Check out the vibrant tech community on one of the world's most > engaging tech sites, Slashdot.org! http://sdm.link/slashdot > > > _______________________________________________ > Psidev-qc-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev -- Mathias Walzer European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus, Hinxton, Cambridge, UK Office: +44 (0)1223 494 2610 E-mail: wa...@eb... |
From: Tabb, D. P. <dt...@su...> - 2018-03-15 08:19:18
|
Hi, all. I definitely look forward to our conversation today! Mathias, could you present whatever you have for a current mock-up of qcML? I am trying to do my own homework for the Heidelberg meeting (show what a qcML for QuaMeter IDFree might look like), but my familiarity with the schema is pretty limited! Jinmeng Jia has let me know that she'll be presenting a paragraph at Heidelberg that might be found in a manuscript to satisfy a MIAPE-QC rule; she and Weimin will both be attending, happily! Thanks, Dave On 3/12/2018 6:40 PM, Wout Bittremieux wrote: Dear colleagues, This is a reminder that our next teleconference is scheduled for Thursday, March 15th, at 15h00 GMT (16h00 Western Europe, 17h00 Cape Town, 8h00 Seattle (daylight savings time!), 23h00 Beijing). You can connect to our teleconference on Google Hangouts through the following link: https://hangouts.google.com/group/taNzRy6ELMDK9wbk2 [https://www.gstatic.com/images/icons/material/product/2x/hangouts_64dp.png]<https://hangouts.google.com/group/taNzRy6ELMDK9wbk2> Google Hangouts<https://hangouts.google.com/group/taNzRy6ELMDK9wbk2> hangouts.google.com Click to join the conversation. I hope to talk to all of you then so we can prepare for the PSI spring meeting in Heidelberg next month. Thanks, Wout ------------------------------------------------------------------------------ Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org! http://sdm.link/slashdot _______________________________________________ Psidev-qc-dev mailing list Psi...@li...<mailto:Psi...@li...> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev [http://cdn.sun.ac.za/100/ProductionFooter.jpg]<http://www.sun.ac.za/english/Pages/Water-crisis.aspx> The integrity and confidentiality of this email is governed by these terms. Disclaimer<http://www.sun.ac.za/emaildisclaimer> Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. Vrywaringsklousule<http://www.sun.ac.za/emaildisclaimer> |
From: Wout B. <wo...@uw...> - 2018-03-12 16:42:02
|
Dear colleagues, This is a reminder that our next teleconference is scheduled for Thursday, March 15th, at 15h00 GMT (16h00 Western Europe, 17h00 Cape Town, 8h00 Seattle (daylight savings time!), 23h00 Beijing). You can connect to our teleconference on Google Hangouts through the following link: https://hangouts.google.com/group/taNzRy6ELMDK9wbk2 I hope to talk to all of you then so we can prepare for the PSI spring meeting in Heidelberg next month. Thanks, Wout |
From: Tenzer, S. <te...@un...> - 2018-02-16 07:30:56
|
Hi everyone, my sincerest apologies for not attending yesterday - I had dentists appointment which did not work out as well as planned - I was kind of knocked out afterwards. Hope to talk to you soon! Best wishes, Stefan Am 16.02.2018 um 02:05 schrieb Wout Bittremieux <wo...@uw...>: > Dear colleagues, > > Please find the meeting notes of today's teleconference here: https://github.com/HUPO-PSI/qcML-development/blob/master/meeting_notes/20180215_telco.md > > I want to especially highlight that registrations for this year's PSI spring meeting are now open: http://psidev.info/content/hupo-psi-meeting-2018 > I hope to see all of you there so we can make important progress towards a qcML version 1.0. > > Dave will soon send some more information to the mailing list so we can optimally prepare for the PSI meeting. > > Best, > Wout > >> On 12 Feb 2018, at 19:24, Wout Bittremieux <wo...@uw...> wrote: >> >> Dear colleagues, >> >> This is a reminder that our next teleconference is scheduled for Thursday, February 15th, at 15h00 GMT (16h00 Western Europe, 17h00 Cape Town, 7h00 Seattle, 23h00 Beijing). >> >> You can connect to our teleconference on Google Hangouts through the following link: https://hangouts.google.com/group/taNzRy6ELMDK9wbk2 >> >> As I couldn't attend last month's teleconference I don't know what was discussed then and what should be on the agenda this week. Please reply to this email to let us know if you have something specific to be discussed. >> >> Thanks, >> Wout >> >> ------------------------------------------------------------------------------ >> Check out the vibrant tech community on one of the world's most >> engaging tech sites, Slashdot.org! http://sdm.link/slashdot_______________________________________________ >> Psidev-qc-dev mailing list >> Psi...@li... >> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev > > > ------------------------------------------------------------------------------ > Check out the vibrant tech community on one of the world's most > engaging tech sites, Slashdot.org! http://sdm.link/slashdot > _______________________________________________ > Psidev-qc-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev |
From: Wout B. <wo...@uw...> - 2018-02-16 01:08:12
|
Dear colleagues, Please find the meeting notes of today's teleconference here: https://github.com/HUPO-PSI/qcML-development/blob/master/meeting_notes/20180215_telco.md I want to especially highlight that registrations for this year's PSI spring meeting are now open: http://psidev.info/content/hupo-psi-meeting-2018 I hope to see all of you there so we can make important progress towards a qcML version 1.0. Dave will soon send some more information to the mailing list so we can optimally prepare for the PSI meeting. Best, Wout > On 12 Feb 2018, at 19:24, Wout Bittremieux <wo...@uw...> wrote: > > Dear colleagues, > > This is a reminder that our next teleconference is scheduled for Thursday, February 15th, at 15h00 GMT (16h00 Western Europe, 17h00 Cape Town, 7h00 Seattle, 23h00 Beijing). > > You can connect to our teleconference on Google Hangouts through the following link: https://hangouts.google.com/group/taNzRy6ELMDK9wbk2 > > As I couldn't attend last month's teleconference I don't know what was discussed then and what should be on the agenda this week. Please reply to this email to let us know if you have something specific to be discussed. > > Thanks, > Wout > > ------------------------------------------------------------------------------ > Check out the vibrant tech community on one of the world's most > engaging tech sites, Slashdot.org! http://sdm.link/slashdot_______________________________________________ > Psidev-qc-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev |
From: Tabb, D. P. <dt...@su...> - 2018-02-15 23:46:13
|
Agenda items for our teleconference today: * Spring Meeting at Heidelberg: who is attending? * Topics listed for discussion at HUPO-PSI: Formalize suggested CV updates, Formulate additions to CV, Develop examples of CV terms, Review updated MIAPE-QC, Special Topic: Implementation in repositories, Special Topic: Current status of implementations Thanks! Dave On 2/13/2018 5:24 AM, Wout Bittremieux wrote: Dear colleagues, This is a reminder that our next teleconference is scheduled for Thursday, February 15th, at 15h00 GMT (16h00 Western Europe, 17h00 Cape Town, 7h00 Seattle, 23h00 Beijing). You can connect to our teleconference on Google Hangouts through the following link: https://hangouts.google.com/group/taNzRy6ELMDK9wbk2 [https://www.gstatic.com/images/icons/material/product/2x/hangouts_64dp.png]<https://hangouts.google.com/group/taNzRy6ELMDK9wbk2> Google Hangouts<https://hangouts.google.com/group/taNzRy6ELMDK9wbk2> hangouts.google.com Click to join the conversation. As I couldn't attend last month's teleconference I don't know what was discussed then and what should be on the agenda this week. Please reply to this email to let us know if you have something specific to be discussed. Thanks, Wout ------------------------------------------------------------------------------ Check out the vibrant tech community on one of the world's most engaging tech sites, Slashdot.org! http://sdm.link/slashdot _______________________________________________ Psidev-qc-dev mailing list Psi...@li...<mailto:Psi...@li...> https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev The integrity and confidentiality of this email is governed by these terms / Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. http://www.sun.ac.za/emaildisclaimer |
From: Tabb, D. P. <dt...@su...> - 2018-02-15 23:19:44
|
Hi, team! We had a compact call that touched several interesting topics today. I'll let Wout tell the whole story in the minutes, but for now I want to make a couple of requests in connection with the April 18-20 meeting in Heidelberg (draft agenda here: <https://docs.google.com/spreadsheets/d/14uyeHF9d3_E4K9Fsh5mLHwL5JUS2q49plNspwelGQ4A/view> https://docs.google.com/spreadsheets/d/14uyeHF9d3_E4K9Fsh5mLHwL5JUS2q49plNspwelGQ4A/view<https://docs.google.com/spreadsheets/d/14uyeHF9d3_E4K9Fsh5mLHwL5JUS2q49plNspwelGQ4A/view#gid=1021763599> ): * First, if you haven't registered and plan to go, please go ahead to register so we have a good headcount. * Second, during the latter part of Thursday morning (Apr. 19), we will be looking at the CV with an eye to what is missing, using the perspective of the metric generator, the researcher trying to publish, or the database trying to ingest a qcML object. * Third, if you are a quality metric generator, you should be prepared to work up (or just present) some examples of how to express those metrics in XML during the afternoon session on April 19th. * Fourth, Jinmeng, you get your very own session of two hours in the late afternoon of April 19th. I would greatly appreciate it if you could write an example of a (long?) paragraph that we would like to see in a peer-reviewed manuscript that is an example of meeting the draft MIAPE-QC standard. If meeting the MIAPE-QC standard cannot be accomplished in a long paragraph, I suspect we will need to remove some requirements to make it more practical! (I am displaying my biases about MIAPE, and that is not really fair to you.) * Our session on repositories on the morning of April 20 is intended as a brainstorming session. Do we want PRIDE to have a webservice that will automatically generate a qcML document when given an accession? In brief, what does qcML have to do with repositories? Mathias, in connection with this, could you explain why embedding JSON in XML makes more sense than just writing JSON outright? Thank you for your continued interest in our qcML endeavor! Dave The integrity and confidentiality of this email is governed by these terms / Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. http://www.sun.ac.za/emaildisclaimer |
From: Tabb, D. P. <dt...@su...> - 2018-02-13 17:55:25
|
Hi, folks. The steering committee is looking over the roster of people registered for this year's HUPO-PSI in Heidelberg, and we have a surprising number of people who haven't signed up as attending! http://www.psidev.info/content/hupo-psi-meeting-2018 Please go ahead to register for the meeting; there's no cost, and the web page responds very quickly. Thanks, Dave The integrity and confidentiality of this email is governed by these terms / Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. http://www.sun.ac.za/emaildisclaimer |
From: Wout B. <wo...@uw...> - 2018-02-13 03:37:50
|
Dear colleagues, This is a reminder that our next teleconference is scheduled for Thursday, February 15th, at 15h00 GMT (16h00 Western Europe, 17h00 Cape Town, 7h00 Seattle, 23h00 Beijing). You can connect to our teleconference on Google Hangouts through the following link: https://hangouts.google.com/group/taNzRy6ELMDK9wbk2 As I couldn't attend last month's teleconference I don't know what was discussed then and what should be on the agenda this week. Please reply to this email to let us know if you have something specific to be discussed. Thanks, Wout |
From: Walzer <wa...@eb...> - 2018-02-01 11:21:16
|
FYI -------- Forwarded Message -------- Subject: [Psi-announce] PSI 2018 Spring meeting Date: Wed, 31 Jan 2018 16:40:17 +0000 From: Jones, Andy <And...@li...> To: 'psi...@eb...' <psi...@eb...> CC: 'psi...@li...' <psi...@li...>, 'psi...@li...' <psi...@li...> Dear colleagues, We are pleased to announce that the PSI 2018 Spring meeting is now open for registration (http://www.psidev.info/content/hupo-psi-meeting-2018). PSI2018 will take place April 18-20th 2018 at EMBL, Heidelberg in Germany. The meeting is free of charge for participants (including lunches, refreshments and a workshop dinner). The main topics for discussion will working on the new spectral library data format, the Universal spectrum ID, PEFF (extended fasta format) and PROXI (web services API). There will be a track focussing on metabolomics (mzTab format), further work on quality control (qcML) and developments in the molecular interactions group including implementations of MI XML 3.0, CausalTab format, and identification of novel interactions in interactomes. The PSI always welcomes new participants with an interest in getting involved in proteomics data standards. Please feel free to register and come along to see what we are doing, or forward this message on to other colleagues who may be interested. http://www.psidev.info/content/hupo-psi-meeting-2018 Best wishes Andy Jones |
From: David T. <dt...@su...> - 2018-01-23 05:37:28
|
Hi, all. During our last call, I committed to testing the QuaMeter IDFree mode on a data set from metabolomics rather than proteomics. Last night I downloaded MTBLS548 from the Metabolights repository (https://www.ebi.ac.uk/metabolights/MTBLS548). It's a UPLC/Waters Xevo G2 QTOF data set, using only MS; it contains zero MS/MS scans. The total archive is around a GB in size. I found that the "whole study" download links at the top of "Study Files" didn't work. I looked at the corresponding FTP server, but I saw no directory for this number. Instead I selected all the files and then used the "Download selected files" button at the bottom. With my new 20 Mbps fibre connection at home, the whole download arrived in minutes! I decided to translate the raw data to mzML format, centroiding peaks with the "CantWaiT" peaklister in Proteowizard: "c:\Program Files\ProteoWizard\ProteoWizard 3.0.11738\msconvert.exe" --filter "peakPicking cwt snr=1.0 peakSpace=0.1" --filter "turbocharger minCharge=1 maxCharge=4" -z *.raw Happily, that process ran rather quickly! >From there, I could run QuaMeter against the mzMLs pretty easily. I used a quameter.cfg that contains these lines: ChromatogramMzLowerOffset = "0.05mz" ChromatogramMzUpperOffset = "0.05mz" Instrument = "orbi" The command line for running QuaMeter itself looked like this: "c:\Program Files\Bumbershoot\bumbershoot-bin-windows-vc120-release-3_0_11738\quameter.exe" -MetricsType idfree --cpus 1 OutputFilepath metrics.tsv *.mzML Happily, the code ran without any apparent errors, even though it was designed to work for files containing both MS and MS/MS scans. I am attaching the output file: metrics.tsv. The following columns were zeros or undefined values: XIC-WideFrac XIC-FWHM-Q1 XIC-FWHM-Q2 XIC-FWHM-Q3 XIC-Height-Q2 XIC-Height-Q3 XIC-Height-Q4 RT-MSMS-Q1 RT-MSMS-Q2 RT-MSMS-Q3 RT-MSMS-Q4 MS2-Count MS2-Freq-Max MS2-Density-Q1 MS2-Density-Q2 MS2-Density-Q3 MS2-PrecZ-1 MS2-PrecZ-2 MS2-PrecZ-3 MS2-PrecZ-4 MS2-PrecZ-5 MS2-PrecZ-more MS2-PrecZ-likely-1 MS2-PrecZ-likely-multi Obviously an MS2 or MSMS metric cannot be computed when none of these scans are collected. Similarly, the precursor charge metrics are undefined when no ions were selected as precursors! The "XIC" metrics concern the peak widths of ions selected as precursors, so we lost those as well. At present, QuaMeter makes no effort to characterize peak width or intensity for peaks that do not serve as precursor ions. So is any of this useful? Yes, I think so. I am attaching a graph out of Excel: RT-TIC.gif. The image composites four metrics: RT-TIC-Q1 through RT-TIC-Q4. We can sum the TIC across all the MS scans to comprise a total TIC. Then we ask what fraction of all RT duration was required to accumulate the first quarter of total TIC, the second quarter, etc. This image shows the variation in TIC accumulation times for each of the 10 experiments. In this image I have sorted by the startTimeStamp in the mzML files. The authors apparently ran a random ordering of their two cohorts (file names starting with "N" rather than "H"). The time stamps tell us that half the data were acquired in the afternoon of May 3rd while the other half were acquired in the early morning hours of May 4th. Do you see a shift in the RT-TIC plot between the fourth and fifth file? Would we see a cohort-associated shift? Reordering the files is just an Excel sort away! Thanks, Dave The integrity and confidentiality of this email is governed by these terms / Die integriteit en vertroulikheid van hierdie e-pos word deur die volgende bepalings gereël. http://www.sun.ac.za/emaildisclaimer |
From: Walzer <wa...@eb...> - 2018-01-18 11:44:25
|
Dear all, I'm very sorry but I will be on the train from Manchester at that time. best, mths On 16/01/18 16:25, Wout Bittremieux wrote: > Dear colleagues, > > This is a reminder that our next teleconference is scheduled for Thursday, January 18th, at 15h00 GMT (16h00 Western Europe, 17h00 Cape Town, 7h00 Seattle, 23h00 Beijing). > > You can connect to our teleconference on Google Hangouts through the following link: https://hangouts.google.com/group/taNzRy6ELMDK9wbk2 <https://hangouts.google.com/group/taNzRy6ELMDK9wbk2> > > I propose the following agenda items for the teleconference: > > EuBIC developer's meeting update > Mathias did a great job preparing our QC topic for the EuBIC developer's meeting and we managed to make some progress in developing a dashboard in Ghent last week. We can give an overview of what we did and what the current status is. See the slides in the attachment for an initial update. > Thanks again for your efforts Mathias! > > CV status > Martin has let us know that he's too busy to reliably attend our monthly meetings. Instead Julian will take over some of his efforts in updating the CV towards a 1.0 version. It would be useful to get an overview of where we're at right now and gain an insight into the necessary changes. > > Schedule of the annual PSI meeting > Based on Dave's mail from yesterday we can discuss the topics that will be addressed during the annual PSI meeting in Heidelberg in three months. > > Please reply to this email if you want to add additional items to the agenda. > > Thanks, > Wout > > PS: I have another commitment earlier on Thursday afternoon. I will try to attend our teleconference but I am not sure yet whether I will actually be able to do so, so you don't need to wait for me to start. > > > ------------------------------------------------------------------------------ > Check out the vibrant tech community on one of the world's most > engaging tech sites, Slashdot.org! http://sdm.link/slashdot > > > _______________________________________________ > Psidev-qc-dev mailing list > Psi...@li... > https://lists.sourceforge.net/lists/listinfo/psidev-qc-dev -- Mathias Walzer European Bioinformatics Institute (EMBL-EBI) Wellcome Trust Genome Campus, Hinxton, Cambridge, UK Office: +44 (0)1223 494 2610 E-mail: wa...@eb... |
From: Wout B. <wou...@ua...> - 2018-01-17 08:25:12
|
I realized I forgot to attach the slides we used to present the work we did at the EuBIC hackathon last week. Please see the attachment now. I also forgot to mention that the QCloud system from our friends in Barcelona has been published in PLoS ONE recently. Congrats! Check it out here: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0189209 Best, Wout |
From: Wout B. <wou...@ua...> - 2018-01-16 16:35:12
|
Dear colleagues, This is a reminder that our next teleconference is scheduled for Thursday, January 18th, at 15h00 GMT (16h00 Western Europe, 17h00 Cape Town, 7h00 Seattle, 23h00 Beijing). You can connect to our teleconference on Google Hangouts through the following link: https://hangouts.google.com/group/taNzRy6ELMDK9wbk2 <https://hangouts.google.com/group/taNzRy6ELMDK9wbk2> I propose the following agenda items for the teleconference: EuBIC developer's meeting update Mathias did a great job preparing our QC topic for the EuBIC developer's meeting and we managed to make some progress in developing a dashboard in Ghent last week. We can give an overview of what we did and what the current status is. See the slides in the attachment for an initial update. Thanks again for your efforts Mathias! CV status Martin has let us know that he's too busy to reliably attend our monthly meetings. Instead Julian will take over some of his efforts in updating the CV towards a 1.0 version. It would be useful to get an overview of where we're at right now and gain an insight into the necessary changes. Schedule of the annual PSI meeting Based on Dave's mail from yesterday we can discuss the topics that will be addressed during the annual PSI meeting in Heidelberg in three months. Please reply to this email if you want to add additional items to the agenda. Thanks, Wout PS: I have another commitment earlier on Thursday afternoon. I will try to attend our teleconference but I am not sure yet whether I will actually be able to do so, so you don't need to wait for me to start. |