|
From: Tim F. <tfe...@br...> - 2012-02-23 17:56:19
|
The numbers are calculated by using the basecalls from the read, and the cigar string to match up the read bases with the appropriate reference bases and then calculate the fraction which mismatch the reference (we include in this read bases which are no-calls). The PF_MISMATCH_RATE is simply the number of mismatching aligned bases divided by all aligned bases. The PF_HQ_ERROR_RATE is perhaps poorly named, but is the same calculation as above except that it only looks at reads with a mapping quality of 20 or higher PF_INDEL_RATE is the number of insertion or deletion events per 100 read bases. -t On Feb 23, 2012, at 12:46 PM, Richard Corbett wrote: > Hi all, > > Does anyone know how these numbers are calculated? ie, are they based on > flags, the cigar, or sequence string? > > PF_MISMATCH_RATE 0.47867 0.476719 0.477708 > PF_HQ_ERROR_RATE 0.48216 0.481005 0.481589 > PF_INDEL_RATE 0.00009 0.000095 0.000093 > > Also, a question about interpretation... is this telling me that .477 > percent of my aligned bases are mismatch errors? > > thanks, > RIchard > > > ------------------------------------------------------------------------------ > Virtualization & Cloud Management Using Capacity Planning > Cloud computing makes use of virtualization - but cloud computing > also focuses on allowing computing to be delivered as a service. > http://www.accelacomm.com/jaw/sfnl/114/51521223/ > _______________________________________________ > Samtools-help mailing list > Sam...@li... > https://lists.sourceforge.net/lists/listinfo/samtools-help |