From: Alec W. <al...@br...> - 2010-08-24 13:59:26
|
Hi Jessica, Can you clarify a couple of things? When you say "the two files seem to match," what two files are you referring to? When the two files are slightly off, what does that mean? Thanks, Alec On 8/24/10 9:45 AM, Jessica Maia wrote: > Hi there, > > I think there's a minor bug in Picard's MarkDuplicates function. Reads were aligned with BWA and variants identified with Samtools. I ran the following steps 4 times in this order with only the merge step being different among runs: > > aln > sampe/samse > import > sort > merge > -Samtools merge version1 - merged all the individual bam files for each lane all at once > -Samtools merge version2 - merge the files for each Illumina run first, then then merged bam files for all the runs into 1 file. > Picard's MarkDuplicates with REMOVE_DUPLICATES=true > pileup -c > varFilter. > > After the merge step and before the PCR duplicate removal, I ran Picard's AlignmentSummaryMetrics and the two files seem to match. After the PCR duplicate removal step, I ran AlignmentSummaryMetrics and the two files are slightly off. In addition, the snp/indels obtained at the end are slightly off. > > The second time I ran these steps, I sorted the bam files after the merge step and before PCR duplicate removal. This produced the same outcome had I not sorted before removing duplicates. > > I'm happy to provide more details. The bam files are quite large, about 140G, corresponding to about 1.5 billion reads. > > > Jessica > > ------------------------------------------------------------------------------ > Sell apps to millions through the Intel(R) Atom(Tm) Developer Program > Be part of this innovative community and reach millions of netbook users > worldwide. Take advantage of special opportunities to increase revenue and > speed time-to-market. Join now, and jumpstart your future. > http://p.sf.net/sfu/intel-atom-d2d > _______________________________________________ > Samtools-help mailing list > Sam...@li... > https://lists.sourceforge.net/lists/listinfo/samtools-help > |