From: Tim F. <tfe...@br...> - 2011-02-28 14:09:50
|
The way Picard handles this is actually by looking at the LB (library) tag on the read groups. It will treat reads from the same library as one group for duplication detection, but will not call things duplicates when the reads originate from different libraries. So if the LB tag on your read groups is the same they'll be merged for duplicate detection, and if it's different they won't be. -t On Feb 28, 2011, at 8:32 AM, Davide Cittaro wrote: > Hi all, how does samtools (and picard) deal with possible duplicates that belong to different RG within the same BAM file? Are they kept (because they come from different groups) or deleted (because they are likely to be duplicates)? I'm asking this because I have a sample that has been run on two lanes on the same flowcell, therefore I can have duplicates but I want to keep lane information in RG field. > > Thanks > > d > /* > Davide Cittaro, PhD > > Cogentech - Consortium for Genomic Technologies > via adamello, 16 > 20139 Milano > Italy > > tel.: +39(02)574303007 > e-mail: dav...@if... > */ > > > > > ------------------------------------------------------------------------------ > Free Software Download: Index, Search & Analyze Logs and other IT data in > Real-Time with Splunk. Collect, index and harness all the fast moving IT data > generated by your applications, servers and devices whether physical, virtual > or in the cloud. Deliver compliance at lower cost and gain new business > insights. http://p.sf.net/sfu/splunk-dev2dev _______________________________________________ > Samtools-help mailing list > Sam...@li... > https://lists.sourceforge.net/lists/listinfo/samtools-help |