From: Nava W. <ne...@sg...> - 2009-08-06 13:16:03
|
hmm, might be neat to have a samtools validate tool which checks if a sam file conforms to the spec. On Thu, Aug 06, 2009 at 09:14:15AM -0400, Tim Fennell wrote: > I'd expect that what's happening is probably a result of violating one > or more constraints in the sam spec. For example, I'm not entirely > sure how things would be affected if there are four reads/records in > the file with the same name as would happen if you merge a file with > itself. > > -t > > On Aug 6, 2009, at 6:53 AM, Sendu Bala wrote: > > > When you do something silly like merge a bam with itself, why > > doesn't an > > rmdup/MarkDuplicates see the duplicate reads as duplicates? And why do > > samtools and picard behave so differently in this case? > > > > For example, starting with a 2000 read bam file that both samtools and > > picard agree has 6 duplicate reads, merged with itself it becomes 4000 > > reads. Then on the merged bam: > > > > picard-tools MarkDuplicates marks 28 reads as duplicates. > > > > samtools rmdup gets rid of 1073 reads. > > > > I'd have naively expected 2006 reads to be seen as duplicates. Or > > perhaps 12. Or even 6 again. But those results just seem random? > > > > > > -- > > The Wellcome Trust Sanger Institute is operated by Genome Research > > Limited, a charity registered in England with number 1021457 and a > > company registered in England with number 2742969, whose registered > > office is 215 Euston Road, London, NW1 2BE. > > > > ------------------------------------------------------------------------------ > > Let Crystal Reports handle the reporting - Free Crystal Reports 2008 > > 30-Day > > trial. Simplify your report design, integration and deployment - and > > focus on > > what you do best, core application coding. Discover what's new with > > Crystal Reports now. http://p.sf.net/sfu/bobj-july > > _______________________________________________ > > Samtools-help mailing list > > Sam...@li... > > https://lists.sourceforge.net/lists/listinfo/samtools-help > > > ------------------------------------------------------------------------------ > Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day > trial. Simplify your report design, integration and deployment - and focus on > what you do best, core application coding. Discover what's new with > Crystal Reports now. http://p.sf.net/sfu/bobj-july > _______________________________________________ > Samtools-help mailing list > Sam...@li... > https://lists.sourceforge.net/lists/listinfo/samtools-help -- Nav Work: 01865 854873 Mob : 07518-358405 |