From: James B. <jk...@sa...> - 2016-02-15 11:20:42
Consider the following example:

    Seq  73   chr3  100  0  10M  =  100  0  ...
    Seq  133  chr3  100  0  *    =  100  0  ...

The mate is unmapped but, in the traditional manner, it is placed at the same location as the mapped read. However, RNEXT and PNEXT are filled out for both entries.

I saw this in real data, and it got me wondering whether it is permitted. It sounds incorrect, and it destroys some of the benefit of RNEXT/PNEXT if they can refer to unmapped reads too. However, the specification isn't clear. It uses phrases like "PNEXT: Position of the primary alignment of the NEXT read in the template". I guess a "primary alignment" doesn't exist here, as the mate is not aligned, merely located adjacent to another read, which then puts it into the category of "the information is unavailable". However, that's trying to read between the lines; it's not explicitly stated.

Is it categorically the case that RNEXT/PNEXT/TLEN should never have values when the mate is unmapped? If so, I'll consider it a bug that this data was produced in such a manner.

Thanks,

James

-- 
James Bonfield (jk...@sa...)    | Hora aderat briligi. Nunc et Slythia Tova
                                | Plurima gyrabant gymbolitare vabo;
  A Staden Package developer:   | Et Borogovorum mimzebant undique formae,
https://sf.net/projects/staden/ | Momiferique omnes exgrabure Rathi.

-- 
 The Wellcome Trust Sanger Institute is operated by Genome Research
 Limited, a charity registered in England with number 1021457 and a
 company registered in England with number 2742969, whose registered
 office is 215 Euston Road, London, NW1 2BE.
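For anyone wanting to look for the same pattern in their own data, here is a minimal Python sketch of the check. It decodes the FLAG column and reports records whose "mate unmapped" bit (0x8) is set while RNEXT or PNEXT still carry values. The function and variable names are made up for illustration, the two input lines are the example records truncated to their first nine columns, and the sketch only reports the pattern; it does not settle whether the specification forbids it.

```python
# SAM FLAG bits (values as defined in the SAM specification).
READ_UNMAPPED = 0x4
MATE_UNMAPPED = 0x8

def parse_core_fields(line):
    """Split one SAM record into its first nine columns (hypothetical helper)."""
    qname, flag, rname, pos, mapq, cigar, rnext, pnext, tlen = line.split()[:9]
    return {
        "qname": qname, "flag": int(flag), "rname": rname, "pos": int(pos),
        "mapq": int(mapq), "cigar": cigar, "rnext": rnext,
        "pnext": int(pnext), "tlen": int(tlen),
    }

def mate_fields_set_for_unmapped_mate(rec):
    """True if the mate is flagged unmapped but RNEXT/PNEXT are not '*'/0."""
    mate_unmapped = bool(rec["flag"] & MATE_UNMAPPED)
    return mate_unmapped and (rec["rnext"] != "*" or rec["pnext"] != 0)

# The two example records, first nine columns only.
sam_lines = [
    "Seq 73 chr3 100 0 10M = 100 0",
    "Seq 133 chr3 100 0 * = 100 0",
]

records = [parse_core_fields(line) for line in sam_lines]
flagged = [rec for rec in records if mate_fields_set_for_unmapped_mate(rec)]

for rec in flagged:
    print(f"{rec['qname']} flag={rec['flag']}: mate unmapped, "
          f"but RNEXT={rec['rnext']} PNEXT={rec['pnext']}")
```

Only the flag 73 record is reported, since that is the one whose mate is unmapped; the flag 133 record is the unmapped read itself (bit 0x4), and its RNEXT/PNEXT point at a mapped mate.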