Every sequencing library contains duplicate reads. While many duplicates arise during polymerase chain reaction (PCR), some of these duplicates derive from multiple identical fragments of mRNA present in the original lysate (termed "biological duplicates"). Unique Molecular Identifiers (UMIs) are random oligonucleotide sequences that allow differentiation between technical and biological duplicates. Here we report the development of UMI-Reducer, a new computational tool for processing and differentiating PCR duplicates from biological duplicates. UMI-Reducer uses UMIs and the mapping position of the read to identify and collapse reads that are technical duplicates. Remaining true biological reads are further used for a bias-free estimate of mRNA abundance in the original lysate. This strategy is of particular use for libraries made from low amounts of starting material, which typically require additional cycles of PCR.
UMI-Reducer
collapsing technical duplicates via Unique Molecular Identifiers
Brought to you by:
smangul1
Downloads:
0 This Week