ngsdataqeval is a Python tool to evaluate the quality of high-throughput sequencing data, used by Next Generation Sequencing. Unlike other tools that analyze raw data, this is designed to evaluate the quality of the processed reads after mapping to a reference genome.
The evaluation is performed in a genomic region defined by the user, and it provides some statistics computed from the reads that map to that region (ie. a single gene). The program provides a graphical output embedded in an html file. The analysis contains the sequencing quality along the reads, the mapping quality distribution, the coverage of the defined region, the overall quality at each nucleotide position, and the distribution of the coverage as a function of the GC content in the reference genome. The results provided by this program can help to distinguish between sequence variation and sequencing errors.

Project Activity

See All Activity >

Categories

Data Quality

Follow NGS data quality evaluation

NGS data quality evaluation Web Site

Other Useful Business Software
Build Securely on AWS with Proven Frameworks Icon
Build Securely on AWS with Proven Frameworks

Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
Download Now
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of NGS data quality evaluation!

Additional Project Details

Registered

2012-03-09