PASHA is a parallel short-read assembler for large genomes based on de Bruijn graphs. By taking advantage of both shared-memory multi-core CPUs and distributed-memory compute clusters, PASHA has demonstrated its potential to perform high-quality de novo assembly of large genomes in reasonable time with modest computing resources. Our evaluation on three small real paired-end datasets shows that PASHA produces better assemblies than three leading assemblers (Velvet, ABySS and SOAPdenovo) at comparable genome coverage and mis-assembly rates. Moreover, PASHA is the fastest of the four on all three datasets when run on a single CPU. For the human genome, PASHA achieves assembly quality competitive with ABySS and completes the assembly in about 21 hours, which is about 2.38× faster than ABySS on the same hardware configuration.

Categories

Bio-Informatics

License

GNU General Public License version 2.0 (GPLv2), Apache License V2.0

Additional Project Details

Operating Systems

Linux, BSD

User Interface

Console/Terminal

Programming Language

C++

Registered

2012-07-05