Menu

MUMmerGPU

Michael Schatz

MUMmerGPU: High-throughput sequence alignment using Graphics Processing Units.
Michael Schatz and Cole Trapnell
Center for Bioinformatics and Computational Biology, University of Maryland

MUMmerGPU is an open-source high-throughput parallel pairwise local sequence alignment program that runs on commodity Graphics Processing Units (GPUs) in common workstations. MUMmerGPU uses the new Compute Unified Device Architecture (CUDA) from nVidia to align multiple query sequences against a single reference sequence stored as a suffix tree. By processing the queries in parallel on the highly parallel graphics card, MUMmerGPU achieves more than a 10-fold speedup over a serial CPU version of the sequence alignment kernel, and outperforms the exact alignment component of MUMmer on a high end CPU by 3.5-fold in total application time when aligning reads from recent sequencing projects using Solexa/Illumina, 454, and Sanger sequencing technologies. MUMmerGPU is a low cost, ultra-fast sequence alignment program designed to handle the increasing volume of data produced by new, high-throughput sequencing technologies.

BMC Bioinformatics 2007 8:474

MUMmerGPU 2.0 features a new stackless depth-first-search print kernel and is 13x faster than the serial CPU version of the alignment code and nearly 4x faster in total computation time than MUMmerGPU 1.0. We exhaustively examined 128 GPU data layout configurations to improve register footprint and running time and conclude higher occupancy has greater impact than reduced latency for achieving high performance for data intensive GPGPU applications.

Parallel Computing 2009 35(8-9)429-440

See the Tesla Bio Workbench page for up to date results.

Announcements

  • Nov 8, 2010 - MUMmerGPU workloads download
  • Feb 21, 2010 - Add link to Tesla Bio Workbench
  • Jan 19, 2010 - MUMmerGPU 2.0 official release
  • Nov 13, 2009 - Moved to a Wiki
  • Bowtie An ultrafast memory-efficient short read aligner
  • CloudBurst Highly Sensitive Short Read Mapping with MapReduce
  • Crossbow Whole Genome Resequencing Analysis in the Clouds