SeqPig is a library for Apache Pig for the distributed analysis of large sequencing datasets. It provides import and export functions for file formats commonly used for sequencing data, as well as a collection of Pig user-defined-functions (UDF’s) to help process aligned and unaligned sequence data. Currently SeqPig supports BAM/SAM, FastQ and Qseq input and output.
For more information see the manual at http://seqpig.sourceforge.net/
Be the first to post a review of SeqPig!