HSRA download | SourceForge.net

HSRA is a MapReduce-based parallel tool for mapping reads from RNA sequencing (RNA-seq) experiments. RNA-seq analyses typically begin by mapping reads to a reference genome to determine the locations from which the reads originated, which is a very time-consuming step. HSRA lets bioinformatics researchers distribute their mapping tasks efficiently over the nodes of a cluster by combining a fast multithreaded spliced aligner (HISAT2) with Apache Hadoop, a distributed computing framework for scalable Big Data processing.
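To make the idea of distributing mapping tasks more concrete, here is a minimal Java sketch of the HISAT2 command line that a worker could run on one chunk of reads. It is an illustration only: the class `Hisat2CommandSketch`, its methods, and the file names are hypothetical, and the description above does not say how HSRA actually invokes HISAT2 internally. Only the standard HISAT2 options `-p`, `-x`, `-U`, `-1`, `-2` and `-S` are used.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, NOT part of HSRA: builds the HISAT2 command line
// that one worker might run for its own chunk of the input reads.
final class Hisat2CommandSketch {

    // Single-end reads: one input file passed with -U.
    static List<String> singleEnd(String indexBase, String reads,
                                  String samOut, int threads) {
        List<String> cmd = common(indexBase, threads);
        cmd.add("-U"); cmd.add(reads);
        cmd.add("-S"); cmd.add(samOut);
        return cmd;
    }

    // Paired-end reads: mate files passed with -1 and -2.
    static List<String> pairedEnd(String indexBase, String mates1,
                                  String mates2, String samOut, int threads) {
        List<String> cmd = common(indexBase, threads);
        cmd.add("-1"); cmd.add(mates1);
        cmd.add("-2"); cmd.add(mates2);
        cmd.add("-S"); cmd.add(samOut);
        return cmd;
    }

    // Options shared by both modes: thread count and index basename.
    private static List<String> common(String indexBase, int threads) {
        List<String> cmd = new ArrayList<>();
        cmd.add("hisat2");
        cmd.add("-p"); cmd.add(Integer.toString(threads));
        cmd.add("-x"); cmd.add(indexBase);
        return cmd;
    }

    public static void main(String[] args) {
        // File names below are placeholders chosen for this example.
        System.out.println(String.join(" ",
                singleEnd("genome_index", "chunk_0.fq", "chunk_0.sam", 4)));
    }
}
```

In a cluster setting, each node would receive a different chunk of the input and the per-chunk alignments would be collected afterwards; a list like this could be handed to `ProcessBuilder` on a machine where HISAT2 is installed.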

HSRA currently supports single-end and paired-end read alignments from FASTQ/FASTA datasets. It uses the Hadoop Sequence Parser (HSP) library to read input datasets stored on the Hadoop Distributed File System (HDFS) efficiently, and it can process datasets compressed with the Gzip and BZip2 codecs.
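As a rough illustration of what reading a compressed FASTQ dataset involves, the following Java sketch parses four-line FASTQ records from a Gzip stream using only the standard library. It is not the HSP API and does not touch HDFS; the class `FastqGzipSketch`, the `Read` type, and the sample data are assumptions made for this example. BZip2 is left out because the Java standard library has no BZip2 codec.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical sketch, NOT the HSP library: reads FASTQ records from Gzip data.
final class FastqGzipSketch {

    // One FASTQ record: read name, base sequence, and quality string.
    static final class Read {
        final String name;
        final String bases;
        final String qualities;

        Read(String name, String bases, String qualities) {
            this.name = name;
            this.bases = bases;
            this.qualities = qualities;
        }
    }

    // Parses simple four-line FASTQ records (@name, bases, +, qualities).
    static List<Read> parse(InputStream gzipped) throws IOException {
        List<Read> reads = new ArrayList<>();
        try (BufferedReader in = new BufferedReader(new InputStreamReader(
                new GZIPInputStream(gzipped), StandardCharsets.US_ASCII))) {
            String header;
            while ((header = in.readLine()) != null) {
                if (header.isEmpty()) {
                    continue; // tolerate blank lines between records
                }
                String bases = in.readLine();
                String plus = in.readLine();
                String quals = in.readLine();
                if (!header.startsWith("@") || bases == null || plus == null
                        || !plus.startsWith("+") || quals == null) {
                    throw new IOException("Malformed FASTQ record near: " + header);
                }
                reads.add(new Read(header.substring(1), bases, quals));
            }
        }
        return reads;
    }

    // Compresses text in memory so the example needs no input file.
    static byte[] gzip(String text) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (GZIPOutputStream out = new GZIPOutputStream(buf)) {
            out.write(text.getBytes(StandardCharsets.US_ASCII));
        }
        return buf.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        String fastq = "@read1\nACGT\n+\nIIII\n@read2\nTTGA\n+\nFFFF\n";
        List<Read> reads = parse(new ByteArrayInputStream(gzip(fastq)));
        for (Read r : reads) {
            System.out.println(r.name + "\t" + r.bases + "\t" + r.qualities);
        }
    }
}
```

A parser for distributed storage also has to cope with records that straddle the boundaries between input splits, which this single-stream sketch does not attempt.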

License

GNU General Public License version 3.0 (GPLv3)

Additional Project Details

Operating Systems

Linux

Intended Audience

Information Technology, Healthcare Industry, Science/Research

User Interface

Console/Terminal, Command-line

Programming Language

Java

Related Categories

Java Bio-Informatics Software, Java Big Data Tool

Registered

2018-02-06
