mfsizes

Multi-FASTA sequence (DNA or protein) statistics calculator.

Add a Review
1 Download (This Week)
Last Update:

Description

A simple command-line utility to calculate biological sequence (DNA or protein) sizes in a (multi) FASTA file. It gives averages, GC (or methionine) content, N50, N90, N95, number of N's, and total bases, and can also report by codon if requested.

mfsizes Web Site

Features

  • sequence sizes (DNA or protein)
  • GC content, in percentage (for each sequence and overall weighted average)
  • methionine content, in absolute number and percentage (protein only)
  • codon GC content (DNA only)
  • multi FASTA input files
  • reports average sequence size, total nucleotides, N50, N90, and N95
  • by default, report shows sequence names sorted in descending size order
  • report is tab-delimited text with results from one FASTA entry per line

KEEP ME UPDATED

Write a Review

User Reviews

Be the first to post a review of mfsizes!

Additional Project Details

Intended Audience

Science/Research

User Interface

Command-line

Programming Language

Perl

Registered

2011-05-20
Screenshots can attract more users to your project.
Features can attract more users to your project.