A simple command-line utility that calculates the sizes of biological sequences (DNA or protein) in a FASTA or multi-FASTA file. It reports the average sequence size, GC content (or methionine content for proteins), N50, N90, N95, the number of N's, and the total number of bases, and can also report codon GC content (DNA only) on request.
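For readers unfamiliar with the N50 family of statistics, the following is a minimal Python sketch of the commonly used definition: sort the sequence lengths in descending order and return the length at which the running total first reaches the requested percentage of all bases. It is not taken from the mfsizes source (which is written in Perl); the function name `nxx` is hypothetical, and details such as how mfsizes treats N's or ties are not stated on this page.

```python
def nxx(lengths, percent):
    """Return the Nxx statistic (e.g. percent=50 for N50).

    Illustrative only: follows the common definition, which may differ
    in edge cases from what mfsizes actually does.
    """
    if not lengths:
        return 0
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence to the shortest.
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point rounding issues.
        if running * 100 >= total * percent:
            return length
    return 0


if __name__ == "__main__":
    sizes = [10, 20, 30, 40]
    for p in (50, 90, 95):
        print(f"N{p}\t{nxx(sizes, p)}")
```

Under this definition, N90 and N95 can never exceed N50, because the running total must cover more of the data before the threshold is reached.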

Features

  • sequence sizes (DNA or protein)
  • GC content as a percentage (for each sequence, plus an overall weighted average)
  • methionine content, as an absolute count and as a percentage (protein only)
  • codon GC content (DNA only)
  • multi FASTA input files
  • reports average sequence size, total nucleotides, N50, N90, and N95
  • by default, report shows sequence names sorted in descending size order
  • report is tab-delimited text with results from one FASTA entry per line
  • gzip-compressed input supported
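The features above can be illustrated with a short Python sketch that reads a multi-FASTA file (optionally gzip-compressed), computes each sequence's size and GC percentage, and builds tab-delimited rows sorted by descending size. This is an approximation for illustration, not the mfsizes implementation: the function names, the three-column layout, the `.gz` suffix check, and the choice to divide by the full sequence length (N's included) are all assumptions.

```python
import gzip


def open_fasta(path):
    """Open a FASTA file as text; a '.gz' suffix is assumed to mean gzip."""
    if path.endswith(".gz"):
        return gzip.open(path, "rt")
    return open(path, "r")


def read_fasta(lines):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            header = line[1:].split()
            name, chunks = (header[0] if header else ""), []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def gc_count(seq):
    """Count G and C bases, ignoring case."""
    upper = seq.upper()
    return upper.count("G") + upper.count("C")


def build_report(lines):
    """Return (rows, overall_gc_percent) for the given FASTA lines."""
    records = [(name, len(seq), gc_count(seq)) for name, seq in read_fasta(lines)]
    # Largest sequences first, mirroring the default order described above.
    records.sort(key=lambda r: r[1], reverse=True)
    rows = []
    for name, size, gc in records:
        pct = 100.0 * gc / size if size else 0.0
        rows.append(f"{name}\t{size}\t{pct:.2f}")
    total = sum(size for _, size, _ in records)
    total_gc = sum(gc for _, _, gc in records)
    # Dividing total GC by total length weights each sequence by its size.
    overall = 100.0 * total_gc / total if total else 0.0
    return rows, overall
```

To run it on a file, pass the open handle directly, for example `with open_fasta("input.fa.gz") as fh: rows, overall = build_report(fh)`, where the file name is a placeholder.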

Categories

Bio-Informatics

License

GNU General Public License version 3.0 (GPLv3)

Additional Project Details

Intended Audience

Science/Research

User Interface

Command-line

Programming Language

Perl

Related Categories

Perl Bio-Informatics Software

Registered

2011-05-20