Download Latest Version readFilter.tar.gz (14.7 kB)
Email in envelope

Get an email when there's a new version of readFilter

Home / ver1
Name Modified Size InfoDownloads / Week
Parent folder
README 2011-04-13 1.2 kB
readFilter.tar.gz 2011-04-13 14.7 kB
Totals: 2 Items   15.9 kB 0
README

readFilter.pl
http://sourceforge.net/projects/readFilter/

 Filter DNA reads in fasta format removing sequences not meeting minimum length and 
 maximum number of N's.  Also can remove sequences not containing QC adapter/barcode 
 sequence.  Coordinately filters associated quality files if present.

 NOTE: Does NOT parse sequences by barcode. The QC sequence feature is meant to remove
 low quality NGS sequences, as sequences with errors in a known primer, adpater, barcode
 sequence tend to be more prone to errors in the remainder of the sequence.

INSTALLATION

 No installation is necessary, but file must be executed on a computer with Perl
 installed (http://www.perl.org/).  Script has been tested on Mac & Linux only.

EXECUTION

 SYNTAX:  readFilter.pl -l length -n maxN  -b ReqSeq -i infile -o outfile

 ARGS:    -l minimum length cutoff (DEFAULT = 300)
          -n maximum number of N's allowed in a valid sequence (DEFAULT=1)
          -b barcode (nucleotide sequence which must occur at start to keep)
          -i input fasta file (quality file is inferred to have same name with '.qual'
          -o output fasta file       
Source: README, updated 2011-04-13