| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| README | 2011-04-13 | 1.2 kB | |
| readFilter.tar.gz | 2011-04-13 | 14.7 kB | |
| Totals: 2 Items | 15.9 kB | 0 | |
README
readFilter.pl
http://sourceforge.net/projects/readFilter/
Filter DNA reads in fasta format removing sequences not meeting minimum length and
maximum number of N's. Also can remove sequences not containing QC adapter/barcode
sequence. Coordinately filters associated quality files if present.
NOTE: Does NOT parse sequences by barcode. The QC sequence feature is meant to remove
low quality NGS sequences, as sequences with errors in a known primer, adpater, barcode
sequence tend to be more prone to errors in the remainder of the sequence.
INSTALLATION
No installation is necessary, but file must be executed on a computer with Perl
installed (http://www.perl.org/). Script has been tested on Mac & Linux only.
EXECUTION
SYNTAX: readFilter.pl -l length -n maxN -b ReqSeq -i infile -o outfile
ARGS: -l minimum length cutoff (DEFAULT = 300)
-n maximum number of N's allowed in a valid sequence (DEFAULT=1)
-b barcode (nucleotide sequence which must occur at start to keep)
-i input fasta file (quality file is inferred to have same name with '.qual'
-o output fasta file