This page will describe how to use NGOpt tools with Reptile to correct errors in Illumina reads.
You will need the following packages/programs. They will need to be in your executable path:
In the directory containing my_reads.fastq, run:
<ngopt_trunk>/tools/quality/reptile_fq-converter ./ ./ 2
This will create three files,
my_reads.fa,my_reads.q, andmy_reads.map.txt. Because Reptile changes the original sequence names to arbitrary names, I have posted a hack on NGOpt that will produce a.map.txtfile. We will use this file later to rename our reads to the original names.Note: All files ending in .fastq in the current directory will be processed.
Copy the Reptile config-file template from the Reptile directory and change as needed.
cp <path_to_reptile>/src/config~ ./my_reads.rec.conf
vim my_reads.rec.conf
Edit my_reads.rec.conf to specify input and output files.
InFaFile" to contain my_reads.faIQFile" to contain my_reads.qOErrFile" to contain the desired name of the output file. reptile-outputRun reptile with the config file from previous step.
reptile my_reads.rec.conf
This will produce the file reptile-output, as mentioned in the previous step.
Now that we've identified our errors, we will merge them to get back corrected reads.
reptile_merger my_reads.fa reptile-output my_reads.rec.fasta
The third argument to reptile_merger is the output file. This file will contain our corrected reads.
To get back the original read names, we can use a script from NGOpt--reptile_rename.
<ngopt_trunk>/tools/quality/reptile_rename my_reads.map.txt my_reads.rec.fasta > my_reads.rec.renamed.fasta
We now have corrected reads with their original names in my_reads.rec.renamed.fasta.
If you have a paired-end library, you might want to re-pair reads with broken pairings. You can use the program repair in the NGOpt toolkit to fix broken pairings.
repair -p ./ -s .fasta my_reads.rec my_reads.rec.renamed.fasta
This will produce three files: my_reads.rec_p1.fasta, my_reads.rec_p2.fasta, and my_reads.rec_up.fasta. The third file contains reads with a missing pair.
Wiki: PreprocessingMultiplexedIlluminaData
View and moderate all "wiki Discussion" comments posted by this user
Mark all as spam, and block user from posting to "Wiki"
Originally posted by: eks.y...@gmail.com
Reptile Error correction has been updated to include a tool for choosing default parameters for each input dataset