Overview

Interferon

Enly is a tool for closing some of the gaps commonly present in a draft genome. It is best suited to reads from the Roche 454 sequencer and works by iteratively mapping reads to the extremities of the contigs obtained from de novo assembly.

Enly takes as input a multiFASTA file containing all the contigs and tries to increase their length by repeating the following procedure for each contig. First, a user-selectable number of bases is taken from one extremity of the contig, and this sequence fragment is used as the query for a BLAST search against a database containing all the reads from the sequencing run. Since a typical 454 sequencing run produces reads of variable length, several BLAST searches are performed within the same cycle, each using a fragment of a different length. The BLAST output is then parsed to identify the reads that can extend the contig, that is, reads that align only partially with the end of the contig and project beyond its extremity. The identified reads and the original contig are then assembled together, yielding a (possibly) longer contig. The same procedure is repeated for the other extremity of the contig.
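The fragment selection and read filtering described above can be illustrated with a minimal Python sketch. It is not Enly's actual code: the names `end_fragments`, `Hit`, `overhang` and `extending_reads`, as well as the `tolerance` and `min_overhang` parameters, are hypothetical. The sketch assumes hits carry the 1-based `qstart`, `qend`, `sstart` and `send` coordinates found in BLAST tabular output, with the contig fragment as query and the read as subject, and it only handles hits on the plus strand.

```python
from dataclasses import dataclass


def end_fragments(contig, lengths, end="right"):
    """Return one terminal fragment of the contig per requested length."""
    fragments = []
    for n in lengths:
        n = min(n, len(contig))  # never ask for more bases than the contig has
        fragments.append(contig[-n:] if end == "right" else contig[:n])
    return fragments


@dataclass
class Hit:
    """One alignment between a contig fragment (query) and a read (subject)."""
    read_id: str
    read_len: int
    q_start: int  # 1-based alignment start on the fragment
    q_end: int    # 1-based alignment end on the fragment
    s_start: int  # 1-based alignment start on the read
    s_end: int    # 1-based alignment end on the read


def overhang(hit, frag_len, end="right", tolerance=2):
    """Number of read bases projecting beyond the contig end (0 if none).

    Assumption: only plus-strand hits are considered; hits with
    s_start > s_end (minus strand) are ignored in this sketch.
    """
    if hit.s_start > hit.s_end:
        return 0
    if end == "right":
        # The alignment must reach (almost) the last base of the fragment.
        if frag_len - hit.q_end > tolerance:
            return 0
        return max(hit.read_len - hit.s_end, 0)
    # Left end: the alignment must start (almost) at the first base.
    if hit.q_start - 1 > tolerance:
        return 0
    return max(hit.s_start - 1, 0)


def extending_reads(hits, frag_len, end="right", min_overhang=10):
    """Keep the ids of reads that project far enough past the contig end."""
    return [h.read_id for h in hits
            if overhang(h, frag_len, end) >= min_overhang]
```

For example, with a 50-base fragment taken from the right end, a 100-base read whose first 50 bases align to the whole fragment projects 50 bases beyond the contig and would be kept, whereas a read that aligns only in the middle of the fragment would be discarded.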

These steps are repeated for a user-specified number of cycles, or until no further elongation of the contigs is possible.
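The stopping rule can be sketched as a simple loop in Python. This is only an illustration under stated assumptions: `elongate` and its `extend_once` callback are hypothetical names, and `extend_once` stands in for one full cycle (BLAST searches, read selection and reassembly at both extremities), returning the contig unchanged when nothing can be added.

```python
def elongate(contig, extend_once, max_cycles=10):
    """Repeat extension cycles until the limit is hit or the contig stops growing.

    Returns the final contig and the number of cycles that lengthened it.
    """
    productive_cycles = 0
    for _ in range(max_cycles):
        longer = extend_once(contig)
        if len(longer) <= len(contig):
            break  # no further elongation is possible
        contig = longer
        productive_cycles += 1
    return contig, productive_cycles


# Toy stand-in for a real cycle: add one base until the contig has 8 bases.
def toy_extend(contig):
    return contig + "A" if len(contig) < 8 else contig


final_contig, cycles = elongate("ACGT", toy_extend, max_cycles=10)
print(final_contig, cycles)
```

With the toy callback, the loop stops after four productive cycles because the fifth call returns a contig of the same length. Lowering `max_cycles` to 2 stops it earlier, after two added bases.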

