FLASH doesn't align overlap

2016-06-10
2016-06-11
  • Renato Oliveira

    Renato Oliveira - 2016-06-10

    Before starting the discussion, I would like to congratulate everyone involved in the development of FLASH.
    Of all the pipelines and scripts for merging overlapping paired-end reads, FLASH gave the best results.

    However, it could be better if the merging process took into account that two overlapping sequences might mismatch because of the insertion or deletion of a base. For example:

    ACGTAGATCGATAGATAGTAGATGTAGATATGA
    -------------------TAGTAGTAGATGTAGATATGAAGAACACAACGATCGATGCTG

    The two reads above do not seem to overlap, but if an insertion is allowed for, they could:

    ACGTAGATCGATAGATAGTAGATGTAGATATGA
    --------------------TAG_TAGTAGATGTAGATATGAAGAACACAACGATCGATGCTG

    Please consider this suggestion.
    Thanks in advance!

     

    Last edit: Renato Oliveira 2016-06-10
  • synchronicity

    synchronicity - 2016-06-11

    Thanks for the feedback. This is a known limitation of FLASH: it only considers substitution errors, not insertions or deletions. As such, it is primarily intended for data where substitution errors are much more common than insertions or deletions. Currently, there are no plans to address this.
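
To illustrate the limitation described in the reply, here is a minimal Python sketch of substitution-only (ungapped) overlap detection. It is not FLASH's actual code: the function names, the example reads, and the `min_overlap` and `max_mismatch_ratio` values are assumptions chosen for illustration, and both reads are assumed to be already in the same orientation.

```python
from typing import Optional, Tuple


def count_mismatches(read1: str, read2: str, overlap: int) -> int:
    """Count substitutions when the last `overlap` bases of read1 are laid
    directly over the first `overlap` bases of read2, with no gaps."""
    suffix = read1[-overlap:]
    prefix = read2[:overlap]
    return sum(a != b for a, b in zip(suffix, prefix))


def best_ungapped_overlap(
    read1: str,
    read2: str,
    min_overlap: int = 10,             # illustrative value (assumption)
    max_mismatch_ratio: float = 0.25,  # illustrative value (assumption)
) -> Optional[Tuple[int, int]]:
    """Try every ungapped overlap length and return (overlap, mismatches)
    for the one with the lowest mismatch ratio, or None if every candidate
    exceeds max_mismatch_ratio. Ties go to the longer overlap."""
    best = None
    best_ratio = None
    longest = min(len(read1), len(read2))
    for overlap in range(max(1, min_overlap), longest + 1):
        mismatches = count_mismatches(read1, read2, overlap)
        ratio = mismatches / overlap
        if ratio > max_mismatch_ratio:
            continue
        if best_ratio is None or ratio <= best_ratio:
            best, best_ratio = (overlap, mismatches), ratio
    return best


# Hypothetical reads, made up for this example (not real data).
READ1 = "ACGTTGCAAGCTTAGGCATC"
# READ2_CLEAN starts with the last 12 bases of READ1.
READ2_CLEAN = "AGCTTAGGCATC" + "GGATCCAT"
# READ2_INS is the same read with one extra base (T) inside the overlap.
READ2_INS = "AGCTTA" + "T" + "GGCATC" + "GGATCCAT"

if __name__ == "__main__":
    print("clean pair:    ", best_ungapped_overlap(READ1, READ2_CLEAN))
    print("pair with indel:", best_ungapped_overlap(READ1, READ2_INS))
```

With these hypothetical reads, the clean pair is found at a 12-base overlap with no mismatches, while the pair with a single inserted base is rejected at every overlap length: whichever way the reads are slid, the bases on one side of the indel fall out of register and are counted as substitutions. Handling such a case would need a gapped alignment of the overlap region, which this sketch deliberately leaves out.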

     
