Menu

#189 Bug in parsing non-standard NCBI FASTA header

open
nobody
5
2009-09-21
2009-09-21
Jon Ison
No

From emboss-bug :

> I found a bug in the header parsing of needle. See the inputs and outputs included below. Note how in the output there is no sequence accession for input 1.
> *************Input 1********************
>> OMNI|NTL01AP01841||5105472| 247aa long hypothetical protein taxon:272557
> ************Input2***********************
>> pva1.model.393_00196
>
> **************Output*****************
> #=======================================
> #
> # Aligned_sequences: 2
> # 1:
> # 2: pva1.model.393_00196

This relates to how non-standard NCBI FASTA headers are parsed,
especially where they have missing fields.

We will check and make sure the next release has something to report in
the header.

Discussion

MongoDB Logo MongoDB