Menu

#333 tide-index should treat asterisks as stop codons

post v2.0
open
Kaipo
None
2015-09-25
2015-09-25
No

Currently, Crux skips over non-alphabetic characters in protein sequences. However, often, * is used to represent a stop codon. So this pseudogene sequence

PF3D7_0302300 | organism=Plasmodium_falciparum_3D7 | product=erythrocyte membr\
ane protein 1 (PfEMP1), pseudogene | location=Pf3D7_03_v3:125992-130233(-) | le\
ngth=1414 | sequence_SO=chromosome | SO=protein_coding
MYTSRCKDHKFIQFVFRLFKINSTNNIHIYIYIYIYIYIHKMVTERIHNKYCNTASMKNG
DQYNRKNLMVPMKERLLARIYHLTFRLMQRKEDMKKGDIPMFRATYVVRTD
TNDKHEYASEHDKYQGPCTGKDTKFVIGTPWKKEENEVNQIHKDVLLPPRRRHMCTSNLE
NLNVYSIELTGVNASHSFLGDLLLAAKYERKHIKNNLRKDILGICTDIKYRFADLGDIIR
GKDM
YQNRD

should be treated as multiple subproteins

MYTSRCKDHKFIQFVFRLFKINSTNNIHIYIYIYIYIYIHKMVTERIHNKYCNTASMKNG
DQYNRKN

L

MV

PMK

ERLLARI

YHLTF

RLMQRKEDMKKGDIPMF

RATYVV

RTDTNDKHEYASEHDKYQGPCTGKDTKFVIGTPWKKEENEVNQIHKDVLLPPRRRHMCTSNLE
NLNVYSIELTGVNASHSFLGDLLLAAKYERKHIKNNLRKDILGICTDIKYRFADLGDIIR
GKDM

YQNRD

Discussion


Log in to post a comment.