When an EMBL file is split using the psu_splitter.pl
script, the ID line is not written correctly.
Before splitting (the /origid value is used to build the ID in each new file):
FT /origid="Tb927_11_04_20040218"
FT /origid="Tb927_11_05_20040218"
After splitting:
ID Tb927_11_0 standard; DNA; UNC; 78417 BP.
ID Tb927_11_0 standard; DNA; UNC; 6141 BP.
The ID has been truncated to 10 characters, so the 04 and 05
entries end up with the same ID. Since the output filenames are
taken from this ID line, the files will overwrite each other.
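
To illustrate the clash, here is a minimal Python sketch (not the psu_splitter.pl code itself). The 10-character width is an assumption inferred from "Tb927_11_0" above, and find_id_collisions is a hypothetical helper name.

```python
from collections import defaultdict

# Assumed truncation width, inferred from "Tb927_11_0" in the report.
ID_WIDTH = 10


def find_id_collisions(origids, width=ID_WIDTH):
    """Group full origid values by their truncated form.

    Returns only the truncated IDs shared by more than one origid,
    i.e. the cases where output files would overwrite each other.
    """
    groups = defaultdict(list)
    for origid in origids:
        groups[origid[:width]].append(origid)
    return {short: full for short, full in groups.items() if len(full) > 1}


if __name__ == "__main__":
    clashes = find_id_collisions(
        ["Tb927_11_04_20040218", "Tb927_11_05_20040218"]
    )
    for short, full in clashes.items():
        print(f"{short} is shared by: {', '.join(full)}")
```

Running a check like this over the /origid values before splitting would flag entries whose files are about to collide.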
As an aside, the filenames created are all in
lowercase, irrespective of the case of the ID. Is there any
chance this can be altered?
Logged In: YES
user_id=987685
This problem is in EMBOSS, and I have submitted a bug
report to deal with it.
In the meantime I have added a temporary hack to the psu_splitter
script that rewrites the ID lines when needed.
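
For anyone who needs to repair files that have already been split, the idea can be sketched roughly as follows in Python. This is not the actual hack in psu_splitter; restore_id_line is a hypothetical helper, and it assumes each split file carries a single /origid qualifier on an FT line, as in the example above.

```python
import re

# Matches a feature-table line such as: FT /origid="Tb927_11_04_20040218"
ORIGID_RE = re.compile(r'^FT\s+/origid="([^"]+)"')
# Matches the "ID" prefix plus the (possibly truncated) entry name.
ID_NAME_RE = re.compile(r'^(ID\s+)\S+')


def restore_id_line(lines):
    """Return a copy of lines with the ID name replaced by the first /origid.

    Lines are expected without trailing newlines. If no /origid is found,
    the lines are returned unchanged.
    """
    origid = None
    for line in lines:
        match = ORIGID_RE.match(line)
        if match:
            origid = match.group(1)
            break
    if origid is None:
        return list(lines)

    fixed = []
    for line in lines:
        # Keep the original spacing and the rest of the ID line intact;
        # only the entry name is swapped for the full origid.
        line = ID_NAME_RE.sub(lambda m: m.group(1) + origid, line, count=1)
        fixed.append(line)
    return fixed


if __name__ == "__main__":
    entry = [
        "ID Tb927_11_0 standard; DNA; UNC; 78417 BP.",
        'FT /origid="Tb927_11_04_20040218"',
    ]
    print("\n".join(restore_id_line(entry)))
```

The sketch keeps the case of the /origid value as it is, so filenames derived from the repaired ID would no longer be forced to lowercase.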