[Jmol-users] Fwd: Re: Chain order changes: a problem for Proteopedia

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 454-5900

<html>
<body>
Dear Bob,<br><br>
How difficult would it be to change the state script generator in Jmol to
avoid using atom serial numbers?<br><br>
I'm sure none of us ever envisioned the current state of affairs. We have
who knows how many state scripts saved in Proteopedia, and now an unknown
number of March-17-remediated PDB files have scrambled atom serial
numbers. Further, there is no guarantee that such scrambling will not
occur again in a future remediation.<br><br>
A change in Jmol to avoid serial number dependency in saved state scripts
will not avoid the need for repairs to Proteopedia now, but may avoid
such calamities in the future.<br><br>
Regards, -Eric<br><br>
<blockquote type=cite class=cite cite="">Date: Mon, 30 Mar 2009 15:09:56
-0400<br>
From: John Westbrook &lt;jw...@rc...&gt;<br>
Reply-To: jw...@rc...<br>
Organization: RCSB - Protein Data Bank<br>
To: em...@mi...<br>
CC: Helen Berman &lt;be...@rc...&gt;, Kim Henrick
&lt;he...@eb...&gt;<br>
Subject: Re: Chain order changes: a problem for Proteopedia <br><br>
Dear Eric,<br><br>
In producing the V3.2 wwPDB release, we have tried to preserve as
much<br>
as possible the PDB chain and residue nomenclature for polymer molecular
components.<br>
With version 3.2 comes the introduction of more uniform assignment<br>
PDB chain identifiers for ligands and solvent, so this has resulted
in<br>
some nomenclature changes in V3.2 files.<br><br>
<font color="#FF0000">Neither the V2.6 nor V3.2 format specifications
suggests that the atom serial<br>
number be used as the primary atom identifier.</font>&nbsp;&nbsp; I am
sure that you are aware that<br>
there are now many PDB entries that have been split across multiple PDB
data<br>
files specifically because of the limitation in the range atom serial
numbers.<br>
Atom serial numbers are also replicated between models so they do not
represent<br>
unique atom identifiers for NMR or other multi-model entries.<br><br>
If you have a specific dependency prior atom serial numbers in your
software<br>
system, then you can always recover the particular version of the PDB
entry<br>
that you used from our ftp snapshot server
(<a href="ftp://snapshots.rcsb.org/" eudora="autourl">
ftp://snapshots.rcsb.org</a>).<br><br>
I should point out that we provided Jaim with an advanced copy of
the<br>
V3.2 files for testing on Dec 3, 2008.&nbsp;&nbsp; The issue of atom
ordering<br>
was not raised as an issue at that time.<br><br>
Regards,<br><br>
John<br><br>
<br><br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; Begin forwarded message:<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; From: Eric Martz &lt;em...@mi...&gt;<br>
&gt;&gt;&gt;&gt; Date: March 29, 2009 7:27:12 PM PDT<br>
&gt;&gt;&gt;&gt; To: &quot;in...@rc...&quot; &lt;in...@rc...&gt;,
&quot;pd...@rc...&quot; &lt;pd...@rc...&gt;<br>
&gt;&gt;&gt;&gt; Subject: pdb-l: Chain order changes: a problem for
Proteopedia<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; Dear wwPDB:<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; The March 17, 2009 remediation of PDB data in the wwPDB
(PDB format<br>
&gt;&gt;&gt;&gt; 3.20) appears to me to have, in many cases, changed the
order of<br>
&gt;&gt;&gt;&gt; chains, and hence the atom serial numbers in the PDB
files. This has<br>
&gt;&gt;&gt;&gt; created a major problem in the wiki Proteopedia.Org,
where many<br>
&gt;&gt;&gt;&gt; molecular scenes that took hours or weeks to develop are
now<br>
&gt;&gt;&gt;&gt; nonfunctional.<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; The problem arises becaused Jmol uses atom serial
numbers for<br>
&gt;&gt;&gt;&gt; selecting groups of atoms when it saves a molecular
scene (in a<br>
&gt;&gt;&gt;&gt; &quot;state script&quot;). Proteopedia's Scene Authoring
Tool uses Jmol's state<br>
&gt;&gt;&gt;&gt; scripts to capture molecular scenes and attach them to
&quot;green links&quot;.<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; Questions:<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; 1. Were the names of ATOM chains ever changed? I assume
(and hope)<br>
&gt;&gt;&gt;&gt; not, but I have not checked carefully. I see that the
chain names<br>
&gt;&gt;&gt;&gt; assigned to HETATMs were changed in some cases, e.g.
1e3m, where an<br>
&gt;&gt;&gt;&gt; ADP single-residue &quot;chain&quot; originally named
chain C (before the 2007<br>
&gt;&gt;&gt;&gt; remediation) is now deemed to be part of chain A (and
its position<br>
&gt;&gt;&gt;&gt; was moved to the end of the file, after all ATOM
records). Since I<br>
&gt;&gt;&gt;&gt; have been unable to get pre-March-17 snapshot PDB files
(the<br>
&gt;&gt;&gt;&gt; snapshot.wwpdb.org server is unresponsive) I am not sure
when each of<br>
&gt;&gt;&gt;&gt; these changes were made.<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; 2. Was the changing of chain orders in the March 17
remediation<br>
&gt;&gt;&gt;&gt; intentional? If so, is the new order specified somewhere
in the 3.20<br>
&gt;&gt;&gt;&gt; documentation? I can see no pattern to the new chain
orders (see<br>
&gt;&gt;&gt;&gt; examples below).<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; 3. Were chain orders ever changed in files that contain
only protein<br>
&gt;&gt;&gt;&gt; chains (no nucleic acids)?<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; 4. Will the changes in chain order be retained
permanently (requiring<br>
&gt;&gt;&gt;&gt; substantial repairs to Proteopedia.Org)?<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; Observations:<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; We first noticed the broken molecular scenes in
Proteopedia in cases<br>
&gt;&gt;&gt;&gt; that involved DNA. Therefore I have so far limited my
inspection of<br>
&gt;&gt;&gt;&gt; PDB files to those containing both protein and DNA.<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; Since the snapshot ftp server is unresponsive today, my
comparisons<br>
&gt;&gt;&gt;&gt; were all made between files I had saved before the 2007
remediation<br>
&gt;&gt;&gt;&gt; (typically saved 2001-2004), and current files. We have
reason to<br>
&gt;&gt;&gt;&gt; suspect that changes in chain ordering occurred in the
March 17, 2009<br>
&gt;&gt;&gt;&gt; remediation, but I cannot verify this for the cases
below.<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; Some files have NO CHANGE in chain order:<br>
&gt;&gt;&gt;&gt; 1d66: DE (DNA), AB (protein).<br>
&gt;&gt;&gt;&gt; 1osl: (an NMR multiple model file) AB (protein), CD
(DNA).<br>
&gt;&gt;&gt;&gt; 1e3m: old AB (protein), C (single residue ADP HETATM
&quot;chain&quot;), EF<br>
&gt;&gt;&gt;&gt; (DNA); new AB, EF. (ADP now in chain A at the end, thus
changing ATOM<br>
&gt;&gt;&gt;&gt; serial numbers.)<br>
&gt;&gt;&gt;&gt;&nbsp;&nbsp; Thus there appears to be no requirement for
nucleic acid or<br>
&gt;&gt;&gt;&gt; protein chains to come first.<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; Some files that had protein first were rearranged to put
DNA first:<br>
&gt;&gt;&gt;&gt; 1aoi: old ABCDEFGH (protein), IJ (DNA); new IJ,
ABCDEDFH.<br>
&gt;&gt;&gt;&gt; 1fzp: old DB (protein), WK (DNA); new WK, DB.<br>
&gt;&gt;&gt;&gt; 1hcr: old A (protein), BC (DNA); new BC, A.<br>
&gt;&gt;&gt;&gt;&nbsp;&nbsp; Thus there appears to be no requirement that
chains be in<br>
&gt;&gt;&gt;&gt; alphabetic order.<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; One file had an RNA chain moved to BETWEEN two DNA
chains, leaving<br>
&gt;&gt;&gt;&gt; protein before DNA:<br>
&gt;&gt;&gt;&gt; 1qln: old A (protein), TN (DNA), R (RNA); new A
(protein), N<br>
&gt;&gt;&gt;&gt; (DNA), R (RNA), T (DNA).<br>
&gt;&gt;&gt;&gt;&nbsp;&nbsp;&nbsp; The new order happens to be
alphabetical by chain name, but<br>
&gt;&gt;&gt;&gt; this is not true in other files (see above).<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; I did not happen to come across a case where DNA chains
preceded<br>
&gt;&gt;&gt;&gt; protein in the old format, with protein being moved
before DNA in the<br>
&gt;&gt;&gt;&gt; new format.<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; There also appears to be no requirement that chains be
in the order<br>
&gt;&gt;&gt;&gt; given in the COMPND records. Examples where the order
differs in the<br>
&gt;&gt;&gt;&gt; new files: 1flo, 1qln.<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; Sincerely, -Eric<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; /* - - - - - - - - - - - - - - - - - - - - - - - - - -
-<br>
&gt;&gt;&gt;&gt; Eric Martz, Professor Emeritus, Dept Microbiology<br>
&gt;&gt;&gt;&gt; U Mass, Amherst --
<a href="http://martz.molviz.org/" eudora="autourl">
http://Martz.MolviZ.Org</a><br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; Top Five 3D MolVis Technologies
<a href="http://top5.molviz.org/" eudora="autourl">
http://Top5.MolviZ.Org</a><br>
&gt;&gt;&gt;&gt; 3D Wiki with Scene-Authoring Tools
<a href="http://proteopedia.org/" eudora="autourl">
http://Proteopedia.Org</a><br>
&gt;&gt;&gt;&gt; Biochem 3D Education Resources
<a href="http://molviz.org/" eudora="autourl">http://MolviZ.org</a><br>
&gt;&gt;&gt;&gt; See 3D Molecules, Install Nothing! -
<a href="http://firstglance.jmol.org/" eudora="autourl">
http://firstglance.jmol.org</a><br>
&gt;&gt;&gt;&gt; ConSurf - Find Conserved Patches in Proteins:
<a href="http://consurf.tau.ac.il/" eudora="autourl">
http://consurf.tau.ac.il</a><br>
&gt;&gt;&gt;&gt; Atlas of Macromolecules:
<a href="http://atlas.molviz.org/" eudora="autourl">
http://atlas.molviz.org</a><br>
&gt;&gt;&gt;&gt; Workshops:
<a href="http://workshops.molviz.org/" eudora="autourl">
http://workshops.molviz.org</a><br>
&gt;&gt;&gt;&gt; World Index of Molecular Visualization Resources:<br>
&gt;&gt;&gt;&gt;
<a href="http://molvisindex.org/" eudora="autourl">
http://molvisindex.org</a><br>
&gt;&gt;&gt;&gt; PDB Lite Macromolecule Finder:
<a href="http://pdblite.org/" eudora="autourl">http://pdblite.org</a><br>
&gt;&gt;&gt;&gt; Molecular Visualization EMail List (molvis-list):<br>
&gt;&gt;&gt;&gt;
<a href="http://list.molviz.org/" eudora="autourl">
http://list.molviz.org</a><br>
&gt;&gt;&gt;&gt; Protein Explorer - 3D Visualization:
<a href="http://proteinexplorer.org/" eudora="autourl">
http://proteinexplorer.org</a><br>
&gt;&gt;&gt;&gt; - - - - - - - - - - - - - - - - - - - - - - - - - - -
*/<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt;<br>
&gt;&gt;&gt;&gt; TO UNSUBSCRIBE OR CHANGE YOUR SUBSCRIPTION OPTIONS,
please see<br>
&gt;&gt;&gt;&gt;
<a href="https://lists.sdsc.edu/mailman/listinfo.cgi/pdb-l" eudora="autourl">
https://lists.sdsc.edu/mailman/listinfo.cgi/pdb-l</a> .<br>
&gt;&gt;&gt;<br><br>
<br>
-- <br>
******************************************************************<br>
&nbsp; John Westbrook, Ph.D.<br>
&nbsp; Rutgers, The State University of New Jersey<br>
&nbsp; Department of Chemistry and Chemical Biology<br>
&nbsp; 610 Taylor Road<br>
&nbsp; Piscataway, NJ 08854-8087<br>
&nbsp; e-mail: jw...@rc...<br>
&nbsp; Ph:&nbsp; (732) 445-4290&nbsp; Fax: (732) 445-4320<br>
******************************************************************</blockquote>
</body>
</html>

[Jmol-users] Fwd: Re: Chain order changes: a problem for Proteopedia

An interactive viewer for three-dimensional chemical structures.

[Jmol-users] Fwd: Re: Chain order changes: a problem for Proteopedia