We have conducted a prospective analysis of the Protein Data Bank in order to study certain constituents of proteins, i.e. those elements that are neither halogens or phosphorous nor part of the biological amino acid set. A sample of 5749 structures was analyzed and classified according to the 56 elements encountered. Almost half of the structures are represented by a set of twelve elements, each one involved in more than 100 structures. We analysed this subsample with more detail by computing the distance of amino acid residues in a coordination sphere of 5 Å, and studying the corresponding distribution curves of frequency of appearance, as well as using methods of cluster analysis to group the elements. We have therefore constructed a periodic table of the elements from a protein point of view. The analyses undertaken are able to distinguish between real components of proteins and elements inserted by artefacts of crystallization process or experimental techniques.Response to Reviewers: Dear Editor, Please find enclosed a reviewed version of the original paper submitted to JMM. We have answered all the questions raised by the reviewers and very much improved the original version. We have also altered the original title as one of the reviewers remarked that it did not totally match what the study provided. The final version also presents a revised study of the PDB and the conlusions were modified accordingly. In sum, this version is really much better than the original one.
AbstractWe have conducted a prospective analysis of the Protein Data Bank in order to study certain constituents of proteins, i.e. those elements that are neither halogens or phosphorous nor part of the biological amino acid set. A sample of 5749 structures was analyzed and classified according to the 56 elements encountered. In almost half of the structures 15 metals (i.e. Na, Mg, K, Ca, Mn, Fe, Co, Ni, Cu, Zn, As, Mo, Cd, W, Hg) are involved, each one figuring in more than 100 structures. We analysed this subsample in more detail by computing the distance of amino acid residues in a coordination sphere of 5 Å, and studying the corresponding occurrences, as well as using methods of cluster analysis to group the elements.The analyses undertaken are able to distinguish between real components of proteins and elements inserted by artefacts of crystallization process or experimental techniques.