Different components of the theoretical protein folding problem are evaluated critically. It is argued that: (i) as a rule, small-and medium-sized proteins are in the free energy minimum; (ii) long-living metastable states may either appear occasionally with growing protein size, or be selected by evolution for a specific function; (iii) functions discriminating against incorrect folds would fail if they were used directly in the global optimization, unless they approximate the true free energy accurately; (iv) surface and electrostatic free energies should be treated separately; (v) conformational entropy (of side chains in particular) should be taken into account; (vi) Monte Carlo procedures considering all free energy terms and combining global knowledge-based random moves with local optimization have the largest potential for success.Protein folding; Global optimization; Free energy; Electrostatics; Solvation; Entropy
THE THEORETICAL PROTEIN FOLDING PROBLEMPrediction of the native three-dimensional structure of a protein from the amino acid sequence remains an unsolved problem despite numerous efforts to solve it for more than a quarter of a century. A physical approach to the problem in its pure form is based on the assumption that the native conformation corresponds to the structure with the lowest free energy and is thus in a state of thermodynamic equilibrium. This view is backed by in vitro observations that many proteins can spontaneously and successfully refold from a variety of denatured states [l-3]. The biological factors of in vivo folding, such as peptidyl-prolyl-isomerase, protein disulphide isomerase, molecular chaperonins and translocation control, clearly influence the kinetics of protein folding, assembly and transport [4,5] by reducing energy barriers and protecting intermediates from aggregation, however, these facts do not invalidate the hypothesis that the native conformation corresponds to the global free energy minimum.The alternative assumption (backed by Levinthal's argument [6]) is that the native state is the lowest kinetically accessible free energy minimum which is separated from the true global minimum by a large kinetic barrier (more than 25-30 kcal/mol). The first experimental evidence supporting this view has been provided recently [7-91. a-Lytic protease was shown to have two conformational forms, active and inactive [7]. The protein needs a catalyst, which is normally covalently attached to it, to bypass a kinetic barrier of more than about 27 kcal/mol separating the intermediate metastable conformation from its stable active form. It was also suggested that B-sheet rearrangements in serpins (serine protease inhibitors) imply the existence of a metastable kinetically trapped five-stranded p-sheet conformation which can be rearranged slowly to the thermodynamically stable six-stranded conformation [8,10].These interesting observations raise two questions: (i) could a metastable state have all the features of the normal protein? and if it could, (ii) is this behaviour typica...