We present a method for designing a funnel-shaped free-energy surface that reproducibly assembles secondary structure elements of proteins into their native conformations from a random extended configuration. Assuming a priori knowledge of secondary structure, our method can design a funnel-shaped surface for folding of ␣, , and ␣ structures individually. We design energy surfaces that fold up to five unrelated sequences with the same energy parameters. We develop a measure of the foldability of an energy landscape in silico and present an alternative way to view energy landscapes.I n 1973, Anfinsen (1) showed that the native state of a protein corresponds to a minimum of free energy. Using his discovery as a starting point, scientists have tried to derive the free energy surface of proteins and to predict the folded protein structure.One of the dominant prediction methods used today is the decoy-then-scoring paradigm (45). First, a large number of decoys are generated to sample the conformation space. Usually considerable effort is exerted to explore and enrich the near-native fraction of the decoy set. After the sampling, a discrimination function picks out the best structure(s) from the ensemble. In this approach, the sampling of conformation space and the scoring of structures are separate. This paradigm is suitable for today's rapid increase of computational power: sampling is trivially parallelizable and, consequently, progress can result from simple increase in the number of operations spent on the task. Dovetailed into the decoy-generate-then-score paradigm are several different methods of extracting energy parameters for proteins. We mention a few examples: Z-score minimization (2-4) requires that the native state energy lie as low as possible relative to some reference state, usually represented by an ensemble of compact structures. Linear optimization (5, 6) maximizes a variety of conditions subject to the logical constraint that the energy of the native state (suitably parametrized as a linear sum) is simply lower than all alternate conformations. Potentials of mean force (7) assume that some relevant variables, e.g., residue-residue distances, are Boltzmann-distributed with the native state being the overwhelmingly more populated one. Finally, some approaches have trial-and-error components, where researchers notice and introduce features that make their predictions more native-like (8, 9).The price extracted by the generate-decoy-then-score separation is the necessity of elaborate and computationally intensive sampling techniques to uncover the low energy states. Several clever schemes (see ref. 9 and references therein) use sequence information by either finding a close homologue or using short segment homology in fragment insertion. So far, researchers have not been able to consistently fold proteins using the generate-then-score paradigm (10, 11).
Folding FunnelsScientists are not alone in having trouble folding proteins with current methods. A polypeptide can assume such a vast number of conf...