We describe new methods for predicting protein tertiary structures to low resolution given the speci®cation of secondary structure and a limited set of long-range NMR distance constraints. The NMR data sets are derived from a realistic protocol involving completely deuterated 15 N and 13 C-labeled samples. A global optimization method, based upon a modi®cation of the aBB (branch and bound) algorithm of Floudas and co-workers, is employed to minimize an objective function combining the NMR distance restraints with a residue-based protein folding potential containing hydrophobicity, excluded volume, and van der Waals interactions. To assess the ef®cacy of the new methodology, results are compared with benchmark calculations performed via the X-PLOR program of Bru È nger and co-workers using standard distance geometry/molecular dynamics (DGMD) calculations. Seven mixed a/b proteins are examined, up to a size of 183 residues, which our methods are able to treat with a relatively modest computational effort, considering the size of the conformational space. In all cases, our new approach provides substantial improvement in root-mean-square deviation from the native structure over the DGMD results; in many cases, the DGMD results are qualitatively in error, whereas the new method uniformly produces high quality low-resolution structures. The DGMD structures, for example, are systematically non-compact, which probably results from the lack of a hydrophobic term in the X-PLOR energy function. These results are highly encouraging as to the possibility of developing computational/ NMR protocols for accelerating structure determination in larger proteins, where data sets are often underconstrained.
# 1999 Academic PressKeywords: protein folding; global optimization; annealing; distance restraints
*Corresponding author
IntroductionEnergy minimization in the absence of longrange constraints has not yet contributed to the determination of new protein structures. Two requirements for a solution to this problem are (1) a potential energy function which is accurate enough to distinguish the native from all other conformations; and (2) a means of globally minimizing such a function. In the absence of an accurate energy function, some effort has been spent on augmenting existing functions with experimentally derived constraints. Provided the quality and quantity of the constraints are suf®cient to guarantee a global minimum near the native conformation, the augmented energy functions (or target functions) may be useful for testing and developing minimization algorithms. The resulting constrained energy minimization (CEM) techniques avoid one serious problem associated with unconstrained minimization, which is that the global minimum is not, in general, known.In addition to providing an objective target for optimization, CEM is immediately applicable to the problem of NMR structure re®nement. Typi-E-mail address of the corresponding author: rich@chem.columbia.edu Abbreviations used: DGMD, distance geometry/ molecular dynamics; CEM,...