We present a physically rigorous method to calculate solventdependent accessible surface areas (ASAs) of amino acid residues in unfolded proteins. ASA values will be larger in a good solvent, where solute-solvent interactions dominate and promote chain extension. Conversely, they will be smaller in a poor solvent, where solute-solute interactions dominate and promote chain collapse. In the method described here, these solvent-dependent effects are modeled by Boltzmann-weighting a simulated ensemble for solvent quality-good or poor. Solvent quality is parameterized as intramolecular hydrogen bond strength, using a ''hydrogen bond dial'' that can be varied from ''off'' to ''high'' (i.e., from 0 to ؊6 kcal/mol per hydrogen bond). When plotted as a function of hydrogen bond strength, the Boltzmann-weighted distribution of conformers describes a sigmoidal curve, with a transition midpoint near 1. ␥-turns ͉ hydrogen bonding ͉ protein folding ͉ unfolded state G lobular proteins fold to uniquely ordered, biologically relevant conformers under physiological conditions, but they unfold at high temperature, high pressure, extremes of pH, or in the presence of denaturing solvents. The transition between these states, N(ative) º U(nfolded), is spontaneous, cooperative, and reversible (1). With some exceptions (2), the N state is the biologically relevant form, and it is amenable to structural characterization (3). In contrast, the U state can only be described by a statistical model (4-6) and characterized in terms of averaged dimensions, for example, its mean radius of gyration (͗Rg͘) or its mean-squared end-to-end distance (͗L 2 ͘) (7). In addition to these classical measures, a reliable estimate of accessible surface area (ASA) (8) in the unfolded state has become a pressing need of late, motivated particularly by recent work that requires it (9, 10). Calculation of native-state ASA is straightforward in proteins of known structure (11-13), but corresponding values in the unfolded state are model-dependent. Creamer et al. (14,15) sought to estimate the ASA of residues in the unfolded state by bracketing it between two reliable extremes, an upper limit modeled on an extended polypeptide in good solvent and a lower limit extracted from chain segments in proteins of known structure. As a practical measure, Schellman (16) took the mean of these two extremes, and Auton and Bolen (9) followed suit. However, simply using the numerical average is unsatisfying because it lacks a rigorous physical basis. In an innovative approach, Goldenberg (17) simulated populations of protein-length chains by adapting a standard software package that had been developed originally to generate three-dimensional models from NMR-derived distance constraints. But this approach is not entirely satisfying either because a disproportionately large fraction of the residues fall within sterically restricted regions of conformational space (see table 1
in ref. 17).Here, we describe a physically based method to calculate both backbone and side-chain resid...