Proinsulin exhibits a single structure, whereas insulin-like growth factors refold as two disulfide isomers in equilibrium. Native insulin-related growth factor (IGF)-I has canonical cystines (A6 -A11, A7-B7, and A20 -B19) maintained by IGFbinding proteins; IGF-swap has alternative pairing (A7-A11, A6 -B7, and A20 -B19) and impaired activity. Studies of minidomain models suggest that residue B5 (His in insulin and Thr in IGFs) governs the ambiguity or uniqueness of disulfide pairing. Residue B5, a site of mutation in proinsulin causing neonatal diabetes, is thus of broad biophysical interest. Here, we characterize reciprocal B5 substitutions in the two proteins. In insulin, His The vertebrate insulin-related superfamily consists of insulin and insulin-related growth factors (IGF-I 5 and IGF-II) (1, 2), relaxin (3-5), and relaxin-related factors (6 -9). Insulin and IGFs function as ligands for receptor tyrosine kinases (the insulin receptor and type 1 IGF receptor; IR and IGF-1R) (10), whereas relaxin and related factors bind to G-protein-coupled receptors (11). Interest in the evolution and folding properties of insulin-related polypeptides has recently been invigorated by the discovery of mutations in the human insulin gene associated with permanent neonatal-onset diabetes mellitus (12). These dominant mutations impair the foldability of variant and (in trans) wild-type proinsulin, leading to -cell dysfunction, endoplasmic reticular (ER) stress, and impaired -cell viability (13). One such mutation occurs at position B5 (12, 14). Insulin contains a conserved His at B5, whereas IGF-I contains a conserved Thr (Table 1). Here, we investigate reciprocal substitutions in these proteins, Thr B5 in insulin and His B5 in IGF-I, as probes of competing evolutionary constraints among otherwise homologous sequences.
6IGFs are single-chain polypeptides containing A-and B-domains, an intervening connecting (C)-domain, and a C-terminal D-domain (Fig. 1A) (1, 2); insulin (like relaxin and related factors) contains two chains (designated A and B; Fig. 1B) as a consequence of proteolytic processing in the trans-Golgi network (3-5, 15). Crystal structures of IGF-I and insulin exhibit similar ␣-helical domains (Fig. 1, A and B) (1-9). Protein folding is linked to specific disulfide pairing. The canonical cystines in insulin are A6 -A11, A7-B7, and A20 -B19, and the corresponding cystines in IGF-I are at polypeptide positions 47-52,