We have used intramolecular cross-linking, MS, and sequence threading to rapidly identify the fold of a model protein, bovine basic fibroblast growth factor (FGF)-2. Its tertiary structure was probed with a lysine-specific cross-linking agent, bis(sulfosuccinimidyl) suberate (BS 3 ). Sites of cross-linking were determined by tryptic peptide mapping by using time-of-flight MS. Eighteen unique intramolecular lysine (Lys-Lys) cross-links were identified. The assignments for eight cross-linked peptides were confirmed by using post source decay MS. The interatomic distance constraints were all consistent with the tertiary structure of FGF-2. These relatively few constraints, in conjunction with threading, correctly identified FGF-2 as a member of the -trefoil fold family. To further demonstrate utility, we used the top-scoring homolog, IL-1, to build an FGF-2 homology model with a backbone error of 4.8 Å (rms deviation). This method is fast, is general, uses small amounts of material, and is amenable to automation.
In recent years, the number of novel proteins identified by genomic (1, 2) and proteomic projects has dramatically increased, with a concomitant need for more rapid determination of their tertiary structures.Visualization of the three-dimensional structures of proteins has traditionally been realized by x-ray crystallography and NMR. These techniques produce high resolution atomic data but require relatively large amounts (10 to 100 mg) of pure analyte in a particular solution or crystalline state. Even if these conditions are met, it can take months or even years to generate a molecular structure by following these methodologies.To develop an alternative approach to structure determination that could keep pace with the rate of novel protein identification, we have re-examined cross-linking technology in the light of newer analytical protocols for the separation and identification of complex peptide mixtures. Previous investigators have shown that cross-linking experiments can provide low resolution interatomic distance information (3). In theory, given enough distance information, it is possible to solve the tertiary structure of a macromolecule (4, 5).The challenge we faced in trying to generate such information in a short time using cross-linking technology was to devise a rapid method for identifying cross-linked residues. MS affords high throughput but has rarely been used for the identification of cross-links. One study has been published where disuccinimidyl ester cross-linking, Edman sequencing, and MS were used to validate a model of human erythropoietin (6). Recent advances in MS (7, 8) gave us the means whereby we could determine the masses and sequences of large peptides with high accuracy and sensitivity (9, 10). These improvements make it feasible to analyze complex peptide mixtures from proteolytically digested, cross-linked proteins (11) very quickly. Specifically, we describe the use of chemical cross-linking and time-of-flight (TOF) MS to identify Lys-Lys cross-links. We also show how t...