We recently reported statistical analysis of structural data on glycosidic linkages. Here we extend this analysis to the glycan-protein linkage, and the peptide primary, secondary, and tertiary structures around N-glycosylation sites. We surveyed 506 glycoproteins in the Protein Data Bank crystallographic database, giving 2592 glycosylation sequons (1683 occupied) and generated a database of 626 nonredundant sequons with 386 occupied. Deviations in the expected amino acid composition were seen around occupied asparagines, particularly an increased occurrence of aromatic residues before the asparagine and threonine at position +2. Glycosylation alters the asparagine side chain torsion angle distribution and reduces its flexibility. There is an elevated probability of finding glycosylation sites in which secondary structure changes. An 11-class taxonomy was developed to describe protein surface geometry around glycosylation sites. Thirty-three percent of the occupied sites are on exposed convex surfaces, 10% in deep recesses and 20% on the edge of grooves with the glycan filling the cleft. A surprisingly large number of glycosylated asparagine residues have a low accessibility. The incidence of aromatic amino acids brought into close contact with the glycan by the folding process is higher than their normal levels on the surface or in the protein core. These data have significant implications for control of sequon occupancy and evolutionary selection of glycosylation sites and are discussed in relation to mechanisms of protein fold stabilization and regional quality control of protein folding. Hydrophobic protein-glycan interactions and the low accessibility of glycosylation sites in folded proteins are common features and may be critical in mediating these functions.
Ovarian cancer is the fourth most common cancer in women in the Western world. In a pilot scale study, we highlight changes in the total serum glycome of patients with advanced ovarian cancer that might shed insight into disease pathogenesis. These changes include increases in levels of core fucosylated, agalactosyl biantennary glycans (FA2) and sialyl Lewis x (SLe(x)). To investigate further which proteins contribute to these alterations, we developed technology to analyze simultaneously the glycosylation of protein glycoforms contained in single spots excised from a 2D gel (<1 ng protein). The acute-phase proteins, haptoglobin, alpha1-acid glycoprotein, and alpha1-antichymotrypsin from patients contained elevated levels of subsets of glycoforms containing SLe(x). We also established that IgG heavy chains from patients contained twice the level of FA2 compared with healthy controls. Serum CA125 is the only biomarker that is used routinely, and there is a need for complementary markers that will improve both sensitivity and specificity. There was some preliminary indication that combinations of changes in the serum glycome might improve the separation of ovarian cancer and benign tumors; however, a larger study using data receiver operating characteristic curves will be required to draw any firm conclusions.
We have generated a database of 639 glycosidic linkage structures by an exhaustive survey of the available crystallographic data for isolated oligosaccharides, glycoproteins, and glycan-binding proteins. For isolated oligosaccharides there is relatively little crystallographic data available. A much larger number of glycoprotein and glycan-binding protein structures have now been solved in which two or more linked monosaccharides can be resolved. In the majority of these cases, only a few residues can be seen. Using the 639 glycosidic linkage structures, we have identified one or more distinct conformers for all the linkages. The O5-C1-O-C(x)' torsion angles for all these distinct conformers appear to be determined chiefly by the exo-anomeric effect. The Manalpha1-6Man linkage appears to be less restrained than the others, showing a wide degree of dispersion outside the ranges of the defined conformers. The identification of distinct conformers for glyco-sidic linkages allows "average" glycan structures to be modeled and also allows the easy identification of distorted glycosidic linkages. Such an analysis shows that the interactions between IgG Fc and its own N-linked glycan result in severe distortion of the terminal Galbeta1-4GlcNAc linkage only, indicating the strong interactions that must be present between the Gal residue and the protein surface. The applicability of this crystallographic based analysis to glycan structures in solution is discussed. This database of linkagestructures should be a very useful reference tool in three-dimensional structure determinations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.