SUMMARY The Bacteroides are a numerically dominant genus of the human intestinal microbiota. These organisms harbor a rare bacterial pathway for incorporation of exogenous fucose into capsular polysaccharides and glycoproteins. The infrequency of glycoprotein synthesis by bacteria prompted a more detailed analysis of this process. Here, we demonstrate that Bacteroides fragilis has a general O-glycosylation system. The proteins targeted for glycosylation include those predicted to be involved in protein folding, protein-protein interactions, peptide degradation, as well as surface lipoproteins. Protein glycosylation is central to the physiology of B. fragilis and is necessary for the organism to competitively colonize the mammalian intestine. We provide evidence that general O-glycosylation systems are conserved among intestinal Bacteroides species and likely contribute to the predominance of Bacteroides in the human intestine.
The main features of the protein structure are two antiparallel beta-sheets (a central one with three strands and another with two), a short helix that packs against the three-stranded beta-sheet, and a carboxy-terminal region that, although lacking regular secondary structure, is well defined and packs against the three-stranded beta-sheet, on the opposite face to the helix. We have used the structure, in combination with existing biochemical data, to identify residues that may be involved in C8 binding.
Two modifications to the commonly used protocols for calculating NMR structures are developed, relating to the treatment of NOE constraints involving groups of equivalent protons or nonstereoassigned diastereotopic protons. Firstly, a modified method is investigated for correcting for multiplicity, which is applicable whenever all NOE intensities are calibrated as a single set and categorised in broad intensity ranges. Secondly, a new set of values for 'pseudoatom corrections' is proposed for use with calculations employing 'centre-averaging'. The effect of these protocols on structure calculations is demonstrated using two proteins, one of which is well defined by the NOE data, the other less so. It is shown that failure to correct for multiplicity when using 'r(-6) averaging' results in overly precise structures, higher NOE energies and deviations from geometric ideality, while failure to correct for multiplicity when using 'r(-6) summation' can cause an avoidable degradation of precision if the NOE data are sparse. Conversely, when multiplicities are treated correctly, r(-6) averaging, r(-6) summation and centre averaging all give closely comparable results when the structure is well defined by the data. When the NOE data contain less information, r(-6) averaging or r(-6) summation offer a significant advantage over centre averaging, both in terms of precision and in terms of the proportion of calculations that converge on a consisten result.
eIF1 is a universally conserved translation factor that is necessary for scanning and involved in initiation site selection. We have determined the solution structure of human eIF1 with an N-terminal His tag using NMR spectroscopy. Residues 29-113 of the native sequence form a tightly packed domain with two α-helices on one side of a five-stranded parallel and antiparallel β-sheet. The fold is new but similar to that of several ribosomal proteins and RNA-binding domains. A likely binding site is indicated by yeast mutations and conserved residues located together on the surface. No interaction with recombinant eIF5 or the initiation site RNA GCCACAAUGGCA was detected by NMR, but GST pull-down experiments show that eIF1 binds specifically to the p110 subunit of eIF3. This interaction explains how eIF1 is recruited to the 40S ribosomal subunit.
Summary The human gut symbiont Bacteroides fragilis has a general protein O-glycosylation system in which numerous extracytoplasmic proteins are glycosylated at a three amino acid motif. In B. fragilis, protein glycosylation is a fundamental and essential property as mutants with protein glycosylation defects have impaired growth and are unable to competitively colonize the mammalian intestine. In this study, we analyzed the phenotype of B. fragilis mutants with defective protein glycosylation and found that the glycan added to proteins is comprised of a core glycan and an outer glycan. The genetic region encoding proteins for the synthesis of the outer glycan is conserved within a Bacteroides species but divergent between species. Unlike the outer glycan, an antiserum raised to the core glycan reacted with all Bacteroidetes species tested, from all four classes of the phylum. We found that these diverse Bacteroidetes species synthesize numerous glycoproteins and glycosylate proteins at the same three amino acid motif. The wide-spread conservation of this protein glycosylation system within the phylum suggests that this system of post-translational protein modification evolved early, before the divergence of the four classes of Bacteroidetes, and has been maintained due to its physiologic importance to the diverse species of this phylum.
Among bacterial species demonstrated to have protein Oglycosylation systems, that of Bacteroides fragilis and related species is unique in that extracytoplasmic proteins are glycosylated at serine or threonine residues within the specific threeamino acid motif D(S/T)(A/I/L/M/T/V). This feature allows for computational analysis of the proteome to identify candidate glycoproteins. With the criteria of a signal peptidase I or II cleavage site or a predicted transmembrane-spanning region and the presence of at least one glycosylation motif, we identified 1021 candidate glycoproteins of B. fragilis. In addition to the eight glycoproteins identified previously, we confirmed that another 12 candidate glycoproteins are in fact glycosylated. These included four glycoproteins that are predicted to localize to the inner membrane, a compartment not previously shown to include glycosylated proteins. In addition, we show that four proteins involved in cell division and chromosomal segregation, two of which are encoded by candidate essential genes, are glycosylated. To date, we have not identified any extracytoplasmic proteins containing a glycosylation motif that are not glycosylated. Therefore, based on the list of 1021 candidate glycoproteins, it is likely that hundreds of proteins, comprising more than half of the extracytoplasmic proteins of B. fragilis, are glycosylated. Site-directed mutagenesis of several glycoproteins demonstrated that all are glycosylated at the identified glycosylation motif. By engineering glycosylation motifs into a naturally unglycosylated protein, we are able to bring about site-specific glycosylation at the engineered sites, suggesting that this glycosylation system may have applications for glycoengineering.
The recent report of the synthesis of glycoproteins by the abundant intestinal symbionts Bacteroides showed that these organisms use a novel bacterial enzyme to decorate their surfaces with a sugar residue derived from their environment. As a first step in understanding the importance of these glycoproteins to the bacteria and to the bacterial-host symbiosis, we identified and characterized the abundant glycoproteins of Bacteroides distasonis Using lectin-affinity purification followed by tandem mass spectrometry, we identified a family of at least nine glycoproteins, similar only to the S-layer glycoproteins of Tannerella forsythia. Analysis of one of these purified glycoproteins demonstrated that the glycan is primarily a polymer of xylose, a monosaccharide rarely found in bacterial glycans. Even more unexpected was the finding that seven of nine of the glycoprotein promoters undergo DNA inversion, a process that we show is active in their endogenous human environment. Using cross-species functional assays, we show that a single serine family site-specific recombinase globally mediates the inversions of these glycoprotein promoters. This regulatory mechanism is similar to that of the Bacteroides fragilis capsular polysaccharides and establishes DNA inversion as a general and ancient means of regulation of glycan-containing surface molecules of these important human intestinal symbionts.
Gene transcription requires the release of inactive DNA from its packaging of histone proteins. Following the discovery of the first transcription-associated histone acetyltransferase, tetrahymena GCN5, it was shown that yeast GCN5 is recruited to the promoter and causes hyper-acetylation of histones and transcriptional activation of target genes, establishing a direct connection between histone acetylation and transcriptional activation. Many other important transcription regulators have been found to have histone acetyltransferase activity, including TAFII230/250, p300/CBP and its associated factor PCAF. Here we present the solution structure of the catalytic domain of tGCN5 (residues 47-210) in complex with coenzyme A. The structure contains two domains; the amino-terminal domain is similar to those of other GCN5-related N-acetyltransferases but the carboxy-terminal domain is not. Coenzyme A binds in a deep hydrophobic pocket between the two domains. Chemical shift changes upon titration with histone H3 peptides indicate a binding site at the domain boundary opposite to the coenzyme A site. The structural data indicate a single-step acetyl-transfer reaction mechanism catalysed by a hydrogen bond to the backbone amide group of leucine 126 and the side-chain carboxyl group of a conserved acidic residue.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.