Owing to the degeneracy of the genetic code, a protein sequence can be encoded by many different synonymous mRNA coding sequences. Synonymous codon usage was once thought to be functionally neutral, but evidence now indicates it is shaped by evolutionary selection and affects other aspects of protein biogenesis beyond specifying the amino acid sequence of the protein. Synonymous rare codons, once thought to have only negative impacts on the speed and accuracy of translation, are now known to play an important role in diverse functions, including regulation of cotranslational folding, covalent modifications, secretion, and expression level. Mutations altering synonymous codon usage are linked to human diseases. However, much remains unknown about the molecular mechanisms connecting synonymous codon usage to efficient protein biogenesis and proper cell physiology. Here we review recent literature on the functional effects of codon usage, including bioinformatics approaches aimed at identifying general roles for synonymous codon usage.
Synonymous rare codons are considered to be sub-optimal for gene expression because they are translated more slowly than common codons. Yet surprisingly, many protein coding sequences include large clusters of synonymous rare codons. Rare codons at the 5’ terminus of coding sequences have been shown to increase translational efficiency. Although a general functional role for synonymous rare codons farther within coding sequences has not yet been established, several recent reports have identified rare-to-common synonymous codon substitutions that impair folding of the encoded protein. Here we test the hypothesis that although the usage frequencies of synonymous codons change from organism to organism, codon rarity will be conserved at specific positions in a set of homologous coding sequences, for example to tune translation rate without altering a protein sequence. Such conservation of rarity–rather than specific codon identity–could coordinate co-translational folding of the encoded protein. We demonstrate that many rare codon cluster positions are indeed conserved within homologous coding sequences across diverse eukaryotic, bacterial, and archaeal species, suggesting they result from positive selection and have a functional role. Most conserved rare codon clusters occur within rather than between conserved protein domains, challenging the view that their primary function is to facilitate co-translational folding after synthesis of an autonomous structural unit. Instead, many conserved rare codon clusters separate smaller protein structural motifs within structural domains. These smaller motifs typically fold faster than an entire domain, on a time scale more consistent with translation rate modulation by synonymous codon usage. While proteins with conserved rare codon clusters are structurally and functionally diverse, they are enriched in functions associated with organism growth and development, suggesting an important role for synonymous codon usage in organism physiology. The identification of conserved rare codon clusters advances our understanding of distinct, functional roles for otherwise synonymous codons and enables experimental testing of the impact of synonymous codon usage on the production of functional proteins.
Anfinsen’s principle1 asserts that all information required to specify the structure of a protein is encoded in its amino acid sequence. However, during protein synthesis by the ribosome the N-terminus of the nascent chain can begin to fold before the C-terminus is available. We tested whether this co-translational folding can alter the folded structure of an encoded protein in vivo, versus the structure formed when refolded in vitro. We designed a fluorescent protein consisting of three half-domains, where the N- and C-terminal half-domains compete with each other to interact with the central half-domain. The outcome of this competition determines the fluorescence properties of the resulting folded structure. Upon refolding after chemical denaturation, this protein produced equimolar amounts of the N- and C-terminal folded structures, respectively. In contrast, translation in E. coli resulted in a two-fold enhancement in the formation of the N-terminal folded structure. Rare synonymous codon substitutions at the 5′ end of the C-terminal half-domain further increased selection for the N-terminal folded structure. These results demonstrate that the rate at which a nascent protein emerges from the ribosome can specify the folded structure of a protein.
Antibiotic resistance is a major threat to human welfare. Inhibitors of multidrug efflux pumps (EPIs) are promising alternative therapeutics that could revive activities of antibiotics and reduce bacterial virulence. Identification of new druggable sites for inhibition is critical for the development of effective EPIs, especially in light of constantly emerging resistance. Here, we describe EPIs that interact with periplasmic membrane fusion proteins, critical components of efflux pumps that are responsible for the activation of the transporter and the recruitment of the outer-membrane channel. The discovered EPIs bind to AcrA, a component of the prototypical AcrAB-TolC pump, change its structure in vivo, inhibit efflux of fluorescent probes, and potentiate the activities of antibiotics in Escherichia coli and other Gram-negative bacteria. Our findings expand the chemical and mechanistic diversity of EPIs, suggest the mechanism for regulation of the efflux pump assembly and activity, and provide a promising path for reviving the activities of antibiotics in resistant bacteria.
Protein folding is an essential prerequisite for protein function and hence cell function. Kinetic and thermodynamic studies of small proteins that refold reversibly were essential for developing our current understanding of the fundamentals of protein folding mechanisms. However, we still lack sufficient understanding to accurately predict protein structures from sequences, or the effects of disease-causing mutations. To date, model proteins selected for folding studies represent only a small fraction of the complexity of the proteome and are unlikely to exhibit the breadth of folding mechanisms used in vivo. We are in urgent need of new methods – both theoretical and experimental – that can quantify the folding behavior of a truly broad set of proteins under in vivo conditions. Such a shift in focus will provide a more comprehensive framework from which to understand the connections between protein folding, the molecular basis of disease, and cell function and evolution.
Initial protein structural comparisons were sequence-based. Since amino acids that are distant in the sequence can be close in the 3-dimensional (3D) structure, 3D contact approaches can complement sequence approaches. Traditional 3D contact approaches study 3D structures directly and are alignment-based. Instead, 3D structures can be modeled as protein structure networks (PSNs). Then, network approaches can compare proteins by comparing their PSNs. These can be alignment-based or alignment-free. We focus on the latter. Existing network alignment-free approaches have drawbacks: 1) They rely on naive measures of network topology. 2) They are not robust to PSN size. They cannot integrate 3) multiple PSN measures or 4) PSN data with sequence data, although this could improve comparison because the different data types capture complementary aspects of the protein structure. We address this by: 1) exploiting well-established graphlet measures via a new network alignment-free approach, 2) introducing normalized graphlet measures to remove the bias of PSN size, 3) allowing for integrating multiple PSN measures, and 4) using ordered graphlets to combine the complementary PSN data and sequence (specifically, residue order) data. We compare synthetic networks and real-world PSNs more accurately and faster than existing network (alignment-free and alignment-based), 3D contact, or sequence approaches.
Autotransporter (AT) proteins provide a diverse array of important virulence functions to Gram-negative bacterial pathogens, and have also been adapted for protein surface display applications. The “autotransporter” moniker refers to early models that depicted these proteins facilitating their own translocation across the bacterial outer membrane. Although translocation is less autonomous than originally proposed, AT protein segments upstream of the C-terminal transmembrane β-barrel have nevertheless consistently been found to contribute to efficient translocation and/or folding of the N-terminal virulence region (the “passenger”). However, defining the precise secretion functions of these AT regions has been complicated by the use of multiple overlapping and ambiguous terms to define AT sequence, structural, and functional features, including “autochaperone”, “linker” and “junction”. Moreover, the precise definitions and boundaries of these features vary among ATs and even among research groups, leading to an overall murky picture of the contributions of specific features to translocation. Here we propose a unified, unambiguous nomenclature for AT structural, functional and conserved sequence features, based on explicit criteria. Applied to 16 well studied AT proteins, this nomenclature reveals new commonalities for translocation but also highlights that the autochaperone function is less closely coupled with a conserved sequence element than previously believed.
Summary Autotransporter (AT) proteins are a broad class of virulence factors from Gram-negative pathogens. AT outer membrane (OM) secretion appears simple in many regards, yet the mechanism that enables transport of the central AT “passenger” across the OM remains unclear. OM secretion efficiency for two AT passengers is enhanced by a ~20 kDa stable core at the C-terminus of the passenger, but studies on a broader range of AT proteins are needed in order to determine whether a stability difference between the passenger N- and C-terminus represents a truly common mechanistic feature. Y. pestis YapV is homologous to Shigella flexneri IcsA, and like IcsA, YapV recruits mammalian neural Wiskott-Aldrich syndrome protein (N-WASP). In vitro, the purified YapV passenger is functional and rich in β-sheet structure, but lacks a ~20 kDa C-terminal stable core. However, the N-terminal 49 residues of the YapV passenger globally destabilize the entire YapV passenger, enhancing its OM secretion efficiency. These results indicate that the contributions of AT passenger sequences to OM secretion efficiency extend beyond a C-terminal stable core, and highlight a role of the passenger N-terminus in reducing passenger stability in order to facilitate OM secretion of some AT proteins.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.