To estimate the minimal gene set required to sustain bacterial life in nutritious conditions, we carried out a systematic inactivation of Bacillus subtilis genes. Among ~4,100 genes of the organism, only 192 were shown to be indispensable by this or previous work. Another 79 genes were predicted to be essential. The vast majority of essential genes were categorized in relatively few domains of cell metabolism, with about half involved in information processing, one-fifth involved in the synthesis of cell envelope and the determination of cell shape and division, and one-tenth related to cell energetics. Only 4% of essential genes encode unknown functions. Most essential genes are present throughout a wide range of Bacteria, and almost 70% can also be found in Archaea and Eucarya. However, essential genes related to cell envelope, shape, division, and respiration tend to be lost from bacteria with small genomes. Unexpectedly, most genes involved in the Embden-Meyerhof-Parnas pathway are essential. Identification of unknown and unexpected essential genes opens research avenues to better understanding of processes that sustain bacterial life.
The latest version of the CATH-Gene3D protein structure classification database (4.0, http://www.cathdb.info) provides annotations for over 235 000 protein domain structures and includes 25 million domain predictions. This article provides an update on the major developments in the 2 years since the last publication in this journal including: significant improvements to the predictive power of our functional families (FunFams); the release of our ‘current’ putative domain assignments (CATH-B); a new, strictly non-redundant data set of CATH domains suitable for homology benchmarking experiments (CATH-40) and a number of improvements to the web pages.
Transcription and translation require a high concentration of potassium across the entire tree of life. The conservation of a high intracellular potassium was an absolute requirement for the evolution of life on Earth. This was achieved by the interplay of P- and V-ATPases that can set up electrochemical gradients across the cell membrane, an energetically costly process requiring the synthesis of ATP by F-ATPases. In animals, the control of an extracellular compartment was achieved by the emergence of multicellular organisms able to produce tight epithelial barriers creating a stable extracellular milieu. Finally, the adaptation to a terrestrial environment was achieved by the evolution of distinct regulatory pathways allowing salt and water conservation. In this review we emphasize the critical and dual role of Na(+)-K(+)-ATPase in the control of the ionic composition of the extracellular fluid and the renin-angiotensin-aldosterone system (RAAS) in salt and water conservation in vertebrates. The action of aldosterone on transepithelial sodium transport by activation of the epithelial sodium channel (ENaC) at the apical membrane and that of Na(+)-K(+)-ATPase at the basolateral membrane may have evolved in lungfish before the emergence of tetrapods. Finally, we discuss the implication of RAAS in the origin of the present pandemic of hypertension and its associated cardiovascular diseases.
The function of most proteins is not determined experimentally, but is extrapolated from homologs. According to the “ortholog conjecture”, or standard model of phylogenomics, protein function changes rapidly after duplication, leading to paralogs with different functions, while orthologs retain the ancestral function. We report here that a comparison of experimentally supported functional annotations among homologs from 13 genomes mostly supports this model. We show that to analyze GO annotation effectively, several confounding factors need to be controlled: authorship bias, variation of GO term frequency among species, variation of background similarity among species pairs, and propagated annotation bias. After controlling for these biases, we observe that orthologs have generally more similar functional annotations than paralogs. This is especially strong for sub-cellular localization. We observe only a weak decrease in functional similarity with increasing sequence divergence. These findings hold over a large diversity of species; notably orthologs from model organisms such as E. coli, yeast or mouse have conserved function with human proteins.
CATH version 3.5 (Class, Architecture, Topology, Homology, available at http://www.cathdb.info/) contains 173 536 domains, 2626 homologous superfamilies and 1313 fold groups. When focusing on structural genomics (SG) structures, we observe that the number of new folds for CATH v3.5 is slightly less than for previous releases, and this observation suggests that we may now know the majority of folds that are easily accessible to structure determination. We have improved the accuracy of our functional family (FunFams) sub-classification method and the CATH sequence domain search facility has been extended to provide FunFam annotations for each domain. The CATH website has been redesigned. We have improved the display of functional data and of conserved sequence features associated with FunFams within each CATH superfamily.
Homologous genes are classified into orthologs and paralogs, depending on whether they arose by speciation or duplication. It is widely assumed that orthologs share similar functions, whereas paralogs are expected to diverge more from each other. But does this assumption hold up on further examination? We present evidence that orthologs and paralogs are not so different in either their evolutionary rates or their mechanisms of divergence. We emphasize the importance of appropriately designed studies to test models of gene evolution between orthologs and between paralogs. Thus, functional change between orthologs might be as common as between paralogs, and future studies should be designed to test the impact of duplication against this alternative model.
A well-known case of evolutionary adaptation is that of ribulose-1,5-bisphosphate carboxylase (RubisCO), the enzyme responsible for fixation of CO2 during photosynthesis. Although the majority of plants use the ancestral C3 photosynthetic pathway, many flowering plants have evolved a derived pathway named C4 photosynthesis. The latter concentrates CO2, and C4 RubisCOs consequently have lower specificity for, and faster turnover of, CO2. The C4 forms result from convergent evolution in multiple clades, with substitutions at a small number of sites under positive selection. To understand the physical constraints on these evolutionary changes, we reconstructed in silico ancestral sequences and 3D structures of RubisCO from a large group of related C3 and C4 species. We were able to precisely track their past evolutionary trajectories, identify mutations on each branch of the phylogeny, and evaluate their stability effect. We show that RubisCO evolution has been constrained by stability-activity tradeoffs similar in character to those previously identified in laboratory-based experiments. The C4 properties require a subset of several ancestral destabilizing mutations, which from their location in the structure are inferred to mainly be involved in enhancing conformational flexibility of the open-closed transition in the catalytic cycle. These mutations are near, but not in, the active site or at intersubunit interfaces. The C3 to C4 transition is preceded by a sustained period in which stability of the enzyme is increased, creating the capacity to accept the functionally necessary destabilizing mutations, and is immediately followed by compensatory mutations that restore global stability.
The adaptive diversification of organisms often requires the evolution of novel enzymatic properties. The evolutionary shift from one enzymatic function to another involves crossing an energetic barrier in a fitness landscape (1).
The number of mutations that confer advantageous function during such a shift is consequently limited. Some residues are critical for maintaining the stability of the protein fold, others are important for the catalytic activity itself. Due to the multiple roles of amino acids in proteins, the adaptation of one physical parameter of an enzyme is likely to affect other properties (2). As proteins usually form thermodynamically stable structures, their evolutionary trajectories are constrained to a narrow range of stability (3). Stability and activity are likely to be negatively correlated. Most possible amino acid changes in native proteins are destabilizing and, consequently, mutations that lead to a more favorable enzyme activity are likely to decrease the stability of the protein (2, 4). Compensatory mutations are then needed to restore global stability. These processes are referred to as stability-activity tradeoffs (5-7). Furthermore, proteins with higher stability confer greater evolvability, because there is more scope to accept destabilizing yet functionally beneficial changes (8). Wh...
Living organisms have evolved protein phosphorylation, a rapid and versatile mechanism that drives signaling and regulates protein function. We report the phosphoproteomes of 18 fungal species and a phylogenetic-based approach to study phosphosite evolution. We observe rapid divergence, with only a small fraction of phosphosites conserved over hundreds of millions of years. Relative to recently acquired phosphosites, ancient sites are enriched at protein interfaces and are more likely to be functionally important, as we show for sites on H2A1 and eIF4E. We also observe a change in phosphorylation motif frequencies and kinase activities that coincides with the whole-genome duplication event. Our results provide an evolutionary history for phosphosites and suggest that rapid evolution of phosphorylation can contribute strongly to phenotypic diversity.