The concept of a last universal common ancestor of all cells (LUCA, or the progenote) is central to the study of early evolution and life's origin, yet information about how and where LUCA lived is lacking. We investigated all clusters and phylogenetic trees for 6.1 million protein coding genes from sequenced prokaryotic genomes in order to reconstruct the microbial ecology of LUCA. Among 286,514 protein clusters, we identified 355 protein families (∼0.1%) that trace to LUCA by phylogenetic criteria. Because these proteins are not universally distributed, they can shed light on LUCA's physiology. Their functions, properties and prosthetic groups depict LUCA as anaerobic, CO2-fixing, H2-dependent with a Wood-Ljungdahl pathway, N2-fixing and thermophilic. LUCA's biochemistry was replete with FeS clusters and radical reaction mechanisms. Its cofactors reveal dependence upon transition metals, flavins, S-adenosyl methionine, coenzyme A, ferredoxin, molybdopterin, corrins and selenium. Its genetic code required nucleoside modifications and S-adenosyl methionine-dependent methylations. The 355 phylogenies identify clostridia and methanogens, whose modern lifestyles resemble that of LUCA, as basal among their respective domains. LUCA inhabited a geochemically active environment rich in H2, CO2 and iron. The data support the theory of an autotrophic origin of life involving the Wood-Ljungdahl pathway in a hydrothermal setting.
All known life forms trace back to a last universal common ancestor (LUCA) that witnessed the onset of Darwinian evolution. One can ask questions about LUCA in various ways, the most common way being to look for traits that are common to all cells, like ribosomes or the genetic code. With the availability of genomes, we can, however, also ask what genes are ancient by virtue of their phylogeny rather than by virtue of being universal. That approach, undertaken recently, leads to a different view of LUCA than we have had in the past, one that fits well with the harsh geochemical setting of early Earth and resembles the biology of prokaryotes that today inhabit the Earth's crust.
Genomes record their own history. But if we want to look all the way back to life's beginnings some 4 billion years ago, the record of microbial evolution that is preserved in prokaryotic genomes is not easy to read. Microbiology has a lot in common with geology in that regard. Geologists know that plate tectonics and erosion have erased much of the geological record, with ancient rocks being truly rare. The same is true of microbes. Lateral gene transfer (LGT) and sequence divergence have erased much of the evolutionary record that was once written in genomes, and it is not obvious which genes among sequenced genomes are genuinely ancient. Which genes trace to the last universal ancestor, LUCA? The classical approach has been to look for genes that are universally distributed. Another approach is to make all trees for all genes, and sift out the trees where signals have been overwritten by LGT. What is left ought to be ancient. If we do that, what do we find?
The advent of environmental O about 2.5 billion years ago forced microbes to metabolically adapt and to develop mechanisms for O sensing. Sensing of O by [4Fe-4S] to [2Fe-2S] cluster conversion represents an ancient mechanism that is used by FNR (Escherichia coli), FNR (Bacillus subtilis), NreB (Staphylococcus aureus) and WhiB3 (Mycobacterium tuberculosis). The phylogenetic relationship of these sensors was investigated. FNR homologues are restricted to the proteobacteria and a few representatives from other phyla. Homologues of FNR and NreB are located within the bacilli, of WhiB3 within the actinobacteria. Archaea contain no homologues. The data reveal no similarity between the FNR , FNR , NreB and WhiB3 sensor families on the sequence and structural levels. These O sensor families arose independently in phyla that were already present at the time O appeared, their members were subsequently distributed by lateral gene transfer. The chemistry of [4Fe-4S] and [2Fe-2S] cluster formation and interconversion appears to be shared by the sensor protein families. The type of signal output is, however, family specific. The homologues of FNR and NreB vary with regard to the number of Cys residues that coordinate the cluster. It is suggested that the variants derive from lateral gene transfer and gained other functions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.