Mitochondria and plastids power complex life, and retain their own organelle DNA (oDNA) genomes, with highly reduced gene contents compared to their endosymbiont ancestors. Why some protein-coding genes are retained in oDNA and some lost remains a debated question. Here we harness over 15k oDNA sequences and over 300 whole genome sequences with tools from structural biology, bioinformatics, machine learning, and Bayesian model selection to reveal the properties of genes, and associated underlying mechanisms, that shape oDNA evolution. Striking symmetry exists between the two organelle types: gene retention patterns in both are predicted by the hydrophobicity of a protein product and its energetic centrality within its protein complex, with additional influences of nucleic acid and amino acid biochemistry. Remarkably, retention principles from one organelle type successfully and quantitatively predict retention in the other, supporting this universality; these principles also distinguish gene profiles in independent endosymbiotic relationships. The identification of these features shaping organelle gene retention both provides quantitative support for several existing evolutionary hypotheses, and suggests new biochemical and biophysical mechanisms influencing organelle genome evolution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.