1Eukaryotic cells represent an intricate collaboration between multiple genomes, even down to the 2 level of multisubunit complexes in mitochondria and plastids. One such complex in plants is the 3 caseinolytic protease (Clp), which plays an essential role in plastid protein turnover. The 4 proteolytic core of Clp comprises subunits from one plastid-encoded gene (clpP1) and multiple 5 nuclear genes. The clpP1 gene is highly conserved across most green plants, but it is by far the 6 fastest evolving plastid-encoded gene in some angiosperms. To better understand these extreme 7 and mysterious patterns of divergence, we investigated the history of clpP1 molecular evolution 8 across green plants by extracting sequences from 988 published plastid genomes. We find that 9 clpP1 has undergone remarkably frequent bouts of accelerated sequence evolution and 10 architectural changes (e.g., loss of introns and RNA-editing sites) within seed plants. Although 11 clpP1 is often assumed to be a pseudogene in such cases, multiple lines of evidence suggest that 12 this is rarely the case. We applied comparative native gel electrophoresis of chloroplast protein 13 complexes followed by protein mass spectrometry in two species within the angiosperm genus 14 Silene, which has highly elevated and heterogeneous rates of clpP1 evolution. We confirmed that 15 clpP1 is expressed as a stable protein and forms oligomeric complexes with the nuclear-encoded 16 Clp subunits, even in one of the most divergent Silene species. Additionally, there is a tight 17 correlation between amino-acid substitution rates in clpP1 and the nuclear-encoded Clp subunits 18 across a broad sampling of angiosperms, suggesting ongoing selection on interactions within this 19 complex.
21Rates of sequence evolution vary dramatically across genes and genomes. Understanding 22 the forces that determine such variation is one of the defining goals in the field of molecular 23 evolution. In seed plants, the plastid genome (plastome) generally evolves two-to six-fold more 24 slowly than the nuclear genome (Wolfe et al., 1987; Drouin et al., 2008; Smith and Keeling, 25 2015). However, among angiosperms, there is considerable heterogeneity in the rate of plastome 26 evolution. Many lineages have maintained a slowly-evolving plastome, while others have 27 experienced drastic rate increases (Jansen et al., 2007). For instance, among close relatives 28 within the tribe Sileneae there have been at least three recent and independent increases in 29 plastome evolutionary rate (Erixon and Oxelman, 2008; Sloan, Triant, Forrester, et al., 2014). 30 Similar accelerations have been documented in the Campanulaceae (Haberle et al., 2008; a structural level, plastome gene order has largely been conserved, with most angiosperms 34 retaining the structural organization that was present in the most recent common ancestor of this 35 group (Raubeson and Jansen, 2005). However, the sporadic increases in rates of plastome 36 sequence evolution have often been accompanied by structural ...