Chromosome-level genome assembly of the aquatic plant Nymphoides indica reveals transposable element bursts and NBS-LRR gene family expansion shedding light on its invasiveness
Abstract:Nymphoides indica, an aquatic plant, is an invasive species that causes both ecological and economic damage in North America and elsewhere. However, the lack of genomic data of N. indica limits the in-depth analysis of this invasive species. Here, we report a chromosome-level genome assembly of nine pseudochromosomes of N. indica with a total size of approximately 520 Mb. More than half of the N. indica genome consists of transposable elements (TEs), and a higher density of TEs around genes may play a signific… Show more
“…S7). This result is consistent with earlier work, which proposed that N. indica experienced a recent WGD (S. Yang et al, 2022). The Ks distribution for two gene pairs was c. 2.52 (Nin-hap117766 and NinKHZ2), and c. 3.1 (Ninhap113470 and NinBAS1), respectively (Fig.…”
Section: Supergene Evolution Via Stepwise Duplicationsupporting
confidence: 93%
“…S7). This result is consistent with earlier work, which proposed that N. indica experienced a recent WGD (S. Yang et al ., 2022). The Ks distribution for two gene pairs was c .…”
Section: Resultsmentioning
confidence: 99%
“…For the GWAS analyses, we collected young leaves from 13 and 15 individuals of the L‐ and S‐morphs, respectively, and conducted DNA extraction, library preparation, and whole‐genome resequencing following protocols detailed in Yang et al . (2022). We sequenced each sample to an expected coverage of 10×.…”
Section: Methodsmentioning
confidence: 99%
“…For the genome assembly of the S‐morph plant, we used H ifiasm v.0.16.0 (Cheng et al ., 2021) to assemble the PacBio HiFi reads into a haplotype‐resolved genome and a haploid genome. The Hi‐C data of the same individual, which was generated in our previous study (Yang et al ., 2022), were retrieved from CNCB (CNP0002767). We first mapped the clean Hi‐C reads to the haploid assembly to construct chromosome‐level assemblies using J uicer (Durand et al ., 2016b) and built primary scaffolds with the 3D‐DNA pipeline (Dudchenko et al ., 2017).…”
Section: Methodsmentioning
confidence: 99%
“…We predicted protein‐coding genes by combining de novo , homology‐based, and transcriptome‐based approaches according to methods detailed in our previous study of N. indica (Yang et al ., 2022). We mapped all consensus gene models to public databases, including Swiss‐Prot and NR, using B lastp (Camacho et al ., 2009) searches with an e‐value cutoff of 1e‐5 and aligned to the egg NOG database v.5.0 (Huerta‐Cepas et al ., 2019) using egg NOG‐mapper v.2 (Cantalapiedra et al ., 2021).…”
Summary
Distyly has evolved independently in numerous animal‐pollinated angiosperm lineages. Understanding of its molecular basis has been restricted to a few species, primarily Primula. Here, we investigate the genetic architecture of the single diallelic locus (S‐locus) supergene, a linkage group of functionally associated genes, and explore how it may have evolved in distylous Nymphoides indica, a lineage of flowering plants not previously investigated.
We assembled haplotype‐resolved genomes, used read‐coverage‐based genome‐wide association study (rb‐GWAS) to locate the S‐locus supergene, co‐expression network analysis to explore gene networks underpinning the development of distyly, and comparative genomic analyses to investigate the origins of the S‐locus supergene.
We identified three linked candidate S‐locus genes – NinBAS1, NinKHZ2, and NinS1 – that were only evident in the short‐styled morph and were hemizygous. Co‐expression network analysis suggested that brassinosteroids contribute to dimorphic sex organs in the short‐styled morph. Comparative genomic analyses indicated that the S‐locus supergene likely evolved via stepwise duplications and has been affected by transposable element activities.
Our study provides novel insight into the structure, regulation, and evolution of the supergene governing distyly in N. indica. It also provides high‐quality genomic resources for future research on the molecular mechanisms underlying the striking evolutionary convergence in form and function across heterostylous taxa.
“…S7). This result is consistent with earlier work, which proposed that N. indica experienced a recent WGD (S. Yang et al, 2022). The Ks distribution for two gene pairs was c. 2.52 (Nin-hap117766 and NinKHZ2), and c. 3.1 (Ninhap113470 and NinBAS1), respectively (Fig.…”
Section: Supergene Evolution Via Stepwise Duplicationsupporting
confidence: 93%
“…S7). This result is consistent with earlier work, which proposed that N. indica experienced a recent WGD (S. Yang et al ., 2022). The Ks distribution for two gene pairs was c .…”
Section: Resultsmentioning
confidence: 99%
“…For the GWAS analyses, we collected young leaves from 13 and 15 individuals of the L‐ and S‐morphs, respectively, and conducted DNA extraction, library preparation, and whole‐genome resequencing following protocols detailed in Yang et al . (2022). We sequenced each sample to an expected coverage of 10×.…”
Section: Methodsmentioning
confidence: 99%
“…For the genome assembly of the S‐morph plant, we used H ifiasm v.0.16.0 (Cheng et al ., 2021) to assemble the PacBio HiFi reads into a haplotype‐resolved genome and a haploid genome. The Hi‐C data of the same individual, which was generated in our previous study (Yang et al ., 2022), were retrieved from CNCB (CNP0002767). We first mapped the clean Hi‐C reads to the haploid assembly to construct chromosome‐level assemblies using J uicer (Durand et al ., 2016b) and built primary scaffolds with the 3D‐DNA pipeline (Dudchenko et al ., 2017).…”
Section: Methodsmentioning
confidence: 99%
“…We predicted protein‐coding genes by combining de novo , homology‐based, and transcriptome‐based approaches according to methods detailed in our previous study of N. indica (Yang et al ., 2022). We mapped all consensus gene models to public databases, including Swiss‐Prot and NR, using B lastp (Camacho et al ., 2009) searches with an e‐value cutoff of 1e‐5 and aligned to the egg NOG database v.5.0 (Huerta‐Cepas et al ., 2019) using egg NOG‐mapper v.2 (Cantalapiedra et al ., 2021).…”
Summary
Distyly has evolved independently in numerous animal‐pollinated angiosperm lineages. Understanding of its molecular basis has been restricted to a few species, primarily Primula. Here, we investigate the genetic architecture of the single diallelic locus (S‐locus) supergene, a linkage group of functionally associated genes, and explore how it may have evolved in distylous Nymphoides indica, a lineage of flowering plants not previously investigated.
We assembled haplotype‐resolved genomes, used read‐coverage‐based genome‐wide association study (rb‐GWAS) to locate the S‐locus supergene, co‐expression network analysis to explore gene networks underpinning the development of distyly, and comparative genomic analyses to investigate the origins of the S‐locus supergene.
We identified three linked candidate S‐locus genes – NinBAS1, NinKHZ2, and NinS1 – that were only evident in the short‐styled morph and were hemizygous. Co‐expression network analysis suggested that brassinosteroids contribute to dimorphic sex organs in the short‐styled morph. Comparative genomic analyses indicated that the S‐locus supergene likely evolved via stepwise duplications and has been affected by transposable element activities.
Our study provides novel insight into the structure, regulation, and evolution of the supergene governing distyly in N. indica. It also provides high‐quality genomic resources for future research on the molecular mechanisms underlying the striking evolutionary convergence in form and function across heterostylous taxa.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.