Cultivated strawberry emerged from the hybridization of two wild octoploid species, both descended from the merger of four diploid progenitor species into a single nucleus more than 1 million years ago. Here we report a near-complete chromosome-scale assembly for cultivated octoploid strawberry (Fragaria × ananassa) and uncover the origin and evolutionary processes that shaped this complex allopolyploid. We identified the extant relatives of each diploid progenitor species and provide support for the North American origin of octoploid strawberry. We examined the dynamics among the four subgenomes in octoploid strawberry and uncovered the presence of a single dominant subgenome with significantly greater gene content, gene expression abundance, and biased exchanges between homoeologous chromosomes, as compared with the other subgenomes. Pathway analysis showed that certain metabolomic and disease-resistance traits are largely controlled by the dominant subgenome. These findings and the reference genome should serve as a powerful platform for future evolutionary studies and enable molecular breeding in strawberry.
Long terminal repeat retrotransposons (LTR-RTs) are prevalent in plant genomes. The identification of LTR-RTs is critical for achieving high-quality gene annotation. Based on the well-conserved structure, multiple programs were developed for the de novo identification of LTR-RTs; however, these programs are associated with low specificity and high false discovery rates. Here, we report LTR_retriever, a multithreading-empowered Perl program that identifies LTR-RTs and generates high-quality LTR libraries from genomic sequences. LTR_retriever demonstrated significant improvements by achieving high levels of sensitivity (91%), specificity (97%), accuracy (96%), and precision (90%) in rice (Oryza sativa). LTR_retriever is also compatible with long sequencing reads. With 40k self-corrected PacBio reads equivalent to 4.5× genome coverage in Arabidopsis (Arabidopsis thaliana), the constructed LTR library showed excellent sensitivity and specificity. In addition to canonical LTR-RTs with 5′-TG…CA-3′ termini, LTR_retriever also identifies noncanonical LTR-RTs (non-TGCA), which have been largely ignored in genome-wide studies. We identified seven types of noncanonical LTRs from 42 out of 50 plant genomes. The majority of noncanonical LTRs are Copia elements whose LTRs are four times shorter than those of other Copia elements, which may be a result of their target specificity. Strikingly, non-TGCA Copia elements are often located in genic regions and preferentially insert near or within genes, indicating their impact on the evolution of genes and their potential as mutagenesis tools.
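The distinction between canonical (TG…CA) and noncanonical (non-TGCA) termini can be illustrated with a minimal sketch. This is not how LTR_retriever itself classifies elements; the function name `has_canonical_termini` is hypothetical, and the check assumes it is given the sequence of a single LTR read 5′ to 3′ on the element's own strand.

```python
def has_canonical_termini(ltr_seq: str) -> bool:
    """Return True if the sequence starts with TG and ends with CA.

    Hypothetical helper for illustration only: a real classifier would
    also handle strand orientation, ambiguous bases, and boundary errors.
    """
    seq = ltr_seq.strip().upper()
    # A sequence shorter than 4 bases cannot carry both dinucleotides.
    if len(seq) < 4:
        return False
    return seq.startswith("TG") and seq.endswith("CA")


if __name__ == "__main__":
    # Toy sequences, not real LTRs.
    print(has_canonical_termini("TGTTAGGCCATACA"))  # canonical termini
    print(has_canonical_termini("TGTTAGGCCATACT"))  # non-TGCA termini
```

Anything that fails this check would be a candidate for the non-TGCA category described above, subject to the caveats in the docstring.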
Assembling a plant genome is challenging due to the abundance of repetitive sequences, yet no standard is available to evaluate the assembly of repeat space. LTR retrotransposons (LTR-RTs) are the predominant interspersed repeat that is poorly assembled in draft genomes. Here, we propose a reference-free genome metric called LTR Assembly Index (LAI) that evaluates assembly continuity using LTR-RTs. After correcting for LTR-RT amplification dynamics, we show that LAI is independent of genome size, genomic LTR-RT content, and gene space evaluation metrics (i.e., BUSCO and CEGMA). By comparing genomic sequences produced by various sequencing techniques, we reveal the significant gain of assembly continuity by using long-read-based techniques over short-read-based methods. Moreover, LAI can facilitate iterative assembly improvement with assembler selection and identify low-quality genomic regions. To apply LAI, intact LTR-RTs and total LTR-RTs should contribute at least 0.1% and 5% to the genome size, respectively. The LAI program is freely available on GitHub: https://github.com/oushujun/LTR_retriever.
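The applicability thresholds stated above (intact LTR-RTs at least 0.1% and total LTR-RTs at least 5% of genome size) can be expressed as a simple pre-check. The sketch below is not part of the LAI program; the function name `lai_applicable` and its parameters are hypothetical, and it assumes all three inputs are measured in base pairs.

```python
def lai_applicable(
    genome_size_bp: int,
    intact_ltr_bp: int,
    total_ltr_bp: int,
    min_intact_pct: float = 0.1,
    min_total_pct: float = 5.0,
) -> bool:
    """Check whether an assembly meets the stated thresholds for using LAI.

    Hypothetical helper: inputs are assumed to be base-pair totals taken
    from an existing LTR-RT annotation of the assembly.
    """
    if genome_size_bp <= 0:
        raise ValueError("genome_size_bp must be positive")
    intact_pct = 100.0 * intact_ltr_bp / genome_size_bp
    total_pct = 100.0 * total_ltr_bp / genome_size_bp
    return intact_pct >= min_intact_pct and total_pct >= min_total_pct


if __name__ == "__main__":
    # Toy numbers: 0.2% intact and 6% total LTR-RT content.
    print(lai_applicable(1_000_000, 2_000, 60_000))
    # Toy numbers: only 0.05% intact LTR-RT content.
    print(lai_applicable(1_000_000, 500, 60_000))
```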
Background: Sequencing technology and assembly algorithms have matured to the point that high-quality de novo assembly is possible for large, repetitive genomes. Current assemblies traverse transposable elements (TEs) and provide an opportunity for comprehensive annotation of TEs. Numerous methods exist for annotation of each class of TEs, but their relative performances have not been systematically compared. Moreover, a comprehensive pipeline is needed to produce a non-redundant library of TEs for species lacking this resource to generate whole-genome TE annotations. Results: We benchmark existing programs based on a carefully curated library of rice TEs. We evaluate the performance of methods annotating long terminal repeat (LTR) retrotransposons, terminal inverted repeat (TIR) transposons, short TIR transposons known as miniature inverted transposable elements (MITEs), and Helitrons. Performance metrics include sensitivity, specificity, accuracy, precision, FDR, and F1. Using the most robust programs, we create a comprehensive pipeline called Extensive de-novo TE Annotator (EDTA) that produces a filtered non-redundant TE library for annotation of structurally intact and fragmented elements. EDTA also deconvolutes nested TE insertions frequently found in highly repetitive genomic regions. Using other model species with curated TE libraries (maize and Drosophila), EDTA is shown to be robust across both plant and animal species. Conclusions: The benchmarking results and pipeline developed here will greatly facilitate TE annotation in eukaryotic genomes. These annotations will promote a much more in-depth understanding of the diversity and evolution of TEs at both intra- and inter-species levels. EDTA is open-source and freely available: https://github.com/oushujun/EDTA.
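The six performance metrics named above follow from a confusion matrix of true/false positives and negatives. The sketch below uses the standard textbook definitions; it is not taken from the EDTA code base, the function name `annotation_metrics` is hypothetical, and treating the counts as annotated base pairs compared against a curated library is an assumption.

```python
import math


def _ratio(numerator: float, denominator: float) -> float:
    """Divide safely, returning NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def annotation_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Compute standard classification metrics from confusion counts.

    Hypothetical helper: counts are assumed to be base pairs (or any other
    consistent unit) labeled against a curated reference annotation.
    """
    return {
        "sensitivity": _ratio(tp, tp + fn),          # true positive rate
        "specificity": _ratio(tn, tn + fp),          # true negative rate
        "accuracy": _ratio(tp + tn, tp + fp + tn + fn),
        "precision": _ratio(tp, tp + fp),
        "fdr": _ratio(fp, tp + fp),                  # equals 1 - precision
        "f1": _ratio(2 * tp, 2 * tp + fp + fn),      # harmonic mean of precision and sensitivity
    }


if __name__ == "__main__":
    # Toy counts for illustration only, not benchmark results.
    for name, value in annotation_metrics(tp=80, fp=20, tn=880, fn=20).items():
        print(f"{name}: {value:.3f}")
```

Reporting FDR alongside precision is redundant in a strict sense, since the two sum to one, but listing both matches the metrics named in the abstract.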
We report de novo genome assemblies, transcriptomes, annotations, and methylomes for the 26 inbreds that serve as the founders for the maize nested association mapping population. The number of pan-genes in these diverse genomes exceeds 103,000, with approximately a third found across all genotypes. The results demonstrate that the ancient tetraploid character of maize continues to degrade by fractionation to the present day. Excellent contiguity over repeat arrays and complete annotation of centromeres revealed additional variation in major cytological landmarks. We show that combining structural variation with single-nucleotide polymorphisms can improve the power of quantitative mapping studies. We also document variation at the level of DNA methylation and demonstrate that unmethylated regions are enriched for cis-regulatory elements that contribute to phenotypic variation.
It has long been established that premature leaf senescence negatively impacts the yield stability of rice, but the underlying molecular mechanism driving this relationship remains largely unknown. Here, we identified a dominant premature leaf senescence mutant, prematurely senile 1 (ps1-D). PS1 encodes a plant-specific NAC (no apical meristem, Arabidopsis ATAF1/2, and cup-shaped cotyledon2) transcriptional activator, Oryza sativa NAC-like, activated by apetala3/pistillata (OsNAP). Overexpression of OsNAP significantly promoted senescence, whereas knockdown of OsNAP produced a marked delay of senescence, confirming the role of this gene in the development of rice senescence. OsNAP expression was tightly linked with the onset of leaf senescence in an age-dependent manner. Similarly, ChIP-PCR and yeast one-hybrid assays demonstrated that OsNAP positively regulates leaf senescence by directly targeting genes related to chlorophyll degradation and nutrient transport and other genes associated with senescence, suggesting that OsNAP is an ideal marker of senescence onset in rice. Further analysis determined that OsNAP is induced specifically by abscisic acid (ABA), whereas its expression is repressed in both aba1 and aba2, two ABA biosynthetic mutants. Moreover, ABA content is reduced significantly in ps1-D mutants, indicating a feedback repression of OsNAP on ABA biosynthesis. Our data suggest that OsNAP serves as an important link between ABA and leaf senescence. Additionally, reduced OsNAP expression leads to delayed leaf senescence and an extended grain-filling period, resulting in a 6.3% and 10.3% increase in the grain yield of two independent representative RNAi lines, respectively.
Thus, fine-tuning OsNAP expression should be a useful strategy for improving rice yield in the future.
hormones | nutrition remobilization | programmed cell death
Leaf senescence is an integral part of the final stages of plant development and is controlled by a fine-tuned, complex regulatory network (1). During senescence, leaf cells undergo dramatic changes in cellular metabolism, structure, and gene expression (2, 3). The most striking feature of these changes is the yellowing of the leaves caused by the breakdown of chlorophyll during chloroplast degeneration, followed by the hydrolysis of macromolecules such as lipids, proteins, and nucleic acids, which, in turn, results in mitochondria and nuclei dissociation and cell death (4, 5). This process facilitates both hydrolysis and the recycling of nutrients from source to sink tissues to increase reproductive success (6). Thus, senescence is not a passive process but rather is a developmentally programmed procedure that has a strong adaptive advantage (7, 8). Although leaf senescence is controlled primarily by developmental age, the onset and progression of this process also is influenced by a number of endogenous and external factors (1, 9, 10). For example, abscisic acid (ABA) is thought to be one of the phytohormones that promote leaf senescence (11, 12). Specifically, both an upregu...
Asian cultivated rice (Oryza sativa L.) consists of two main subspecies, indica and japonica. Indica has higher nitrate-absorption activity than japonica, but the molecular mechanisms underlying that activity remain elusive. Here we show that variation in a nitrate-transporter gene, NRT1.1B (OsNPF6.5), may contribute to this divergence in nitrate use. Phylogenetic analysis revealed that NRT1.1B diverges between indica and japonica. NRT1.1B-indica variation was associated with enhanced nitrate uptake and root-to-shoot transport and upregulated expression of nitrate-responsive genes. The selection signature of NRT1.1B-indica suggests that nitrate-use divergence occurred during rice domestication. Notably, field tests with near-isogenic and transgenic lines confirmed that the japonica variety carrying the NRT1.1B-indica allele had significantly improved grain yield and nitrogen-use efficiency (NUE) compared to the variety without that allele. Our results show that variation in NRT1.1B largely explains nitrate-use divergence between indica and japonica and that NRT1.1B-indica can potentially improve the NUE of japonica.