To understand the variation in genomic patterning of DNA methylation we compared methylomes of 34 diverse angiosperm species. By analyzing whole-genome bisulfite sequencing data in a phylogenetic context it becomes clear that there is extensive variation throughout angiosperms in gene body DNA methylation, euchromatic silencing of transposons and repeats, as well as silencing of heterochromatic transposons. The Brassicaceae have reduced CHG methylation levels and also reduced or loss of CG gene body methylation. The Poaceae are characterized by a lack or reduction of heterochromatic CHH methylation and enrichment of CHH methylation in genic regions. Reduced CHH methylation levels are found in clonally propagated species, suggesting that these methods of propagation may alter the epigenomic landscape over time. These results show that DNA methylation patterns are broadly a reflection of the evolutionary and life histories of plant species.
Read mapping is a fundamental part of next-generation genomic research but is complicated by genome duplication in many plants. Categorizing DNA sequence reads into their respective genomes enables current methods to analyze polyploid genomes as if they were diploid. We present PolyCat—a pipeline for mapping and categorizing all types of next-generation sequence data produced from allopolyploid organisms. PolyCat uses GSNAP’s single-nucleotide polymorphism (SNP)-tolerant mapping to minimize the mapping efficiency bias caused by SNPs between genomes. PolyCat then uses SNPs between genomes to categorize reads according to their respective genomes. Bisulfite-treated reads have a significant reduction in nucleotide complexity because nucleotide conversion events are confounded with transition substitutions. PolyCat includes special provisions to properly handle bisulfite-treated data. We demonstrate the functionality of PolyCat on allotetraploid cotton, Gossypium hirsutum, and create a functional SNP index for efficiently mapping sequence reads to the D-genome sequence of G. raimondii. PolyCat is appropriate for all allopolyploids and all types of next-generation genome analysis, including differential expression (RNA sequencing), differential methylation (bisulfite sequencing), differential DNA-protein binding (chromatin immunoprecipitation sequencing), and population diversity.
The formation of allopolyploid cotton precipitated a rapid diversification and colonization of dry coastal American tropical and subtropical regions. Previous phylogenetic analyses, combined with molecular divergence analyses, have offered a temporal framework for this radiation, but provide only weak support for some of the resolved branches. Moreover, these earlier analyses did not include the recently recognized sixth polyploid species, G. ekmanianum Wittmack. Here we use targeted sequence capture of multiple loci in conjunction with both concatenated and Bayesian concordance analyses to reevaluate the phylogeny of allopolyploid cotton species. Although phylogenetic resolution afforded by individual genes is often low, sufficient signal was attained both through the concatenated and concordance analyses to provide robust support for the Gossypium polyploid clade, which is reported here.
• Premise of the study: Hybridization has played an important role in the evolution and ecological adaptation of diploid and polyploid plants. Artemisia tridentata (Asteraceae) tetraploids are extremely widespread and of great ecological importance. These tetraploids are often taxonomically identified as A. tridentata subsp. wyomingensis or as autotetraploids of diploid subspecies tridentata and vaseyana. Few details are available as to how these tetraploids are formed or how they are related to diploid subspecies.• Methods: We used amplicon sequencing to assess phylogenetic relationships among three recognized subspecies: tridentata, vaseyana, and wyomingensis. DNA sequence data from putative genes were pyrosequenced and assembled from 329 samples. Nucleotide diversity and putative haplotypes were estimated from the high‐read coverage. Phylogenies were constructed from Bayesian coalescence and neighbor‐net network analyses.• Key results: Analyses support distinct diploid subspecies of tridentata and vaseyana in spite of known hybridization in ecotones. Nucleotide diversity estimates of populations compared to the total diversity indicate the relationships are predominately driven by a small proportion of the amplicons. Tetraploids, including subspecies wyomingensis, are polyphyletic occurring within and between diploid subspecies groups.• Conclusions: Artemisia tridentata is a species comprising phylogenetically distinct diploid progenitors and a tetraploid complex with varying degrees of phylogenetic and morphological affinities to the diploid subspecies. These analyses suggest tetraploids are formed locally or regionally from diploid tridentata and vaseyana populations via autotetraploidy, followed by introgression between tetraploid groups. Understanding the phylogenetic vs. ecological relationships of A. tridentata subspecies will have bearing on how to restore these desert ecosystems.
Understanding the composition, evolution, and function of the Gossypium hirsutum (cotton) genome is complicated by the joint presence of two genomes in its nucleus (AT and DT genomes). These two genomes were derived from progenitor A-genome and D-genome diploids involved in ancestral allopolyploidization. To better understand the allopolyploid genome, we re-sequenced the genomes of extant diploid relatives that contain the A1 (Gossypium herbaceum), A2 (Gossypium arboreum), or D5 (Gossypium raimondii) genomes. We conducted a comparative analysis using deep re-sequencing of multiple accessions of each diploid species and identified 24 million SNPs between the A-diploid and D-diploid genomes. These analyses facilitated the construction of a robust index of conserved SNPs between the A-genomes and D-genomes at all detected polymorphic loci. This index is widely applicable for read mapping efforts of other diploid and allopolyploid Gossypium accessions. Further analysis also revealed locations of putative duplications and deletions in the A-genome relative to the D-genome reference sequence. The approximately 25,400 deleted regions included more than 50% deletion of 978 genes, including many involved with starch synthesis. In the polyploid genome, we also detected 1,472 conversion events between homoeologous chromosomes, including events that overlapped 113 genes. Continued characterization of the Gossypium genomes will further enhance our ability to manipulate fiber and agronomic production of cotton.
Amaranth (Amaranthus hypochondriacus L.) is an emerging pseudocereal native to the New World that has garnered increased attention in recent years because of its nutritional quality, in particular its seed protein and more specifically its high levels of the essential amino acid lysine. It belongs to the Amaranthaceae family, is an ancient paleopolyploid that shows disomic inheritance (2n = 32), and has an estimated genome size of 466 Mb. Here we present a high-quality draft genome sequence of the grain amaranth. The genome assembly consisted of 377 Mb in 3518 scaffolds with an N 50 of 371 kb. Repetitive element analysis predicted that 48% of the genome is comprised of repeat sequences, of which Copia-like elements were the most commonly classified retrotransposon. A de novo transcriptome consisting of 66,370 contigs was assembled from eight different amaranth tissue and abiotic stress libraries. Annotation of the genome identified 23,059 protein-coding genes. Seven grain amaranths (A. hypochondriacus, A. caudatus, and A. cruentus) and their putative progenitor (A. hybridus) were resequenced. A single nucleotide polymorphism (SNP) phylogeny supported the classification of A. hybridus as the progenitor species of the grain amaranths. Lastly, we generated a de novo physical map for A. hypochondriacus using the BioNano Genomics' Genome Mapping platform. The physical map spanned 340 Mb and a hybrid assembly using the BioNano physical maps nearly doubled the N 50 of the assembly to 697 kb. Moreover, we analyzed synteny between amaranth and sugar beet (Beta vulgaris L.) and estimated, using K s analysis, the age of the most recent polyploidization event in amaranth.
Whole genome duplication (WGD) is widespread in flowering plants and is a driving force in angiosperm diversification. The redundancy introduced by WGD allows the evolution of novel gene interactions and functions, although the patterns and processes of diversification are poorly understood. We identified ∼2,000 pairs of paralogous genes in Gossypium raimondii (cotton) resulting from an approximately 60 My old 5- to 6-fold ploidy increase. Gene expression analyses revealed that, in G. raimondii, 99.4% of the gene pairs exhibit differential expression in at least one of the three tissues (petal, leaf, and seed), with 93% to 94% exhibiting differential expression on a per-tissue basis. For 1,666 (85%) pairs, differential expression was observed in all tissues. These observations were mirrored in a time series of G. raimondii seed, and separately in leaf, petal, and seed of G. arboreum, indicating expression level diversification before species divergence. A generalized linear model revealed 92.4% of the paralog pairs exhibited expression divergence, with most exhibiting significant gene and tissue interactions indicating complementary expression patterns in different tissues. These data indicate massive, near-complete expression level neo- and/or subfunctionalization among ancient gene duplicates, suggesting these processes are essential in their maintenance over ∼60 Ma.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.