Angiosperms are the largest and most successful clade of land plants with >250,000 species distributed in nearly every terrestrial habitat. Many phylogenetic studies have been based on DNA sequences of one to several genes, but, despite decades of intensive efforts, relationships among early diverging lineages and several of the major clades remain either incompletely resolved or weakly supported. We performed phylogenetic analyses of 81 plastid genes in 64 sequenced genomes, including 13 new genomes, to estimate relationships among the major angiosperm clades, and the resulting trees are used to examine the evolution of gene and intron content. Phylogenetic trees from multiple methods, including model-based approaches, provide strong support for the position of Amborella as the earliest diverging lineage of flowering plants, followed by Nymphaeales and Austrobaileyales. The plastid genome trees also provide strong support for a sister relationship between eudicots and monocots, and this group is sister to a clade that includes Chloranthales and magnoliids. Resolution of relationships among the major clades of angiosperms provides the necessary framework for addressing numerous evolutionary questions regarding the rapid diversification of angiosperms. Gene and intron content are highly conserved among the early diverging angiosperms and basal eudicots, but 62 independent gene and intron losses are limited to the more derived monocot and eudicot clades. Moreover, a lineage-specific correlation was detected between rates of nucleotide substitutions, indels, and genomic rearrangements. angiosperm evolution ͉ molecular evolution A ngiosperms, the largest clade of land plants with Ͼ250,000 species, experienced rapid radiation soon after their first appearance in the fossil record (1). As a result, flowering plants exhibit incredible diversity in habit, morphology, anatomy, physiology, and reproductive biology. This variation has presented major challenges to biologists interested in the origin and evolution of these traits, and resolving these issues critically depends on having a well resolved and strongly supported phylogenetic framework. Over the past 20 years, numerous phylogenetic studies have used both morphological and molecular data to assess relationships among the major clades (reviewed in ref.2), resulting in a widely accepted classification of angiosperms with 45 orders and 457 families (3).For nearly two decades, most phylogenetic analyses of angiosperms have relied on DNA sequences of one to several genes from the plastid, mitochondrial, and nuclear genomes (reviewed in ref.2). Despite these intensive efforts there are still uncertainties regarding relationships among several major clades throughout angiosperms, including the earliest diverging lineages. Recent studies support the placement of Amborella sister to all remaining angiosperms, but support is often low. Amborella has also been placed with waterlilies (Nymphaeales) in a clade sister to other angiosperms (4-7). In many studies, resolution of r...
BackgroundSacred lotus is a basal eudicot with agricultural, medicinal, cultural and religious importance. It was domesticated in Asia about 7,000 years ago, and cultivated for its rhizomes and seeds as a food crop. It is particularly noted for its 1,300-year seed longevity and exceptional water repellency, known as the lotus effect. The latter property is due to the nanoscopic closely packed protuberances of its self-cleaning leaf surface, which have been adapted for the manufacture of a self-cleaning industrial paint, Lotusan.ResultsThe genome of the China Antique variety of the sacred lotus was sequenced with Illumina and 454 technologies, at respective depths of 101× and 5.2×. The final assembly has a contig N50 of 38.8 kbp and a scaffold N50 of 3.4 Mbp, and covers 86.5% of the estimated 929 Mbp total genome size. The genome notably lacks the paleo-triplication observed in other eudicots, but reveals a lineage-specific duplication. The genome has evidence of slow evolution, with a 30% slower nucleotide mutation rate than observed in grape. Comparisons of the available sequenced genomes suggest a minimum gene set for vascular plants of 4,223 genes. Strikingly, the sacred lotus has 16 COG2132 multi-copper oxidase family proteins with root-specific expression; these are involved in root meristem phosphate starvation, reflecting adaptation to limited nutrient availability in an aquatic environment.ConclusionsThe slow nucleotide substitution rate makes the sacred lotus a better resource than the current standard, grape, for reconstructing the pan-eudicot genome, and should therefore accelerate comparative analysis between eudicots and monocots.
During the past decade, there has been a rapid increase in our understanding of plastid genome organization and evolution due to the availability of many new completely sequenced genomes. There are 45 complete genomes published and ongoing projects are likely to increase this sampling to nearly 200 genomes during the next 5 years. Several groups of researchers including ours have been developing new techniques for gathering and analyzing entire plastid genome sequences and details of these developments are summarized in this chapter. The most important developments that enhance our ability to generate whole chloroplast genome sequences involve the generation of pure fractions of chloroplast genomes by whole genome amplification using rolling circle amplification, cloning genomes into Fosmid or bacterial artificial chromosome (BAC) vectors, and the development of an organellar annotation program (Dual Organellar GenoMe Annotator [DOGMA]). In addition to providing details of these methods, we provide an overview of methods for analyzing complete plastid genome sequences for repeats and gene content, as well as approaches for using gene order and sequence data for phylogeny reconstruction. This explosive increase in the number of sequenced plastid genomes and improved computational tools will provide many insights into the evolution of these genomes and much new data for assessing relationships at deep nodes in plants and other photosynthetic organisms.
Background: The number of completely sequenced plastid genomes available is growing rapidly. This array of sequences presents new opportunities to perform comparative analyses. In comparative studies, it is often useful to compare across wide phylogenetic spans and, within angiosperms, to include representatives from basally diverging lineages such as the genomes reported here: Nuphar advena (from a basal-most lineage) and Ranunculus macranthus (a basal eudicot). We report these two new plastid genome sequences and make comparisons (within angiosperms, seed plants, or all photosynthetic lineages) to evaluate features such as the status of ycf15 and ycf68 as protein coding genes, the distribution of simple sequence repeats (SSRs) and longer dispersed repeats (SDR), and patterns of nucleotide composition.
Inference of genetic clusters is a key aim of population genetics, sparking development of numerous analytical methods. Within these, there is a conceptual divide between finding de novo structure versus assessment of a priori groups. Recently developed, Discriminant Analysis of Principal Components (DAPC), combines discriminant analysis (DA) with principal component (PC) analysis. When applying DAPC, the groups used in the DA (specified a priori or described de novo) need to be carefully assessed. While DAPC has rapidly become a core technique, the sensitivity of the method to misspecification of groups and how it is being empirically applied, are unknown. To address this, we conducted a simulation study examining the influence of a priori versus de novo group designations, and a literature review of how DAPC is being applied. We found that with a priori groupings, distance between genetic clusters reflected underlying FST. However, when migration rates were high and groups were described de novo there was considerable inaccuracy, both in terms of the number of genetic clusters suggested and placement of individuals into those clusters. Nearly all (90.1%) of 224 studies surveyed used DAPC to find de novo clusters, and for the majority (62.5%) the stated goal matched the results. However, most studies (52.3%) omit key run parameters, preventing repeatability and transparency. Therefore, we present recommendations for standard reporting of parameters used in DAPC analyses. The influence of groupings in genetic clustering is not unique to DAPC, and researchers need to consider their goal and which methods will be most appropriate.
Populations delineated based on genetic data are commonly used for wildlife conservation and management. Many studies use the program structure combined with the ΔK method to identify the most probable number of populations (K). We recently found K = 2 was identified more often when studies used ΔK compared to studies that did not. We suggested two reasons for this: hierarchical population structure leads to underestimation, or the ΔK method does not evaluate K = 1 causing an overestimation. The present contribution aims to develop a better understanding of the limits of the method using one, two and three population simulations across migration scenarios. From these simulations we identified the “best K” using model likelihood and ΔK. Our findings show that mean probability plots and ΔK are unable to resolve the correct number of populations once migration rate exceeds 0.005. We also found a strong bias towards selecting K = 2 using the ΔK method. We used these data to identify the range of values where the ΔK statistic identifies a value of K that is not well supported. Finally, using the simulations and a review of empirical data, we found that the magnitude of ΔK corresponds to the level of divergence between populations. Based on our findings, we suggest researchers should use the ΔK method cautiously; they need to report all relevant data, including the magnitude of ΔK, and an estimate of connectivity for the research community to assess whether meaningful genetic structure exists within the context of management and conservation.
Expertise in the broader impacts of scientific research is an increasingly important aspect of professional development, particularly because federal grant proposals are commonly reviewed using both the Intellectual Merit and the Broader Impacts Criteria. Unfortunately, training in broader impacts, such as science communication and outreach, is not typically part of undergraduate or graduate curricula. We initiated one of the first graduate-level biology courses on broader impacts, focusing on giving graduate students firsthand, authentic experiences with grant writing, science communication, and educational outreach. Students in this interdisciplinary course learned from experts, wrote for a broad audience about their own research, and proposed and implemented outreach in collaboration with local organizations. We outline our approach, discuss outcomes from each activity, assess our impact, and finally consider how future programs might expand on this model.
Wildlife disease incidence is increasing, resulting in negative impacts on the economy, biodiversity, and potentially human health. Chronic wasting disease (CWD) is a fatal, transmissible spongiform encephalopathy of cervids (wild and captive) which continues to spread geographically resulting in exposure to potential new host species. The disease agent (PrP CWD ) is a misfolded conformer of the cellular prion protein (PrP C ). In Canada, the disease is endemic in Alberta and Saskatchewan, affecting mule and white-tail deer, with lesser impact on elk and moose. As the disease continues to expand, additional wild ungulate species including bison, bighorn sheep, mountain goat, and pronghorn antelope may be exposed. To better understand the speciesbarrier, we reviewed the current literature on taxa naturally or experimentally exposed to CWD to identify susceptible and resistant species. We created a phylogeny of these taxa using cytochrome B and found that CWD susceptibility followed the species phylogeny. Using this phylogeny we estimated the probability of CWD susceptibility for wild ungulate species. We then compared PrP C amino acid polymorphisms among these species to identify which sites segregated between susceptible and resistant species. We identified sites that were significantly associated with susceptibility, but they were not fully discriminating. Finally, we sequenced Prnp from 578 wild ungulates to further evaluate their potential susceptibility. Together, these data suggest the hostrange for CWD will potentially include pronghorn, mountain goat and bighorn sheep, but bison are likely to be more resistant. These findings highlight the need for monitoring potentially susceptible species as CWD continues to expand.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.