The domesticated sunflower, Helianthus annuus L., is a global oil crop that has promise for climate change adaptation, because it can maintain stable yields across a wide variety of environmental conditions, including drought 1 . Even greater resilience is achievable through the mining of resistance alleles from compatible wild sunflower relatives 2,3 , including numerous extremophile species 4 . Here we report a high-quality reference for the sunflower genome (3.6 gigabases), together with extensive transcriptomic data from vegetative and floral organs. The genome mostly consists of highly similar, related sequences 5 and required single-molecule realtime sequencing technologies for successful assembly. Genome analyses enabled the reconstruction of the evolutionary history of the Asterids, further establishing the existence of a whole-genome triplication at the base of the Asterids II clade 6 and a sunflowerspecific whole-genome duplication around 29 million years ago 7 . An integrative approach combining quantitative genetics, expression and diversity data permitted development of comprehensive gene networks for two major breeding traits, flowering time and oil metabolism, and revealed new candidate genes in these networks. We found that the genomic architecture of flowering time has been shaped by the most recent whole-genome duplication, which suggests that ancient paralogues can remain in the same regulatory networks for dozens of millions of years. This genome represents a cornerstone for future research programs aiming to exploit genetic diversity to improve biotic and abiotic stress resistance and oil production, while also considering agricultural constraints and human nutritional needs 8,9 .As the only major crop domesticated in North America, with its sunlike inflorescence that inspired artists, the sunflower is both a social icon and a major research focus for scientists. In evolutionary biology, the Helianthus genus is a long-time model for hybrid speciation and adaptive introgression 10 . In plant science, the sunflower is a model for understanding solar tracking 11 and inflorescence development 12 .Despite this large interest, assembling its genome has been extremely difficult as it mainly consists of long and highly similar repeats. This complexity has challenged leading-edge assembly protocols for close to a decade 13 .To finally overcome this challenge, we generated a 102× sequencing coverage of the genome of the inbred line XRQ using 407 singlemolecule real-time (SMRT) cells on the PacBio RS II platform. Production of 32 million very long reads allowed us to generate a genome assembly that captures 3 gigabases (Gb) (80% of the estimated genome size) in 13,957 sequence contigs. Four high-density genetic maps were combined with a sequence-based physical map to build the sequences of the 17 pseudo-chromosomes that anchor 97% of the gene content (Fig.
L-Sox5 and Sox6 are highly identical Sry-related transcription factors coexpressed in cartilage. Whereas Sox5 and Sox6 single null mice are born with mild skeletal abnormalities, Sox5; Sox6 double null fetuses die with a severe, generalized chondrodysplasia. In these double mutants, chondroblasts poorly differentiate. They express the genes for all essential cartilage extracellular matrix components at low or undetectable levels and initiate proliferation after a long delay. All cartilages are thus extracellular matrix deficient and remain rudimentary. While chondroblasts in the center of cartilages ultimately activate prehypertrophic chondrocyte markers, epiphyseal chondroblasts ectopically activate hypertrophic chondrocyte markers. Thick intramembranous bone collars develop, but the formation of cartilage growth plates and endochondral bones is disrupted. L-Sox5 and Sox6 are thus redundant, potent enhancers of chondroblast functions, thereby essential for endochondral skeleton formation.
The sunflower family, Asteraceae, comprises 10% of all flowering plant species and displays an incredible diversity of form. Asteraceae are clearly monophyletic, yet resolving phylogenetic relationships within the family has proven difficult, hindering our ability to understand its origin and diversification. Recent molecular clock dating has suggested a Cretaceous origin, but the lack of deep sampling of many genes and representative taxa from across the family has impeded the resolution of migration routes and diversifications that led to its global distribution and tremendous diversity. Here we use genomic data from 256 terminals to estimate evolutionary relationships, timing of diversification(s), and biogeographic patterns. Our study places the origin of Asteraceae at ∼83 MYA in the late Cretaceous and reveals that the family underwent a series of explosive radiations during the Eocene which were accompanied by accelerations in diversification rates. The lineages that gave rise to nearly 95% of extant species originated and began diversifying during the middle Eocene, coincident with the ensuing marked cooling during this period. Phylogenetic and biogeographic analyses support a South American origin of the family with subsequent dispersals into North America and then to Asia and Africa, later followed by multiple worldwide dispersals in many directions. The rapid mid-Eocene diversification is aligned with the biogeographic range shift to Africa where many of the modern-day tribes appear to have originated. Our robust phylogeny provides a framework for future studies aimed at understanding the role of the macroevolutionary patterns and processes that generated the enormous species diversity of Asteraceae.
• Premise of the study: The Compositae (Asteraceae) are a large and diverse family of plants, and the most comprehensive phylogeny to date is a meta-tree based on 10 chloroplast loci that has several major unresolved nodes. We describe the development of an approach that enables the rapid sequencing of large numbers of orthologous nuclear loci to facilitate efficient phylogenomic analyses.• Methods and Results: We designed a set of sequence capture probes that target conserved orthologous sequences in the Compositae. We also developed a bioinformatic and phylogenetic workflow for processing and analyzing the resulting data. Application of our approach to 15 species from across the Compositae resulted in the production of phylogenetically informative sequence data from 763 loci and the successful reconstruction of known phylogenetic relationships across the family.• Conclusions: These methods should be of great use to members of the broader Compositae community, and the general approach should also be of use to researchers studying other families.
While hybridization has recently received a resurgence of attention from systematists and evolutionary biologists, there remains a dearth of case studies on ancient, diversified hybrid lineages-clades of organisms that originated through reticulation. Studies on these groups are valuable in that they would speak to the long-term phylogenetic success of lineages following gene flow between species. We present a phylogenomic view of Heuchera, long known for frequent hybridization, incorporating all three independent genomes: targeted nuclear (~400,000 bp), plastid (~160,000 bp), and mitochondrial (~470,000 bp) data. We analyze these data using multiple concatenation and coalescence strategies. The nuclear phylogeny is consistent with previous work and with morphology, confidently suggesting a monophyletic Heuchera. By contrast, analyses of both organellar genomes recover a grossly polyphyletic Heuchera,consisting of three primary clades with relationships extensively rearranged within these as well. A minority of nuclear loci also exhibit phylogenetic discord; yet these topologies remarkably never resemble the pattern of organellar loci and largely present low levels of discord inter alia. Two independent estimates of the coalescent branch length of the ancestor of Heuchera using nuclear data suggest rare or nonexistent incomplete lineage sorting with related clades, inconsistent with the observed gross polyphyly of organellar genomes (confirmed by simulation of gene trees under the coalescent). These observations, in combination with previous work, strongly suggest hybridization as the cause of this phylogenetic discord. [Ancient hybridization; chloroplast capture; incongruence; phylogenomics; reticulation.].
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.