The domesticated sunflower, Helianthus annuus L., is a global oil crop that has promise for climate change adaptation, because it can maintain stable yields across a wide variety of environmental conditions, including drought 1 . Even greater resilience is achievable through the mining of resistance alleles from compatible wild sunflower relatives 2,3 , including numerous extremophile species 4 . Here we report a high-quality reference for the sunflower genome (3.6 gigabases), together with extensive transcriptomic data from vegetative and floral organs. The genome mostly consists of highly similar, related sequences 5 and required single-molecule realtime sequencing technologies for successful assembly. Genome analyses enabled the reconstruction of the evolutionary history of the Asterids, further establishing the existence of a whole-genome triplication at the base of the Asterids II clade 6 and a sunflowerspecific whole-genome duplication around 29 million years ago 7 . An integrative approach combining quantitative genetics, expression and diversity data permitted development of comprehensive gene networks for two major breeding traits, flowering time and oil metabolism, and revealed new candidate genes in these networks. We found that the genomic architecture of flowering time has been shaped by the most recent whole-genome duplication, which suggests that ancient paralogues can remain in the same regulatory networks for dozens of millions of years. This genome represents a cornerstone for future research programs aiming to exploit genetic diversity to improve biotic and abiotic stress resistance and oil production, while also considering agricultural constraints and human nutritional needs 8,9 .As the only major crop domesticated in North America, with its sunlike inflorescence that inspired artists, the sunflower is both a social icon and a major research focus for scientists. In evolutionary biology, the Helianthus genus is a long-time model for hybrid speciation and adaptive introgression 10 . In plant science, the sunflower is a model for understanding solar tracking 11 and inflorescence development 12 .Despite this large interest, assembling its genome has been extremely difficult as it mainly consists of long and highly similar repeats. This complexity has challenged leading-edge assembly protocols for close to a decade 13 .To finally overcome this challenge, we generated a 102× sequencing coverage of the genome of the inbred line XRQ using 407 singlemolecule real-time (SMRT) cells on the PacBio RS II platform. Production of 32 million very long reads allowed us to generate a genome assembly that captures 3 gigabases (Gb) (80% of the estimated genome size) in 13,957 sequence contigs. Four high-density genetic maps were combined with a sequence-based physical map to build the sequences of the 17 pseudo-chromosomes that anchor 97% of the gene content (Fig.
Self-incompatibility has been considered by geneticists a model system for reproductive biology and balancing selection, but our understanding of the genetic basis and evolution of this molecular lock-and-key system has remained limited by the extreme level of sequence divergence among haplotypes, resulting in a lack of appropriate genomic sequences. In this study, we report and analyze the full sequence of eleven distinct haplotypes of the self-incompatibility locus (S-locus) in two closely related Arabidopsis species, obtained from individual BAC libraries. We use this extensive dataset to highlight sharply contrasted patterns of molecular evolution of each of the two genes controlling self-incompatibility themselves, as well as of the genomic region surrounding them. We find strong collinearity of the flanking regions among haplotypes on each side of the S-locus together with high levels of sequence similarity. In contrast, the S-locus region itself shows spectacularly deep gene genealogies, high variability in size and gene organization, as well as complete absence of sequence similarity in intergenic sequences and striking accumulation of transposable elements. Of particular interest, we demonstrate that dominant and recessive S-haplotypes experience sharply contrasted patterns of molecular evolution. Indeed, dominant haplotypes exhibit larger size and a much higher density of transposable elements, being matched only by that in the centromere. Overall, these properties highlight that the S-locus presents many striking similarities with other regions involved in the determination of mating-types, such as sex chromosomes in animals or in plants, or the mating-type locus in fungi and green algae.
BackgroundThe Asteraceae represents an important plant family with respect to the numbers of species present in the wild and used by man. Nonetheless, genomic resources for Asteraceae species are relatively underdeveloped, hampering within species genetic studies as well as comparative genomics studies at the family level. So far, six BAC libraries have been described for the main crops of the family, i.e. lettuce and sunflower. Here we present the characterization of BAC libraries of chicory (Cichorium intybus L.) constructed from two genotypes differing in traits related to sexual and vegetative reproduction. Resolving the molecular mechanisms underlying traits controlling the reproductive system of chicory is a key determinant for hybrid development, and more generally will provide new insights into these traits, which are poorly investigated so far at the molecular level in Asteraceae.FindingsTwo bacterial artificial chromosome (BAC) libraries, CinS2S2 and CinS1S4, were constructed from HindIII-digested high molecular weight DNA of the contrasting genotypes C15 and C30.01, respectively. C15 was hermaphrodite, non-embryogenic, and S2S2 for the S-locus implicated in self-incompatibility, whereas C30.01 was male sterile, embryogenic, and S1S4. The CinS2S2 and CinS1S4 libraries contain 89,088 and 81,408 clones. Mean insert sizes of the CinS2S2 and CinS1S4 clones are 90 and 120 kb, respectively, and provide together a coverage of 12.3 haploid genome equivalents. Contamination with mitochondrial and chloroplast DNA sequences was evaluated with four mitochondrial and four chloroplast specific probes, and was estimated to be 0.024% and 1.00% for the CinS2S2 library, and 0.028% and 2.35% for the CinS1S4 library. Using two single copy genes putatively implicated in somatic embryogenesis, screening of both libraries resulted in detection of 12 and 13 positive clones for each gene, in accordance with expected numbers.ConclusionsThis indicated that both BAC libraries are valuable tools for molecular studies in chicory, one goal being the positional cloning of the S-locus in this Asteraceae species.
The origins and evolution of virulence in amphibian-infecting chytrids Batrachochytrium dendrobatidis ( Bd ) and Batrachochytrium salamandrivorans ( Bsal) are largely unknown. Here, we use deep nanopore sequencing of Bsal and comparative genomics against 21 high-quality genome assemblies that span the fungal Chytridiomycota. We discover that Bsal has the most repeat-rich genome of the Chytridiomycota, comprising 40.9% repetitive elements; this genome has expanded to more than 3× the length of its conspecific Bd , with autonomous and fully functional LTR/Gypsy elements contributing significantly to the expansion. The M36 metalloprotease virulence factors are highly expanded ( n = 177) in Bsal , most of which (53%) are flanked by transposable elements, suggesting they have a repeat-associated expansion. We find enrichment upstream of M36 metalloprotease genes of three novel repeat families belonging to the repeat superfamily of LINEs that are implicated with gene copy number variations. Additionally, Bsal has a highly compartmentalized genome architecture, with virulence factors enriched in gene-sparse/repeat-rich compartments, while core conserved genes are enriched in gene-rich/repeat-poor compartments. Genes upregulated during infection are primarily found in the gene-sparse/repeat-rich compartment in both Bd and Bsal . Furthermore, genes with signatures of positive selection in Bd are enriched in repeat-rich regions, suggesting these regions are a cradle for the evolution of chytrid pathogenicity. These are the hallmarks of two-speed genome evolution, and this study provides evidence of two-speed genomes in an animal pathogen, shedding light on the evolution of fungal pathogens of vertebrates driving global declines and extinctions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.