Potato (Solanum tuberosum L.) is the world’s most important non-cereal food crop, and the vast majority of commercially grown cultivars are highly heterozygous tetraploids. Advances in diploid hybrid breeding based on true seeds have the potential to revolutionize future potato breeding and production1–4. So far, relatively few studies have examined the genome evolution and diversity of wild and cultivated landrace potatoes, which limits the application of their diversity in potato breeding. Here we assemble 44 high-quality diploid potato genomes from 24 wild and 20 cultivated accessions that are representative of Solanum section Petota, the tuber-bearing clade, as well as 2 genomes from the neighbouring section, Etuberosum. Extensive discordance of phylogenomic relationships suggests the complexity of potato evolution. We find that the potato genome substantially expanded its repertoire of disease-resistance genes when compared with closely related seed-propagated solanaceous crops, indicative of the effect of tuber-based propagation strategies on the evolution of the potato genome. We discover a transcription factor that determines tuber identity and interacts with the mobile tuberization inductive signal SP6A. We also identify 561,433 high-confidence structural variants and construct a map of large inversions, which provides insights for improving inbred lines and precluding potential linkage drag, as exemplified by a 5.8-Mb inversion that is associated with carotenoid content in tubers. This study will accelerate hybrid potato breeding and enrich our understanding of the evolution and biology of potato as a global staple food crop.
Missing heritability in genome-wide association studies defines a major problem in genetic analyses of complex biological traits1,2. The solution to this problem is to identify all causal genetic variants and to measure their individual contributions3,4. Here we report a graph pangenome of tomato constructed by precisely cataloguing more than 19 million variants from 838 genomes, including 32 new reference-level genome assemblies. This graph pangenome was used for genome-wide association study analyses and heritability estimation of 20,323 gene-expression and metabolite traits. The average estimated trait heritability is 0.41 compared with 0.33 when using the single linear reference genome. This 24% increase in estimated heritability is largely due to resolving incomplete linkage disequilibrium through the inclusion of additional causal structural variants identified using the graph pangenome. Moreover, by resolving allelic and locus heterogeneity, structural variants improve the power to identify genetic factors underlying agronomically important traits leading to, for example, the identification of two new genes potentially contributing to soluble solid content. The newly identified structural variants will facilitate genetic improvement of tomato through both marker-assisted selection and genomic selection. Our study advances the understanding of the heritability of complex traits and demonstrates the power of the graph pangenome in crop breeding.
Pangenome graphs can represent all variation between multiple genomes, but existing methods for constructing them are biased due to reference-guided approaches. In response, we have developed PanGenome Graph Builder (PGGB), a reference-free pipeline for constructing unbiased pangenome graphs. PGGB uses all-to-all whole-genome alignments and learned graph embeddings to build and iteratively refine a model in which we can identify variation, measure conservation, detect recombination events, and infer phylogenetic relationships.
The maize (Zea mays ssp. mays) wild ancestor, teosinte, has three times the seed protein content of most modern inbreds and hybrids, but the mechanisms responsible for this trait are unknown. We created a contiguous haplotype DNA sequence of a teosinte, Zea mays ssp. Parviglumis, with Trio-Binning, and map-based cloned a major high-protein QTL, teosinte high protein 9 (Thp9) on chromosome 9. Thp9 encodes an asparagine synthetase 4 that is highly expressed in teosinte, but not in the B73 inbred, where a deletion in the 10 th intron of Thp9-B73 causes incorrect splicing of Thp9-B73 transcripts. Transgenic expression of Thp9-teosinte in B73 significantly increased seed protein content. Introgression of Thp9-teosinte into modern maize inbreds and hybrids greatly enhanced free amino acid accumulation, especially asparagine, throughout the plant, increasing seed protein content without affecting yield. Thp9-teosinte appears to increase nitrogen utilization efficiency, important for promoting a high yield under low nitrogen conditions.
SUMMARY Ramie (Boehmeria nivea) is an economically important natural fiber‐producing crop that has been cultivated for thousands of years in China; however, the evolution of this crop remains largely unknown. Here, we report a ramie domestication analysis based on genome assembly and resequencing of cultivated and wild accessions. Two chromosome‐level genomes representing wild and cultivated ramie were assembled de novo. Numerous structural variations between two assemblies, together with the genetic variations from population resequencing, constituted a comprehensive genomic variation map for ramie. Domestication analysis identified 71 high‐confidence selective sweeps comprising 320 predicted genes, and 29 genes from sweeps were associated with fiber growth in the expression. In addition, we identified seven genetic loci associated with the fiber yield trait in the segregated population derived from the crossing of two assembled accessions, and two of which showed an overlap with the selective sweeps. These findings indicated that bast fiber traits were focused on during the domestication history of ramie. This study sheds light on the domestication of ramie and provides a valuable resource for biological and breeding studies of this important crop.
Background Castor bean (Ricinus communis L.) is an important oil crop, which belongs to the Euphorbiaceae family. The seed oil of castor bean is currently the only commercial source of ricinoleic acid that can be used for producing about 2000 industrial products. However, it remains largely unknown regarding the origin, domestication, and the genetic basis of key traits of castor bean. Results Here we perform a de novo chromosome-level genome assembly of the wild progenitor of castor bean. By resequencing and analyzing 505 worldwide accessions, we reveal that the accessions from East Africa are the extant wild progenitors of castor bean, and the domestication occurs ~ 3200 years ago. We demonstrate that significant genetic differentiation between wild populations in Kenya and Ethiopia is associated with past climate fluctuation in the Turkana depression ~ 7000 years ago. This dramatic change in climate may have caused the genetic bottleneck in wild castor bean populations. By a genome-wide association study, combined with quantitative trait locus analysis, we identify important candidate genes associated with plant architecture and seed size. Conclusions This study provides novel insights of domestication and genome evolution of castor bean, which facilitates genomics-based breeding of this important oilseed crop and potentially other tree-like crops in future.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.