We report de novo genome assemblies, transcriptomes, annotations, and methylomes for the 26 inbreds that serve as the founders for the maize nested association mapping population. The number of pan-genes in these diverse genomes exceeds 103,000, with approximately a third found across all genotypes. The results demonstrate that the ancient tetraploid character of maize continues to degrade by fractionation to the present day. Excellent contiguity over repeat arrays and complete annotation of centromeres revealed additional variation in major cytological landmarks. We show that combining structural variation with single-nucleotide polymorphisms can improve the power of quantitative mapping studies. We also document variation at the level of DNA methylation and demonstrate that unmethylated regions are enriched for cis-regulatory elements that contribute to phenotypic variation.
Crop genomics has seen dramatic advances in recent years due to improvements in sequencing technology, assembly methods, and computational resources. These advances have led to the development of new tools to facilitate crop improvement. The study of structural variation within species and the characterization of the pan-genome has revealed extensive genome content variation among individuals within a species that is paradigm shifting to crop genomics and improvement. Here, we review advances in crop genomics and how utilization of these tools is shifting in light of pan-genomes that are becoming available for many crop species.
We report de novo genome assemblies, transcriptomes, annotations, and methylomes for the 26 inbreds that serve as the founders for the maize nested association mapping population. The data indicate that the number of pan-genes exceeds 103,000 and that the ancient tetraploid character of maize continues to degrade by fractionation to the present day. Excellent contiguity over repeat arrays and complete annotation of centromeres further reveal the locations and internal structures of major cytological landmarks. We show that combining structural variation with SNPs can improve the power of quantitative mapping studies. Finally, we document variation at the level of DNA methylation, and demonstrate that unmethylated regions are enriched for cis-regulatory elements that overlap QTL and contribute to changes in gene expression.One sentence summaryA multi-genome analysis of maize reveals previously unknown variation in gene content, genome structure, and methylation.
Intact transposable elements (TEs) account for 65% of the maize genome and can impact gene function and regulation. Although TEs comprise the majority of the maize genome and affect important phenotypes, genome wide patterns of TE polymorphisms in maize have only been studied in a handful of maize genotypes, due in part to the challenging nature of assessing highly repetitive sequences. We implemented a method to use short read sequencing data from 509 diverse inbred lines to classify the presence/absence of 494,564 non-redundant TEs that were previously annotated in four genome assemblies including B73, Mo17, PH207, and W22. Different orders of TEs (i.e. LTRs, Helitrons, TIRs) had different frequency distributions within the population. Across the different orders, TE family size was negatively correlated with average population frequency of TEs in the family and nested TEs are at lower frequency than non-nested TEs. Age of LTR elements was positively correlated with population frequency. Comparison with SNP data revealed that while a majority of TEs are tagged by nearby SNPs (r2 > 0.9) there are also many TEs in low to moderate linkage disequilibrium with nearby SNPs. This study provides a population scale genome-wide assessment of TE variation in maize, and provides valuable insight on variation in TEs in maize and factors that contribute to this variation.
Intact transposable elements (TEs) account for 65% of the maize genome and can impact gene function and regulation. Although TEs comprise the majority of the maize genome and affect important phenotypes, genome wide patterns of TE polymorphisms in maize have only been studied in a handful of maize genotypes, due to the challenging nature of assessing highly repetitive sequences. We implemented a method to use short read sequencing data from 509 diverse inbred lines to classify the presence/absence of 445,418 non-redundant TEs that were previously annotated in four genome assemblies including B73, Mo17, PH207, and W22. Different orders of TEs (i.e. LTRs, Helitrons, TIRs) had different frequency distributions within the population. LTRs with lower LTR similarity were generally more frequent in the population than LTRs with higher LTR similarity, though high frequency insertions with very high LTR similarity were observed. LTR similarity and frequency estimates of nested elements and the outer elements in which they insert revealed that most nesting events occurred very near the timing of the outer element insertion. TEs within genes were at higher frequency than those that were outside of genes and this is particularly true for those not inserted into introns. Many TE insertional polymorphisms observed in this population were tagged by SNP markers. However, there were also 19.9% of the TE polymorphisms that were not well tagged by SNPs (R2 < 0.5) that potentially represent information that has not been well captured in previous SNP based marker-trait association studies. This study provides a population scale genome-wide assessment of TE variation in maize, and provides valuable insight on variation in TEs in maize and factors that contribute to this variation.
Maize exhibits tremendous gene expression variation between different lines.Complementation of diverse gene expression patterns in hybrids could play an important role in the manifestation of heterosis. In this study, we used transcriptome data of five different tissues from 33 maize inbreds and 89 hybrids (430 samples in total)to survey the global gene expression landscape of F 1 -hybrids relative to their inbred parents. Analysis of this data set revealed that single parent expression (SPE), which is defined as gene expression in only one of the two parents, while commonly observed, is highly genotype-and tissue-specific. Genes that have SPE in at least one pair of inbreds also tend to be tissue-specific. Genes with SPE caused by genomic presence/absence variation (PAV SPE) are much more frequently expressed in hybrids than genes that are present in the genome of both inbreds, but expressed in only a single-parent (non-PAV SPE) (74.7% vs. 59.7%). For non-PAV SPE genes, allele specific expression was used to investigate whether parental alleles not expressed in the inbred line ("silent allele") can be actively transcribed in the hybrid. We found that expression of the silent allele in the hybrid is relatively rare (~6.3% of non-PAV SPE genes), but is observed in almost all hybrids and tissues. Non-PAV SPE genes with expression of the silent allele in the hybrid are more likely to exhibit above high-parent expression level in the hybrid than those that do not express the silent allele. Finally, both PAV SPE and non-PAV SPE genes are highly enriched for being classified as non-syntenic, but depleted for curated genes with experimentally determined functions. This study provides a more comprehensive understanding of the potential role of non-PAV SPE and PAV SPE genes in heterosis.
SUMMARYSingle‐parent expression (SPE) is defined as gene expression in only one of the two parents. SPE can arise from differential expression between parental alleles, termed non‐presence/absence (non‐PAV) SPE, or from the physical absence of a gene in one parent, termed PAV SPE. We used transcriptome data of diverse Zea mays (maize) inbreds and hybrids, including 401 samples from five different tissues, to test for differences between these types of SPE genes. Although commonly observed, SPE is highly genotype and tissue specific. A positive correlation was observed between the genetic distance of the two inbred parents and the number of SPE genes identified. Regulatory analysis showed that PAV SPE and non‐PAV SPE genes are mainly regulated by cis effects, with a small fraction under trans regulation. Polymorphic transposable element insertions in promoter sequences contributed to the high level of cis regulation for PAV SPE and non‐PAV SPE genes. PAV SPE genes were more frequently expressed in hybrids than non‐PAV SPE genes. The expression of parentally silent alleles in hybrids of non‐PAV SPE genes was relatively rare but occurred in most hybrids. Non‐PAV SPE genes with expression of the silent allele in hybrids are more likely to exhibit above high parent expression level than hybrids that do not express the silent allele, leading to non‐additive expression. This study provides a comprehensive understanding of the nature of non‐PAV SPE and PAV SPE genes and their roles in gene expression complementation in maize hybrids.
Wheat stem rust disease caused by Puccinia graminis f. sp. tritici (Pgt) is a global threat to wheat production. Fast evolving populations of Pgt limit the efficacy of plant genetic resistance and constrain disease management strategies. Understanding molecular mechanisms that lead to rust infection and disease susceptibility could deliver novel strategies to deploy crop resistance through genetic loss of disease susceptibility. We used comparative transcriptome-based and orthology-guided approaches to characterize gene expression changes associated with Pgt infection in susceptible and resistant Triticum aestivum genotypes as well as the non-host Brachypodium distachyon. We targeted our analysis to genes with differential expression in T. aestivum and genes suppressed or not affected in B. distachyon and report several processes potentially linked to susceptibility to Pgt, such as cell death suppression and impairment of photosynthesis. We complemented our approach with a gene co-expression network analysis to identify wheat targets to deliver resistance to Pgt through removal or modification of putative susceptibility genes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.