Transposable elements (TEs) are ubiquitous components of eukaryotic genomes and can create variation in genome organization and content. Most maize genomes are composed of TEs. We developed an approach to define shared and variable TE insertions across genome assemblies and applied this method to four maize genomes (B73, W22, Mo17 and PH207) with uniform structural annotations of TEs. Among these genomes we identified approximately 400 000 TEs that are polymorphic, encompassing 1.6 Gb of variable TE sequence. These polymorphic TEs include a combination of recent transposition events as well as deletions of older TEs. There are examples of polymorphic TEs within each of the superfamilies of TEs and they are found distributed across the genome, including in regions of recent shared ancestry among individuals. There are many examples of polymorphic TEs within or near maize genes. In addition, there are 2380 gene annotations in the B73 genome that are located within variable TEs, providing evidence for the role of TEs in contributing to the substantial differences in annotated gene content among these genotypes. TEs are highly variable in our survey of four temperate maize genomes, highlighting the major contribution of TEs in driving variation in genome organization and gene content.
Transposable elements (TEs) are ubiquitous components of eukaryotic genomes and can create variation in genomic organization. The majority of maize genomes are composed of TEs. We developed an approach to define shared and variable TE insertions across genome assemblies and applied this method to four maize genomes (B73, W22, Mo17, and PH207). Among these genomes we identified 1.6 Gb of variable TE sequence representing a combination of recent TE movement and deletion of previously existing TEs. Although recent TE movement only accounted for a portion of the TE variability, we identified 4,737 TEs unique to one genome with defined insertion sites in all other genomes. Variable TEs are found for all superfamilies and are distributed across the genome, including in regions of recent shared ancestry among individuals. There are 2,380 genes annotated in the B73 genome located within variable TEs, providing evidence for the role of TEs in contributing to the substantial differences in gene content among these genotypes. The large scope of TE variation present in this limited sample of temperate maize genomes highlights the major contribution of TEs in driving variation in genome organization and gene content.Significance StatementThe majority of the maize genome is comprised of transposable elements (TEs) that have the potential to create genomic variation within species. We developed a method to identify shared and non-shared TEs using whole genome assemblies of four maize inbred lines. Variable TEs are found throughout the maize genome and in comparisons of any two genomes we find ~20% of the genome is due to non-shared TEs. Several thousand maize genes are found within TEs that are variable across lines, highlighting the contribution of TEs to gene content variation. This study creates a comprehensive resource for genomic studies of TE variability among four maize genomes, which will enable studies on the consequences of variable TEs on genome function.
SUMMARYMaize is a diverse paleotetraploid species with considerable presence/absence variation and copy number variation. One mechanism through which presence/absence variation can arise is differential fractionation. Fractionation refers to the loss of duplicate gene pairs from one of the maize subgenomes during diploidization. Differential fractionation refers to non-shared gene loss events between individuals following a whole-genome duplication event. We investigated the prevalence of presence/absence variation resulting from differential fractionation in the syntenic portion of the genome using two whole-genome de novo assemblies of the inbred lines B73 and PH207. Between these two genomes, syntenic genes were highly conserved with less than 1% of syntenic genes being subject to differential fractionation. The few variably fractionated syntenic genes that were identified are unlikely to contribute to functional phenotypic variation, as there is a significant depletion of these genes in annotated gene sets. In further comparisons of 60 diverse inbred lines, non-syntenic genes were six times more likely to be variable than syntenic genes, suggesting that comparisons among additional genome assemblies are not likely to result in the discovery of large-scale presence/absence variation among syntenic genes.
Tandem duplicate genes are proximally duplicated and as such occur in similar genomic neighborhoods. Using the maize B73 and PH207 de novo genome assemblies, we identified thousands of tandem gene duplicates that account for ∼10% of the annotated genes. These tandem duplicates have a bimodal distribution of ages, which coincide with ancient allopolyploidization and more recent domestication. Tandem duplicates are smaller on average and have a higher probability of containing LTR elements than other genes, suggesting origins in nonhomologous recombination. Within relatively recent tandem duplicate genes, ∼26% appear to be undergoing degeneration or divergence in function from the ancestral copy. Our results show that tandem duplicates are abundant in maize, arose in bursts throughout maize evolutionary history under multiple potential mechanisms, and may provide a substrate for novel phenotypic variation.
SUMMARYMaize is a diverse paleotetraploid species with widespread presence/absence variation and copy number variation. One mechanism through which presence/absence variation can arise is differential fractionation. Fractionation refers to the loss of duplicate gene pairs from one of the maize subgenomes during diploidization and differential fractionation refers to non-shared gene loss events between individuals. We investigated the prevalence of presence/absence variation resulting from differential fractionation in the syntenic portion of the genome using two whole genome de novo assemblies of the inbred lines B73 and PH207. Between these two genomes, syntenic genes were highly conserved with less than 1% of syntenic genes being subject to differential fractionation. The few variable syntenic genes that were identified are unlikely to contribute to functional phenotypic variation, as there is a significant depletion of these genes in annotated gene sets. In further comparisons of 60 diverse inbred lines, non-syntenic genes were six times more likely to be variable compared to syntenic genes, suggesting that comparisons among additional genome assemblies are not likely to result in the discovery of large-scale presence/absence variation among syntenic genes.SIGNIFICANCE STATEMENTThere is a large amount of presence/absence variation for gene content in maize. One mechanism that has been hypothesized to contribute to this variation is differential fractionation between individuals following the maize whole genome duplication event. Using comparative genomics, with sorghum and rice representing the ancestral state, we observed little evidence of differential fractionation among elite inbred lines and the few differentially fractionated genes identified did not appear to confer functional significance.
Maize exhibits tremendous gene expression variation between different lines.Complementation of diverse gene expression patterns in hybrids could play an important role in the manifestation of heterosis. In this study, we used transcriptome data of five different tissues from 33 maize inbreds and 89 hybrids (430 samples in total)to survey the global gene expression landscape of F 1 -hybrids relative to their inbred parents. Analysis of this data set revealed that single parent expression (SPE), which is defined as gene expression in only one of the two parents, while commonly observed, is highly genotype-and tissue-specific. Genes that have SPE in at least one pair of inbreds also tend to be tissue-specific. Genes with SPE caused by genomic presence/absence variation (PAV SPE) are much more frequently expressed in hybrids than genes that are present in the genome of both inbreds, but expressed in only a single-parent (non-PAV SPE) (74.7% vs. 59.7%). For non-PAV SPE genes, allele specific expression was used to investigate whether parental alleles not expressed in the inbred line ("silent allele") can be actively transcribed in the hybrid. We found that expression of the silent allele in the hybrid is relatively rare (~6.3% of non-PAV SPE genes), but is observed in almost all hybrids and tissues. Non-PAV SPE genes with expression of the silent allele in the hybrid are more likely to exhibit above high-parent expression level in the hybrid than those that do not express the silent allele. Finally, both PAV SPE and non-PAV SPE genes are highly enriched for being classified as non-syntenic, but depleted for curated genes with experimentally determined functions. This study provides a more comprehensive understanding of the potential role of non-PAV SPE and PAV SPE genes in heterosis.
SUMMARYSingle‐parent expression (SPE) is defined as gene expression in only one of the two parents. SPE can arise from differential expression between parental alleles, termed non‐presence/absence (non‐PAV) SPE, or from the physical absence of a gene in one parent, termed PAV SPE. We used transcriptome data of diverse Zea mays (maize) inbreds and hybrids, including 401 samples from five different tissues, to test for differences between these types of SPE genes. Although commonly observed, SPE is highly genotype and tissue specific. A positive correlation was observed between the genetic distance of the two inbred parents and the number of SPE genes identified. Regulatory analysis showed that PAV SPE and non‐PAV SPE genes are mainly regulated by cis effects, with a small fraction under trans regulation. Polymorphic transposable element insertions in promoter sequences contributed to the high level of cis regulation for PAV SPE and non‐PAV SPE genes. PAV SPE genes were more frequently expressed in hybrids than non‐PAV SPE genes. The expression of parentally silent alleles in hybrids of non‐PAV SPE genes was relatively rare but occurred in most hybrids. Non‐PAV SPE genes with expression of the silent allele in hybrids are more likely to exhibit above high parent expression level than hybrids that do not express the silent allele, leading to non‐additive expression. This study provides a comprehensive understanding of the nature of non‐PAV SPE and PAV SPE genes and their roles in gene expression complementation in maize hybrids.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.