Alternative splicing enhances transcriptome diversity in all eukaryotes and plays a role in plant tissue identity and stress adaptation. To catalog new maize (Zea mays) transcripts and identify genomic loci that regulate alternative splicing, we analyzed over 90 RNA-seq libraries from maize inbred lines B73 and Mo17, as well as Syn10 doubled haploid lines (progenies from B73 3 Mo17). Transcript discovery was augmented with publicly available data from 14 maize tissues, expanding the maize transcriptome by more than 30,000 and increasing the percentage of intron-containing genes that undergo alternative splicing to 40%. These newly identified transcripts greatly increase the diversity of the maize proteome, sometimes coding for entirely different proteins compared with their most similar annotated isoform. In addition to increasing proteome diversity, many genes encoding novel transcripts gained an additional layer of regulation by microRNAs, often in a tissue-specific manner. We also demonstrate that the majority of genotype-specific alternative splicing can be genetically mapped, with cis-acting quantitative trait loci (QTLs) predominating. A large number of trans-acting QTLs were also apparent, with nearly half located in regions not shown to contain genes associated with splicing. Taken together, these results highlight the currently underappreciated role that alternative splicing plays in tissue identity and genotypic variation in maize.
Deleterious alleles have long been proposed to play an important role in patterning phenotypic variation and are central to commonly held ideas explaining the hybrid vigor observed in the offspring of a cross between two inbred parents. We test these ideas using evolutionary measures of sequence conservation to ask whether incorporating information about putatively deleterious alleles can inform genomic selection (GS) models and improve phenotypic prediction. We measured a number of agronomic traits in both the inbred parents and hybrids of an elite maize partial diallel population and re-sequenced the parents of the population. Inbred elite maize lines vary for more than 350,000 putatively deleterious sites, but show a lower burden of such sites than a comparable set of traditional landraces. Our modeling reveals widespread evidence for incomplete dominance at these loci, and supports theoretical models that more damaging variants are usually more recessive. We identify haplotype blocks using an identity-by-decent (IBD) analysis and perform genomic prediction analyses in which we weigh blocks on the basis of complementation for segregating putatively deleterious variants. Cross-validation results show that incorporating sequence conservation in genomic selection improves prediction accuracy for grain yield and other fitness-related traits as well as heterosis for those traits. Our results provide empirical support for an important role for incomplete dominance of deleterious alleles in explaining heterosis and demonstrate the utility of incorporating functional annotation in phenotypic prediction and plant breeding.
Deleterious alleles have long been proposed to play an important role in patterning phenotypic variation and are central to commonly held ideas explaining the hybrid vigor observed in the offspring of a cross between two inbred parents. We test these ideas using evolutionary measures of sequence conservation to ask whether incorporating information about putatively deleterious alleles can inform genomic selection (GS) models and improve phenotypic prediction. We measured a number of agronomic traits in both the inbred parents and hybrids of an elite maize partial diallel population and re-sequenced the parents of the population. Inbred elite maize lines vary for more than 350,000 putatively deleterious sites, but show a lower burden of such sites than a comparable set of traditional landraces. Our modeling reveals widespread evidence for incomplete dominance at these loci, and supports theoretical models that more damaging variants are usually more recessive. We identify haplotype blocks using an identity-by-decent (IBD) analysis and perform genomic prediction analyses in which we weigh blocks on the basis of complementation for segregating putatively deleterious variants. Cross-validation results show that incorporating sequence conservation in genomic selection improves prediction accuracy for grain yield and other fitness-related traits as well as heterosis for those traits. Our results provide empirical support for an important role for incomplete dominance of deleterious alleles in explaining heterosis and demonstrate the utility of incorporating functional annotation in phenotypic prediction and plant breeding. Author summaryA key long-term goal of biology is understanding the genetic basis of phenotypic variation. Although most new mutations are likely disadvantageous, their prevalence and importance in explaining patterns of phenotypic variation is controversial and not well understood. In this study we combine whole genome-sequencing and field evaluation of a maize mapping population to investigate the contribution of deleterious mutations to phenotype. We show that a priori prediction of deleterious alleles correlates well with effect sizes for grain yield and that variants predicted to be more damaging are on average more recessive. We develop a simple model allowing for variation in the heterozygous effects of deleterious mutations and demonstrate its improved ability to predict both phenotypes and hybrid vigor. Our results help reconcile alternative explanations for hybrid vigor and highlight the use of leveraging evolutionary history to facilitate breeding for crop improvement.
Phytophthora root and stem rot caused by P. sojae is a destructive soybean soil-borne disease found worldwide. Discovery of genes conferring broad-spectrum resistance to the pathogen is a need to prevent the outbreak of the disease. Here, we show that soybean Rps11 is a 27.7-kb nucleotide-binding site-leucine-rich repeat (NBS-LRR or NLR) gene conferring broad-spectrum resistance to the pathogen. Rps11 is located in a genomic region harboring a cluster of large NLR genes of a single origin in soybean, and is derived from rounds of unequal recombination. Such events result in promoter fusion and LRR expansion that may contribute to the broad resistance spectrum. The NLR gene cluster exhibits drastic structural diversification among phylogenetically representative varieties, including gene copy number variation ranging from five to 23 copies, and absence of allelic copies of Rps11 in any of the non-Rps11-donor varieties examined, exemplifying innovative evolution of NLR genes and NLR gene clusters.
The Union Internationale pour la Protection des Obtentions Végétales (UPOV) currently relies on morphological characteristics to evaluate distinctness, uniformity, and stability (DUS) as eligibility requirements for the granting of Plant Variety Protection (PVP). We used 10 maize (Zea mays L.) inbred lines, including both unrelated and closely similar pairs, representing three heterotic groups to compare abilities of morphological, ribonucleic acid (RNA) transcription, metabolomic, and single nucleotide polymorphism (SNP) data to distinguish inbred lines. We used the range of variability and robustness as important factors to determine distinguishing power of each methodological approach. Using an index that ranged from 0 to 100 (useless to the perfect ideal), index scores for each methodology were: metabolomics (0), RNA transcription (18.2), morphology (19.6), and SNPs (35.7). The ability to distinguish among genotypes using RNA transcription expression data was concordant with SNP data for genotypes that were up to 97.2% similar according to SNPs. The SNP data alone could provide the basis for a determination of distinctness among inbred lines of maize with use of morphological, physiological, or agronomic performance data as supplementary information, if needed.
Background: Repetitive DNA is a major component of plant genomes and is thought to be a driver of evolutionary novelty. Describing variation in repeat content among individuals and between populations is key to elucidating the evolutionary significance of repetitive DNA. However, the cost of producing references genomes has limited large-scale intraspecific comparisons to a handful of model organisms where multiple reference genomes are available. Results: We examine repeat content variation in the genomes of 94 elite inbred maize lines using graph-based repeat clustering, a reference-free and rapid assay of repeat content. We examine population structure using genome-wide repeat profiles, and demonstrate the stiff-stalk and non-stiff-stalk heterotic populations are homogenous with regard to global repeat content. In contrast, and similar to previously reported results, the same individuals show clear differentiation, and aggregate into two populations when examining population structure using genome-wide SNPs. Additionally, we develop a novel kmer based technique to examine the chromosomal distribution of repeat clusters in silico and show a cluster dependent association with gene density. Conclusion: Our results indicate global repeat content variation in the heterotic populations of maize has not diverged, and is uncoupled from population stratification at SNP loci. We show that repeat families exhibit divergent patterns with regard to chromosomal distribution, some repeat clusters accumulate in regions of high gene density, whereas others aggregate in regions of low gene density.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.