We report de novo genome assemblies, transcriptomes, annotations, and methylomes for the 26 inbreds that serve as the founders for the maize nested association mapping population. The number of pan-genes in these diverse genomes exceeds 103,000, with approximately a third found across all genotypes. The results demonstrate that the ancient tetraploid character of maize continues to degrade by fractionation to the present day. Excellent contiguity over repeat arrays and complete annotation of centromeres revealed additional variation in major cytological landmarks. We show that combining structural variation with single-nucleotide polymorphisms can improve the power of quantitative mapping studies. We also document variation at the level of DNA methylation and demonstrate that unmethylated regions are enriched for cis-regulatory elements that contribute to phenotypic variation.
It is generally believed that maize (Zea mays L. ssp. mays) arose as a tetraploid; however, the two progenitor genomes cannot be unequivocally traced within the genome of modern maize. We have taken a new approach to investigate the origin of the maize genome. We isolated and sequenced large genomic fragments from the regions surrounding five duplicated loci from the maize genome and their orthologous loci in sorghum, and then we compared these sequences with the orthologous regions in the rice genome. Within the studied segments, we identified 11 genes that were conserved in location, order, and orientation. We performed phylogenetic and distance analyses and examined the patterns of estimated times of divergence for sorghum and maize gene orthologs and also the time of divergence for maize orthologs. Our results support a tetraploid origin of maize. This analysis also indicates contemporaneous divergence of the ancestral sorghum genome and the two maize progenitor genomes about 11.9 million years ago (Mya). On the basis of a putative conversion event detected for one of the genes, tetraploidization must have occurred before 4.8 Mya, and therefore, preceded the major maize genome expansion by gene amplification and retrotransposition.Maize (Zea mays L. ssp. mays), from the grass tribe Andropogoneae, is an agronomically important crop and also a traditional genetic model. The theory that maize is a tetraploid first arose from the fact that maize has a haploid chromosome number of 10 (2n = 20), whereas many closely related grasses, such as Coix aquatica, Saccharum sp., and Erianthus sp., have only five chromosomes in the haploid nucleus (Celarier 1956;Mehra and Sharma 1975;Mason-Gamer et al. 1998). Rhoades (1951) demonstrated by genetic linkage mapping that nontandem gene duplicates are common in the maize genome. Mapping results of isozymic variants were also consistent with the hypothesis that the maize genome contains large duplicated regions (Goodman et al. 1980;McMillin and Scandalios 1980;Wendel et al. 1986). More recently, molecular mapping studies using RFLP markers have shown that most of the 10 maize chromosomes contain duplicated segments (Helentjaris et al. 1988;Ahn and Tanksley 1993). Comparative genomic studies based on the collinearity of grass genomes (Moore et al. 1995;Gale and Devos 1998) indicated that the maize genome aligns with the diploid rice and sorghum genomes in two chromosome sets, implying wholegenome duplication.Three models can explain the large-scale duplications in the maize genome, that is, segmental duplication (multiple independent duplications within a genome), autotetraploidy (intraspecific genomic duplication), and allotetraploidy (interspecific genome hybridization). Gaut and Doebley (1997) proposed a segmental allotetraploid model and suggested that one of the maize subgenomes is more closely related to sorghum than to the other maize subgenome. However, they deemed their conclusions tentative (Gaut and Doebley 1997;Gaut et al. 2000) in the absence of sorghum sequences. Al...
Three RFLP maps, as well as several RAPD maps have been developed in common bean (Phaseolus vulgaris L.). In order to align these maps, a core linkage map was established in the recombinant inbred population BAT93;Jalo EEP558 (BJ). This map has a total length of 1226 cM and comprises 563 markers, including some 120 RFLP and 430 RAPD markers, in addition to a few isozyme and phenotypic marker loci. Among the RFLPs mapped were markers from the University of California, Davis (established in the F of the BJ cross), University of Paris-Orsay, and University of Florida maps. These shared markers allowed us to Communicated by P. M. A. Tigerstedt
The maize genome, with its large complement of transposons and repeats, is a paradigm for the study of epigenetic mechanisms such as paramutation and imprinting. Here, we present the genome-wide map of cytosine methylation for two maize inbred lines, B73 and Mo17. CG (65%) and CHG (50%) methylation (where H = A, C, or T) is highest in transposons, while CHH (5%) methylation is likely guided by 24-nt, but not 21-nt, small interfering RNAs (siRNAs). Correlations with methylation patterns suggest that CG methylation in exons (8%) may deter insertion of Mutator transposon insertion, while CHG methylation at splice acceptor sites may inhibit RNA splicing. Using the methylation map as a guide, we used low-coverage sequencing to show that parental methylation differences are inherited by recombinant inbred lines. However, frequent methylation switches, guided by siRNA, persist for up to eight generations, suggesting that epigenetic inheritance resembling paramutation is much more common than previously supposed. The methylation map will provide an invaluable resource for epigenetic studies in maize.
RNA-guided CRISPR-Cas9 endonucleases are widely used for genome engineering, but our understanding of Cas9 specificity remains incomplete. Here, we developed a biochemical method (SITE-Seq), using Cas9 programmed with single-guide RNAs (sgRNAs), to identify the sequence of cut sites within genomic DNA. Cells edited with the same Cas9-sgRNA complexes are then assayed for mutations at each cut site using amplicon sequencing. We used SITE-Seq to examine Cas9 specificity with sgRNAs targeting the human genome. The number of sites identified depended on sgRNA sequence and nuclease concentration. Sites identified at lower concentrations showed a higher propensity for off-target mutations in cells. The list of off-target sites showing activity in cells was influenced by sgRNP delivery, cell type and duration of exposure to the nuclease. Collectively, our results underscore the utility of combining comprehensive biochemical identification of off-target sites with independent cell-based measurements of activity at those sites when assessing nuclease activity and specificity.
Plant oil is an important renewable resource for biodiesel production and for dietary consumption by humans and livestock. Through genetic mapping of the oil trait in plants, studies have reported multiple quantitative trait loci (QTLs) with small effects, but the molecular basis of oil QTLs remains largely unknown. Here we show that a high-oil QTL (qHO6) affecting maize seed oil and oleic-acid contents encodes an acyl-CoA:diacylglycerol acyltransferase (DGAT1-2), which catalyzes the final step of oil synthesis. We further show that a phenylalanine insertion in DGAT1-2 at position 469 (F469) is responsible for the increased oil and oleic-acid contents. The DGAT1-2 allele with F469 is ancestral, whereas the allele without F469 is a more recent mutant selected by domestication or breeding. Ectopic expression of the high-oil DGAT1-2 allele increases oil and oleic-acid contents by up to 41% and 107%, respectively. This work provides insights into the molecular basis of natural variation of oil and oleic-acid contents in plants and highlights DGAT as a promising target for increasing oil and oleic-acid contents in other crops.
GATA3 is an essential transcription factor that was first identified as a regulator of immune cell function. In recent microarray analyses of human breast tumors, both normal breast luminal epithelium and estrogen receptor (ESR1)-positive tumors showed high expression of GATA3. We sequenced genomic DNA from 111 breast tumors and three breast-tumor-derived cell lines and identified somatic mutations of GATA3 in five tumors and the MCF-7 cell line. These mutations cluster in the vicinity of the highly conserved second zinc-finger that is required for DNA binding. In addition to these five, we identified using cDNA sequencing a unique mis-splicing variant that caused a frameshift mutation. One of the somatic mutations we identified was identical to a germline GATA3 mutation reported in two kindreds with HDR syndrome/OMIM #146255, which is an autosomal dominant syndrome caused by the haplo-insufficiency of GATA3. The ectopic expression of GATA3 in human 293T cells caused the induction of 73 genes including six cytokeratins, and inhibited cell line doubling times. These data suggest that GATA3 is involved in growth control and the maintenance of the differentiated state in epithelial cells, and that GATA3 variants may contribute to tumorigenesis in ESR1-positive breast tumors.
The advent of next-generation DNA sequencing (NGS) technologies has led to the development of rapid genome-wide Single Nucleotide Polymorphism (SNP) detection applications in various plant species. Recent improvements in sequencing throughput combined with an overall decrease in costs per gigabase of sequence is allowing NGS to be applied to not only the evaluation of small subsets of parental inbred lines, but also the mapping and characterization of traits of interest in much larger populations. Such an approach, where sequences are used simultaneously to detect and score SNPs, therefore bypassing the entire marker assay development stage, is known as genotyping-by-sequencing (GBS). This review will summarize the current state of GBS in plants and the promises it holds as a genome-wide genotyping application.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.