An ordered draft sequence of the 17-gigabase hexaploid bread wheat (Triticum aestivum) genome has been produced by sequencing isolated chromosome arms. We have annotated 124,201 gene loci distributed nearly evenly across the homeologous chromosomes and subgenomes. Comparative gene analysis of wheat subgenomes and extant diploid and tetraploid wheat relatives showed that high sequence similarity and structural conservation are retained, with limited gene loss, after polyploidization. However, across the genomes there was evidence of dynamic gene gain, loss, and duplication since the divergence of the wheat lineages. A high degree of transcriptional autonomy and no global dominance were found for the subgenomes. These insights into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.
Author contributions: DCJ coordinated all analyses, isolated DNA for sequencing, analysed and filtered SNP calls, conducted diversity analysis and GWAS and drafted the manuscript. CR produced phenotype data for growth on various solid media and growth rates in liquid media. AR conducted analysis of dating using mitochondrial data. DS conducted GWAS. MP analysed all phenotype data. TM identified LTR transposon insertions and analysed transposon insertion data. FXM conducted crosses for analysis of spore viability. ZI produced indel calls with Cortex. WL conducted analysis of recombination rate, linkage disequilibrium decay and PCA for distance between strains. TMKC assisted with phenotype and population analysis. RP analysed Cortex and GATK indel calls. MM conducted amino acid profiling. JLDL and AC produced automated measures of cell morphology. SB aligned reads and produced GATK SNP calls. GH analysed population structure using fineSTRUCTURE. BO'F estimated the TMRCA from the nuclear genome using ACG. TK identified LTR transposon insertions. JTS produced de novo assemblies. LB developed the custom Workspace workflow Spotsizer. BT assisted with sequence analysis. DAB assisted with analysis of novel genes. TS assisted with strain verification. SC produced images of wild strains and assisted with strain verification. JEEUH assisted with SNP validation. LvT and MT assisted with LTR validation. LJ and JL assisted with manual measures of cell morphology and FACS. SA produced gene expression data. MF, KM and ND assisted with sequencing. WB initiated and assisted with strain collection. JH coordinated manual measures of cell morphology and FACS. RECS coordinated automated measures of cell morphology. MR coordinated amino acid profiling. NM conducted analysis of recombination, linkage disequilibrium and advised on aspects of diversity and GWAS. DJB advised on GWAS. RD facilitated sequencing.
JB contributed to the initiation and development of the project and financed the JB laboratory.
Accessions: Sequence data are archived in the European Nucleotide Archive (www.ebi.ac.uk/ena/), Study Accessions PRJEB2733 and PRJEB6284 (Supplementary Table 7). All SNPs and indels were submitted to NCBI dbSNP (www.ncbi.nlm.nih.gov/SNP/). Accessions are 974514578-974688138 (SNPs) and 974702618-974688139 (indels).
Europe PMC Funders Group
Abstract: Natural variation within species reveals aspects of genome evolution and function. The fission yeast Schizosaccharomyces pombe is an important model for eukaryotic biology, but researchers typically use one standard laboratory strain. To extend the utility of this model, we surveyed the genomic and phenotypic variation in 161 natural isolates. We sequenced the genomes of all strains, revealing moderate genetic diversity (π = 3 × 10⁻³) and weak global population structure. We estimate that dispersal of S. pombe began within human antiquity (~340 BCE), and ancestors of these strains reached the Americas at ~1623 CE. We quantified 74 traits, revealing substantial heritable phenotypic diversity. We cond...
Background: Next generation sequencing (NGS) technologies are providing new ways to accelerate fine-mapping and gene isolation in many species. To date, the majority of these efforts have focused on diploid organisms with readily available whole genome sequence information. In this study, as a proof of concept, we tested the use of NGS for SNP discovery in tetraploid wheat lines differing for the previously cloned grain protein content (GPC) gene GPC-B1. Bulked segregant analysis (BSA) was used to define a subset of putative SNPs within the candidate gene region, which were then used to fine-map GPC-B1.
Results: We used Illumina paired-end technology to sequence mRNA (RNAseq) from near-isogenic lines differing across a ~30-cM interval including the GPC-B1 locus. After discriminating for SNPs between the two homoeologous wheat genomes and additional quality filtering, we identified inter-varietal SNPs in wheat unigenes between the parental lines. The relative frequency of these SNPs was examined by RNAseq in two bulked samples made up of homozygous recombinant lines differing for their GPC phenotype. SNPs that were enriched at least 3-fold in the corresponding pool (6.5% of all SNPs) were further evaluated. Marker assays were designed for a subset of the enriched SNPs and mapped using DNA from individuals of each bulk. Thirty-nine new SNP markers, corresponding to 67% of the validated SNPs, mapped across a 12.2-cM interval including GPC-B1. This translated to 1 SNP marker per 0.31 cM defining the GPC-B1 gene to within 13-18 genes in syntenic cereal genomes and to a 0.4-cM interval in wheat.
Conclusions: This study exemplifies the use of RNAseq for SNP discovery in polyploid species and supports the use of BSA as an effective way to target SNPs to specific genetic intervals to fine-map genes in unsequenced genomes.
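The enrichment filter described in the Results can be illustrated with a minimal sketch. The abstract does not say how fold enrichment was computed, so everything below is an assumption made for illustration: the `SnpCounts` record, the pseudocount, the symmetric ratio of allele frequencies between the two bulks, and the read counts themselves are hypothetical, not the authors' pipeline. The only values taken from the text are the 3-fold threshold and the 39 markers over 12.2 cM.

```python
from dataclasses import dataclass


@dataclass
class SnpCounts:
    """Hypothetical per-SNP read counts in the two phenotypic bulks."""
    snp_id: str
    high_alt: int    # reads carrying the alternate allele, high-GPC bulk
    high_total: int  # all reads covering the SNP, high-GPC bulk
    low_alt: int     # reads carrying the alternate allele, low-GPC bulk
    low_total: int   # all reads covering the SNP, low-GPC bulk


def allele_freq(alt: int, total: int, pseudo: float = 0.5) -> float:
    """Alternate-allele frequency with a pseudocount (an assumption)
    so that a bulk with zero alternate reads does not divide by zero."""
    return (alt + pseudo) / (total + 2 * pseudo)


def fold_enrichment(snp: SnpCounts) -> float:
    """Ratio of allele frequencies between bulks, taken in whichever
    direction is larger, so enrichment in either pool is detected."""
    f_high = allele_freq(snp.high_alt, snp.high_total)
    f_low = allele_freq(snp.low_alt, snp.low_total)
    return max(f_high / f_low, f_low / f_high)


def enriched(snps: list[SnpCounts], min_fold: float = 3.0) -> list[str]:
    """Keep SNP ids enriched at least `min_fold` in one bulk
    (3-fold is the threshold stated in the abstract)."""
    return [s.snp_id for s in snps if fold_enrichment(s) >= min_fold]


def marker_spacing(interval_cm: float, n_markers: int) -> float:
    """Average genetic distance per marker across an interval."""
    return interval_cm / n_markers


# Made-up counts: snp1 is skewed between bulks, snp2 is not.
example = [
    SnpCounts("snp1", high_alt=45, high_total=50, low_alt=5, low_total=50),
    SnpCounts("snp2", high_alt=25, high_total=50, low_alt=24, low_total=50),
]
kept = enriched(example)
spacing = round(marker_spacing(12.2, 39), 2)  # figures from the abstract
```

With these invented counts only `snp1` passes the filter, and the spacing calculation reproduces the reported density of roughly one marker per 0.31 cM (12.2 cM divided by 39 markers).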
Background: Storage triacylglycerols in castor bean seeds are enriched in the hydroxylated fatty acid ricinoleate. Extensive tissue-specific RNA-Seq transcriptome and lipid analysis will help identify components important for its biosynthesis.
Methodology/Findings: Storage triacylglycerols (TAGs) in the endosperm of developing castor (Ricinus communis) seeds are highly enriched in ricinoleic acid (18:1-OH). We have analysed neutral lipid fractions from other castor tissues using TLC, GLC and mass spectrometry. Cotyledons, like the endosperm, contain high levels of 18:1-OH in TAG. Pollen and male developing flowers accumulate TAG but do not contain 18:1-OH, and leaves do not contain TAG or 18:1-OH. Analysis of acyl-CoAs in developing endosperm shows that ricinoleoyl-CoA is not the dominant acyl-CoA, indicating that either metabolic channelling or enzyme substrate selectivity is important in the synthesis of tri-ricinolein in this tissue. RNA-Seq transcriptomic analysis, using Illumina sequencing by synthesis technology, has been performed on mRNA isolated from two stages of developing seeds, germinating seeds, leaf and pollen-producing male flowers in order to identify differences in lipid-metabolic pathways and enzyme isoforms which could be important in the biosynthesis of TAG enriched in 18:1-OH. This study gives comprehensive coverage of gene expression in a variety of different castor tissues. The potential role of differentially expressed genes is discussed against a background of proteins identified in the endoplasmic reticulum, which is the site of TAG biosynthesis, and transgenic studies aimed at increasing the ricinoleic acid content of TAG.
Conclusions/Significance: Several of the genes identified in this tissue-specific whole transcriptome study have been used in transgenic plant research aimed at increasing the level of ricinoleic acid in TAG. New candidate genes have been identified which might further improve the level of ricinoleic acid in transgenic crops.