An ordered draft sequence of the 17-gigabase hexaploid bread wheat (Triticum aestivum) genome has been produced by sequencing isolated chromosome arms. We have annotated 124,201 gene loci distributed nearly evenly across the homeologous chromosomes and subgenomes. Comparative gene analysis of wheat subgenomes and extant diploid and tetraploid wheat relatives showed that high sequence similarity and structural conservation are retained, with limited gene loss, after polyploidization. However, across the genomes there was evidence of dynamic gene gain, loss, and duplication since the divergence of the wheat lineages. A high degree of transcriptional autonomy and no global dominance was found for the subgenomes. These insights into the genome biology of a polyploid crop provide a springboard for faster gene isolation, rapid genetic marker development, and precise breeding to meet the needs of increasing food demand worldwide.
Author contributions DCJ coordinated all analyses, isolated DNA for sequencing, analysed and filtered SNP calls, conducted diversity analysis and GWAS and drafted the manuscript. CR produced phenotype data for growth on various solid media and growth rates in liquid media. AR conducted analysis of dating using mitochondrial data. DS conducted GWAS. MP analysed all phenotype data. TM identified LTR transposon insertions and analysed transposon insertion data. FXM conducted crosses for analysis of spore viability ZI produced indel calls with Cortex. WL conducted analysis of recombination rate, linkage disequilibrium decay and PCA for distance between strains. TMKC assisted with phenotype and population analysis. RP analysed Cortex and GATK indel calls. MM conducted amino acid profiling. JLDL and AC produced automated measures of cell morphology. SB aligned reads and produced GATK SNP calls. GH analysed population structure using fineSTRUCTURE. BO'F estimated the TMRCA from the nuclear genome using ACG. TK identified LTR transposon insertions JTS produced de novo assemblies. LB developed the custom Workspace workflow Spotsizer. BT assisted with sequence analysis. DAB assisted with analysis of novel genes. TS assisted with strain verification. SC produced images of wild strains and assisted with strain verification. JEEUH assisted with SNP validation. LvT and MT assisted with LTR validation. LJ and JL assisted with manual measures of cell morphology and FACS. SA produced gene expression data. MF, KM and ND assisted with sequencing. WB initiated and assisted with strain collection. JH coordinated manual measures of cell morphology and FACS. RECS coordinated automated measures of cell morphology. MR coordinated amino acid profiling. NM conducted analysis of recombination, linkage disequilibrium and advised on aspects of diversity and GWAS. DJB advised on GWAS. RD facilitated sequencing. JB contributed to the initiation and development of the project and financed the JB laboratory. AccessionsSequence data are archived in the European Nucleotide Archive (www.ebi.ac.uk/ena/), Study Accessions PRJEB2733 and PRJEB6284 (Supplementary Table 7). All SNPs and indels were submitted to NCBI dbSNP (www.ncbi.nlm.nih.gov/SNP/). Accessions are 974514578-974688138 (SNPs) and 974702618-974688139 (indels). Europe PMC Funders Group AbstractNatural variation within species reveals aspects of genome evolution and function. The fission yeast Schizosaccharomyces pombe is an important model for eukaryotic biology, but researchers typically use one standard laboratory strain. To extend the utility of this model, we surveyed the genomic and phenotypic variation in 161 natural isolates. We sequenced the genomes of all strains, revealing moderate genetic diversity (π = 3 ×10 −3 ) and weak global population structure. We estimate that dispersal of S. pombe began within human antiquity (~340 BCE), and ancestors of these strains reached the Americas at ~1623 CE. We quantified 74 traits, revealing substantial heritable phenotypic diversity. We cond...
BackgroundNext generation sequencing (NGS) technologies are providing new ways to accelerate fine-mapping and gene isolation in many species. To date, the majority of these efforts have focused on diploid organisms with readily available whole genome sequence information. In this study, as a proof of concept, we tested the use of NGS for SNP discovery in tetraploid wheat lines differing for the previously cloned grain protein content (GPC) gene GPC-B1. Bulked segregant analysis (BSA) was used to define a subset of putative SNPs within the candidate gene region, which were then used to fine-map GPC-B1.ResultsWe used Illumina paired end technology to sequence mRNA (RNAseq) from near isogenic lines differing across a ~30-cM interval including the GPC-B1 locus. After discriminating for SNPs between the two homoeologous wheat genomes and additional quality filtering, we identified inter-varietal SNPs in wheat unigenes between the parental lines. The relative frequency of these SNPs was examined by RNAseq in two bulked samples made up of homozygous recombinant lines differing for their GPC phenotype. SNPs that were enriched at least 3-fold in the corresponding pool (6.5% of all SNPs) were further evaluated. Marker assays were designed for a subset of the enriched SNPs and mapped using DNA from individuals of each bulk. Thirty nine new SNP markers, corresponding to 67% of the validated SNPs, mapped across a 12.2-cM interval including GPC-B1. This translated to 1 SNP marker per 0.31 cM defining the GPC-B1 gene to within 13-18 genes in syntenic cereal genomes and to a 0.4 cM interval in wheat.ConclusionsThis study exemplifies the use of RNAseq for SNP discovery in polyploid species and supports the use of BSA as an effective way to target SNPs to specific genetic intervals to fine-map genes in unsequenced genomes.
BackgroundStorage triacylglycerols in castor bean seeds are enriched in the hydroxylated fatty acid ricinoleate. Extensive tissue-specific RNA-Seq transcriptome and lipid analysis will help identify components important for its biosynthesis.Methodology/FindingsStorage triacylglycerols (TAGs) in the endosperm of developing castor (Ricinus communis) seeds are highly enriched in ricinoleic acid (18:1-OH). We have analysed neutral lipid fractions from other castor tissues using TLC, GLC and mass spectrometry. Cotyledons, like the endosperm, contain high levels of 18:1-OH in TAG. Pollen and male developing flowers accumulate TAG but do not contain 18:1-OH and leaves do not contain TAG or 18:1-OH. Analysis of acyl-CoAs in developing endosperm shows that ricinoleoyl-CoA is not the dominant acyl-CoA, indicating that either metabolic channelling or enzyme substrate selectivity are important in the synthesis of tri-ricinolein in this tissue. RNA-Seq transcriptomic analysis, using Illumina sequencing by synthesis technology, has been performed on mRNA isolated from two stages of developing seeds, germinating seeds, leaf and pollen-producing male flowers in order to identify differences in lipid-metabolic pathways and enzyme isoforms which could be important in the biosynthesis of TAG enriched in 18:1-OH. This study gives comprehensive coverage of gene expression in a variety of different castor tissues. The potential role of differentially expressed genes is discussed against a background of proteins identified in the endoplasmic reticulum, which is the site of TAG biosynthesis, and transgenic studies aimed at increasing the ricinoleic acid content of TAG.Conclusions/SignificanceSeveral of the genes identified in this tissue-specific whole transcriptome study have been used in transgenic plant research aimed at increasing the level of ricinoleic acid in TAG. New candidate genes have been identified which might further improve the level of ricinoleic acid in transgenic crops.
Chronic polymicrobial infections of the lung are the foremost cause of morbidity and mortality in cystic fibrosis (CF) patients. The composition of the microbial flora of the airway alters considerably during infection, particularly during patient exacerbation. An understanding of which organisms are growing, their environment and their behaviour in the airway is of importance for designing antibiotic treatment regimes and for patient prognosis. To this end, we have analysed sputum samples taken from separate cohorts of CF and non-CF subjects for metabolites and in parallel, and we have examined both isolated DNA and RNA for the presence of 16S rRNA genes and transcripts by high-throughput sequencing of amplicon or cDNA libraries. This analysis revealed that although the population size of all dominant orders of bacteria as measured by DNA- and RNA- based methods are similar, greater discrepancies are seen with less prevalent organisms, some of which we associated with CF for the first time. Additionally, we identified a strong relationship between the abundance of specific anaerobes and fluctuations in several metabolites including lactate and putrescine during patient exacerbation. This study has hence identified organisms whose occurrence within the CF microbiome has been hitherto unreported and has revealed potential metabolic biomarkers for exacerbation.
Chromosome painting is one of the most powerful and spectacular tools of modern molecular cytogenetics, enabling complex analyses of nuclear genome structure and evolution. For many years, this technique was restricted to the study of mammalian chromosomes, as it failed to work in plant genomes due mainly to the presence of large amounts of repetitive DNA common to all the chromosomes of the complement. The availability of ordered, chromosome-specific BAC clones of Arabidopsis thaliana containing relatively little repetitive genomic DNA enabled the first chromosome painting in dicotyledonous plants. Here, we show for the first time chromosome painting in three different cytotypes of a monocotyledonous plant—the model grass, Brachypodium distachyon. Possible directions of further detailed studies are proposed, such as the evolution of grass karyotypes, the behaviour of meiotic chromosomes, and the analysis of chromosome distribution at interphase.
Bis-(3′,5′) cyclic di-guanylate (cyclic di-GMP) is a key bacterial second messenger that is implicated in the regulation of many critical processes that include motility, biofilm formation and virulence. Cyclic di-GMP influences diverse functions through interaction with a range of effectors. Our knowledge of these effectors and their different regulatory actions is far from complete, however. Here we have used an affinity pull-down assay using cyclic di-GMP-coupled magnetic beads to identify cyclic di-GMP binding proteins in the plant pathogen Xanthomonas campestris pv. campestris (Xcc). This analysis identified XC_3703, a protein of the YajQ family, as a potential cyclic di-GMP receptor. Isothermal titration calorimetry showed that the purified XC_3703 protein bound cyclic di-GMP with a high affinity (Kd∼2 µM). Mutation of XC_3703 led to reduced virulence of Xcc to plants and alteration in biofilm formation. Yeast two-hybrid and far-western analyses showed that XC_3703 was able to interact with XC_2801, a transcription factor of the LysR family. Mutation of XC_2801 and XC_3703 had partially overlapping effects on the transcriptome of Xcc, and both affected virulence. Electromobility shift assays showed that XC_3703 positively affected the binding of XC_2801 to the promoters of target virulence genes, an effect that was reversed by cyclic di-GMP. Genetic and functional analysis of YajQ family members from the human pathogens Pseudomonas aeruginosa and Stenotrophomonas maltophilia showed that they also specifically bound cyclic di-GMP and contributed to virulence in model systems. The findings thus identify a new class of cyclic di-GMP effector that regulates bacterial virulence.
The pooid subfamily of grasses includes some of the most important crop, forage and turf species, such as wheat, barley and Lolium. Developing genomic resources, such as whole-genome physical maps, for analysing the large and complex genomes of these crops and for facilitating biological research in grasses is an important goal in plant biology. We describe a bacterial artificial chromosome (BAC)-based physical map of the wild pooid grass Brachypodium distachyon and integrate this with whole genome shotgun sequence (WGS) assemblies using BAC end sequences (BES). The resulting physical map contains 26 contigs spanning the 272 Mb genome. BES from the physical map were also used to integrate a genetic map. This provides an independent vaildation and confirmation of the published WGS assembly. Mapped BACs were used in Fluorescence In Situ Hybridisation (FISH) experiments to align the integrated physical map and sequence assemblies to chromosomes with high resolution. The physical, genetic and cytogenetic maps, integrated with whole genome shotgun sequence assemblies, enhance the accuracy and durability of this important genome sequence and will directly facilitate gene isolation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.