About 8,000 years ago in the Fertile Crescent, a spontaneous hybridization of the wild diploid grass Aegilops tauschii (2n 5 14; DD) with the cultivated tetraploid wheat Triticum turgidum (2n 5 4x 5 28; AABB) resulted in hexaploid wheat (T. aestivum; 2n 5 6x 5 42; AABBDD) 1,2 . Wheat has since become a primary staple crop worldwide as a result of its enhanced adaptability to a wide range of climates and improved grain quality for the production of baker's flour 2 . Here we describe sequencing the Ae. tauschii genome and obtaining a roughly 90-fold depth of short reads from libraries with various insert sizes, to gain a better understanding of this genetically complex plant. The assembled scaffolds represented 83.4% of the genome, of which 65.9% comprised transposable elements. We generated comprehensive RNA-Seq data and used it to identify 43,150 protein-coding genes, of which 30,697 (71.1%) were uniquely anchored to chromosomes with an integrated high-density genetic map. Whole-genome analysis revealed gene family expansion in Ae. tauschii of agronomically relevant gene families that were associated with disease resistance, abiotic stress tolerance and grain quality. This draft genome sequence provides insight into the environmental adaptation of bread wheat and can aid in defining the large and complicated genomes of wheat species.We selected Ae. tauschii accession AL8/78 for genome sequencing because it has been extensively characterized genetically (Supplementary Information). Using a whole genome shotgun strategy, we generated 398 Gb of high-quality reads from 45 libraries with insert sizes ranging from 200 bp to 20 kb (Supplementary Information). A hierarchical, iterative assembly of short reads employing the parallelized sequence assembler SOAPdenovo 3 achieved contigs with an N50 length (minimum length of contigs representing 50% of the assembly) of 4,512 bp (Table 1). Paired-end information combined with an additional 18.4 Gb of Roche/454 long-read sequences was used sequentially to generate 4.23-Gb scaffolds (83.4% were non-gapped contiguous sequences) with an N50 length of 57.6 kb (Supplementary Information). The assembly represented 97% of the 4.36-Gb genome as estimated by K-mer analysis (Supplementary Information). We also obtained 13,185 Ae. tauschii expressed sequence tag (EST) sequences using Sanger sequencing, of which 11,998 (91%) could be mapped to the scaffolds with more than 90% coverage (Supplementary Information).To aid in gene identification, we performed RNA-Seq (53.2 Gb for a 117-Mb transcriptome assembly) on 23 libraries representing eight tissues including pistil, root, seed, spike, stamen, stem, leaf and sheath (Supplementary Information). Using both evidence-based and de novo gene predictions, we identified 34,498 high-confidence protein-coding loci. FGENESH 4 and GeneID models were supported by a 60% overlap with either our ESTs and RNA-Seq reads, or with homologous proteins. More than 76% of the gene models had a significant match (E value # 10 25; alignment length $ 60%) in the ...
The small RNA transcriptomes of bread wheat and its emerging model Brachypodium distachyon were obtained by using deep sequencing technology. Small RNA compositions were analyzed in these two species. In addition to 70 conserved microRNAs (miRNAs) from 25 families, 23 novel wheat miRNAs were identified. For Brachypodium, 12 putative miRNAs were predicted from a limited number of expressed sequence tags, of which one was a potential novel miRNA. Also, 94 conserved miRNAs from 28 families were identified in this species. Expression validation was performed for several novel wheat miRNAs. RNA ligase-mediated 5' rapid amplification of complementary DNA ends experiments demonstrated their capability to cleave predicted target genes including three disease-resistant gene analogs. Differential expression of miRNAs was observed between Brachypodium vegetative and reproductive tissues, suggesting their different roles at the two growth stages. Our work significantly increases the novel miRNA numbers in wheat and provides the first set of small RNAs in B. distachyon.
Recent research has made impressive progress in single-turn dialogue modelling. In the multi-turn setting, however, current models are still far from satisfactory. One major challenge is the frequently occurred coreference and information omission in our daily conversation, making it hard for machines to understand the real intention. In this paper, we propose rewriting the human utterance as a pre-process to help multi-turn dialgoue modelling. Each utterance is first rewritten to recover all coreferred and omitted information. The next processing steps are then performed based on the rewritten utterance. To properly train the utterance rewriter, we collect a new dataset with human annotations and introduce a Transformer-based utterance rewriting architecture using the pointer network. We show the proposed architecture achieves remarkably good performance on the utterance rewriting task. The trained utterance rewriter can be easily integrated into online chatbots and brings general improvement over different domains. 1
ORCID IDs: 0000-0003-4953-7258 (L.W.); 0000-0002-3377-4040 (L.M.).The highly conserved florigen gene FLOWERING LOCUS T (FT) functions at the core of the flowering pathways. Extensive studies have examined the transcriptional regulation of FT; however, other layers of FT regulation remain unclear. Here, we identified miR5200 a Pooideae-specific microRNA that is expressed in leaves and targets Brachypodium distachyon FT orthologs for mRNA cleavage. miR5200 was abundantly expressed in plants grown under short-day (SD) conditions but was dramatically repressed in plants transferred to long-day (LD) conditions. We also found that the epigenetic chromatin status, specifically the levels of histone methylation marks, at miR5200 precursor loci changed in response to daylength. Moreover, artificial interruption of miR5200 activity by target mimicry in B. distachyon altered flowering time in SD but not in LD conditions, suggesting that miR5200 functions in photoperiod-mediated flowering time regulation. Together, these findings illustrate a posttranscriptional regulation mechanism of FT and provide insights into understanding of the multiple concerted pathways for flowering time control in plants.
MADS-box genes are important transcription factors for plant development, especially floral organogenesis. Brachypodium distachyon is a model for biofuel plants and temperate grasses such as wheat and barley, but a comprehensive analysis of MADS-box family proteins in Brachypodium is still missing. We report here a genome-wide analysis of the MADS-box gene family in Brachypodium distachyon. We identified 57 MADS-box genes and classified them into 32 MIKCc-type, 7 MIKC*-type, 9 Mα, 7 Mβ and 2 Mγ MADS-box genes according to their phylogenetic relationships to the Arabidopsis and rice MADS-box genes. Detailed gene structure and motif distribution were then studied. Investigation of their chromosomal localizations revealed that Brachypodium MADS-box genes distributed evenly across five chromosomes. In addition, five pairs of type II MADS-box genes were found on synteny blocks derived from whole genome duplication blocks. We then performed a systematic expression analysis of Brachypodium MADS-box genes in various tissues, particular floral organs. Further detection under salt, drought, and low-temperature conditions showed that some MADS-box genes may also be involved in abiotic stress responses, including type I genes. Comparative studies of MADS-box genes among Brachypodium, rice and Arabidopsis showed that Brachypodium had fewer gene duplication events. Taken together, this work provides useful data for further functional studies of MADS-box genes in Brachypodium distachyon.
Modern plant genomes are diploidized paleopolyploids. We revisited grass genome paleohistory in response to the diploidization process through a detailed investigation of the evolutionary fate of duplicated blocks. Ancestrally duplicated genes can be conserved, deleted, and shuffled, defining dominant (bias toward duplicate retention) and sensitive (bias toward duplicate erosion) chromosomal fragments. We propose a new grass genome paleohistory deriving from an ancestral karyotype structured in seven protochromosomes containing 16,464 protogenes and following evolutionary rules where 1) ancestral shared polyploidizations shaped conserved dominant (D) and sensitive (S) subgenomes, 2) subgenome dominance is revealed by both gene deletion and shuffling from the S blocks, 3) duplicate deletion/movement may have been mediated by single-/double-stranded illegitimate recombination mechanisms, 4) modern genomes arose through centromeric fusion of protochromosomes, leading to functional monocentric neochromosomes, 5) the fusion of two dominant blocks leads to supradominant neochromosomes (D + D = D) with higher ancestral gene retention compared with D + S = D (i.e., fusion of blocks with opposite sensitivity) or even S + S = S (i.e., fusion of two sensitive ancestral blocks). A new user-friendly online tool named “PlantSyntenyViewer,” available at http://urgi.versailles.inra.fr/synteny-cereal, presents the refined comparative genomics data.
Tomato flower abscises at the anatomically distinct abscission zone that separates the pedicel into basal and apical portions. During abscission, cell separation occurs only at the abscission zone indicating distinctive molecular regulation in its cells. We conducted a transcriptome analysis of tomato pedicel tissues during ethylene promoted abscission. We found that the abscission zone was the most active site with the largest set of differentially expressed genes when compared with basal and apical portions. Gene Ontology analyses revealed enriched transcription regulation and hydrolase activities in the abscission zone. We also demonstrate coordinated responses of hormone and cell wall related genes. Besides, a number of ESTs representing homologs of key Arabidopsis shoot apical meristem activity genes were found to be preferentially expressed in the abscission zone, including WUSCHEL (WUS), KNAT6, LATERAL ORGAN BOUNDARIES DOMAIN PROTEIN 1(LBD1), and BELL-like homeodomain protein 1 (BLH1), as well as tomato axillary meristem genes BLIND (Bl) and LATERAL SUPPRESSOR (Ls). More interestingly, the homologs of WUS and the potential functional partner OVATE FAMILIY PROTEIN (OFP) were subsequently down regulated during abscission while Bl and AGL12 were continuously and specifically induced in the abscission zone. The expression patterns of meristem activity genes corroborate the idea that cells of the abscission zone confer meristem-like nature and coincide with the course of abscission and post-abscission cell differentiation. Our data therefore propose a possible regulatory scheme in tomato involving meristem genes that may be required not only for the abscission zone development, but also for abscission.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.