Here we analyse genetic variation, population structure and diversity among 3,010 diverse Asian cultivated rice (Oryza sativa L.) genomes from the 3,000 Rice Genomes Project. Our results are consistent with the five major groups previously recognized, but also suggest several unreported subpopulations that correlate with geographic location. We identified 29 million single nucleotide polymorphisms, 2.4 million small indels and over 90,000 structural variations that contribute to within-and between-population variation. Using pan-genome analyses, we identified more than 10,000 novel full-length protein-coding genes and a high number of presence-absence variations. The complex patterns of introgression observed in domestication genes are consistent with multiple independent rice domestication events. The public availability of data from the 3,000 Rice Genomes Project provides a resource for rice genomics research and breeding.Asian cultivated rice is grown worldwide and comprises the staple food for half of the global population. It is envisaged that by the year 2035 1 feeding this growing population will necessitate that an additional 112 million metric tons of rice be produced on a smaller area of land, using less water and under more fluctuating climatic conditions, which will require that future rice cultivars be higher yielding and resilient to multiple abiotic and biotic stresses. The foundation of the continued improvement of rice cultivars is the rich genetic diversity within domesticated populations and wild relatives [2][3][4] . For over 2,000 years, two major types of O. sativa-O. sativa Xian group (here referred to as Xian/Indica (XI) and also known as , Hsien or Indica) and O. sativa Geng Group (here referred to as Geng/Japonica (GJ) and also known as , Keng or Japonica)-have historically been recognized [5][6][7] . Varied degrees of post-reproductive barriers exist between XI and GJ rice accessions 8 ; this differentiation between XI and GJ rice types and the presence of different varietal groups are well-documented at isozyme and DNA levels 6,9 . Two other distinct groups have also been recognized using molecular markers 10 ; one of these encompasses the Aus, Boro and Rayada ecotypes from Bangladesh and India (which we term the circum-Aus group (cA)) and the other comprises the famous Basmati and Sadri aromatic varieties (which we term the circum-Basmati group (cB)).Approximately 780,000 rice accessions are available in gene banks worldwide 11 . To enable the more efficient use of these accessions in future rice improvement, the Chinese Academy of Agricultural Sciences, BGI-Shenzhen and International Rice Research Institute sequenced over 3,000 rice genomes (3K-RG) as part of the 3,000 Rice Genomes Project 12. Here we present analyses of genetic variation in the 3K-RG that focus on important aspects of O. sativa diversity, single nucleotide polymorphisms (SNPs) and structural variation (deletions, duplications, inversions and translocations). We also construct a species pangenome consisting of 'core...
Genetic diversity is key to crop improvement. Owing to pervasive genomic structural variation, a single reference genome assembly cannot capture the full complement of sequence diversity of a crop species (known as the ‘pan-genome’1). Multiple high-quality sequence assemblies are an indispensable component of a pan-genome infrastructure. Barley (Hordeum vulgare L.) is an important cereal crop with a long history of cultivation that is adapted to a wide range of agro-climatic conditions2. Here we report the construction of chromosome-scale sequence assemblies for the genotypes of 20 varieties of barley—comprising landraces, cultivars and a wild barley—that were selected as representatives of global barley diversity. We catalogued genomic presence/absence variants and explored the use of structural variants for quantitative genetic analysis through whole-genome shotgun sequencing of 300 gene bank accessions. We discovered abundant large inversion polymorphisms and analysed in detail two inversions that are frequently found in current elite barley germplasm; one is probably the product of mutation breeding and the other is tightly linked to a locus that is involved in the expansion of geographical range. This first-generation barley pan-genome makes previously hidden genetic variation accessible to genetic studies and breeding.
A pan-genome is the union of the gene sets of all the individuals of a clade or a species and it provides a new dimension of genome complexity with the presence/absence variations (PAVs) of genes among these genomes. With the progress of sequencing technologies, pan-genome study is becoming affordable for eukaryotes with large-sized genomes. The Asian cultivated rice, Oryza sativa L., is one of the major food sources for the world and a model organism in plant biology. Recently, the 3000 Rice Genome Project (3K RGP) sequenced more than 3000 rice genomes with a mean sequencing depth of 14.3×, which provided a tremendous resource for rice research. In this paper, we present a genome browser, Rice Pan-genome Browser (RPAN), as a tool to search and visualize the rice pan-genome derived from 3K RGP. RPAN contains a database of the basic information of 3010 rice accessions, including genomic sequences, gene annotations, PAV information and gene expression data of the rice pan-genome. At least 12 000 novel genes absent in the reference genome were included. RPAN also provides multiple search and visualization functions. RPAN can be a rich resource for rice biology and rice breeding. It is available at http://cgm.sjtu.edu.cn/3kricedb/ or http://www.rmbreeding.cn/pan3k.
While previous studies have shown that histone modifications could influence plant growth and development by regulating gene transcription, knowledge about the relationships between these modifications and gene expression is still limited. This study used chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-Seq), to investigate the genome-wide distribution of four histone modifications: di and trimethylation of H3K4 (H3K4me2 and H3K4me3) and acylation of H3K9 and H3K27 (H3K9ac and H3K27ac) in Oryza sativa L. japonica. By analyzing published DNase-Seq data, this study explored DNase-Hypersensitive (DH) sites along the rice genome. The histone marks appeared mainly in generic regions and were enriched around the transcription start sites (TSSs) of genes. This analysis demonstrated that the four histone modifications and the DH sites were all associated with active transcription. Furthermore, the four histone modifications were highly concurrent with transcript regions-a promising feature that was used to predict missing genes in the rice gene annotation. The predictions were further validated by experimentally confirming the transcription of two predicted missing genes. Moreover, a sequence motif analysis was constructed in order to identify the DH sites and many putative transcription factor binding sites.
Appearance and milling quality are two crucial properties of rice grains affecting its market acceptability. Understanding the genetic base of rice grain quality could considerably improve the high quality breeding. Here, we carried out an association analysis to identify QTL affecting nine rice grain appearance and milling quality traits using a diverse panel of 258 accessions selected from 3K Rice Genome Project and evaluated in two environments Sanya and Shenzhen. Genome-wide association analyses using 22,488 high quality SNPs identified 72 QTL affecting the nine traits. Combined gene-based association and haplotype analyses plus functional annotation allowed us to shortlist 19 candidate genes for seven important QTL regions affecting the grain quality traits, including two cloned genes (GS3 and TUD), two fine mapped QTL (qGRL7.1 and qPGWC7) and three newly identified QTL (qGL3.4, qGW1.1, and qGW10.2). The most likely candidate gene(s) for each important QTL were also discussed. This research demonstrated the superior power to shortlist candidate genes affecting complex phenotypes by the strategy of combined GWAS, gene-based association and haplotype analyses. The identified candidate genes provided valuable sources for future functional characterization and genetic improvement of rice appearance and milling quality.
High salinity is one of the main factors limiting cotton growth and productivity. The genes that regulate salt stress in TM-1 upland cotton were monitored using microarray and real-time PCR (RT-PCR) with samples taken from roots. Microarray analysis showed that 1503 probe sets were up-regulated and 1490 probe sets were down-regulated in plants exposed for 3h to 100mM NaCl, and RT-PCR analysis validated 42 relevant/related genes. The distribution of enriched gene ontology terms showed such important processes as the response to water stress and pathways of hormone metabolism and signal transduction were induced by the NaCl treatment. Some key regulatory gene families involved in abiotic and biotic sources of stress such as WRKY, ERF, and JAZ were differentially expressed. Our transcriptome analysis might provide some useful insights into salt-mediated signal transduction pathways in cotton and offer a number of candidate genes as potential markers of tolerance to salt stress.
HighlightUnder darkness, JAZ7 was up-regulated and the mutant showed a severe leaf senescence phenotype. Genetics and transcriptomic analysis revealed JAZ7 as an important regulator of dark-induced leaf senescence.
Plant genera with both diploid and polyploid species are a common evolutionary occurrence. Polyploids, especially allopolyploids such as cotton and wheat, are a great model system for heterosis research. Here, we have integrated genome sequences and transcriptome data of Gossypium species to construct co-expression networks and identified functional modules from different cotton species, including 1155 and 1884 modules in G. arboreum and G. hirsutum, respectively. We overlayed the gene expression results onto the co-expression network. We further provided network comparison analysis for orthologous genes across the diploid and allotetraploid Gossypium. We also constructed miRNA-target networks and predicted PPI networks for both cotton species. Furthermore, we integrated in-house ChIP-seq data of histone modification (H3K4me3) together with cis-element analysis and gene sets enrichment analysis tools for studying possible gene regulatory mechanism in Gossypium species. Finally, we have constructed an online ccNET database (http://structuralbiology.cau.edu.cn/gossypium) for comparative gene functional analyses at a multi-dimensional network and epigenomic level across diploid and polyploid Gossypium species. The ccNET database will be beneficial for community to yield novel insights into gene/module functions during cotton development and stress response, and might be useful for studying conservation and diversity in other polyploid plants, such as T. aestivum and Brassica napus.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.