Here we analyse genetic variation, population structure and diversity among 3,010 diverse Asian cultivated rice (Oryza sativa L.) genomes from the 3,000 Rice Genomes Project. Our results are consistent with the five major groups previously recognized, but also suggest several unreported subpopulations that correlate with geographic location. We identified 29 million single nucleotide polymorphisms, 2.4 million small indels and over 90,000 structural variations that contribute to within-and between-population variation. Using pan-genome analyses, we identified more than 10,000 novel full-length protein-coding genes and a high number of presence-absence variations. The complex patterns of introgression observed in domestication genes are consistent with multiple independent rice domestication events. The public availability of data from the 3,000 Rice Genomes Project provides a resource for rice genomics research and breeding.Asian cultivated rice is grown worldwide and comprises the staple food for half of the global population. It is envisaged that by the year 2035 1 feeding this growing population will necessitate that an additional 112 million metric tons of rice be produced on a smaller area of land, using less water and under more fluctuating climatic conditions, which will require that future rice cultivars be higher yielding and resilient to multiple abiotic and biotic stresses. The foundation of the continued improvement of rice cultivars is the rich genetic diversity within domesticated populations and wild relatives [2][3][4] . For over 2,000 years, two major types of O. sativa-O. sativa Xian group (here referred to as Xian/Indica (XI) and also known as , Hsien or Indica) and O. sativa Geng Group (here referred to as Geng/Japonica (GJ) and also known as , Keng or Japonica)-have historically been recognized [5][6][7] . Varied degrees of post-reproductive barriers exist between XI and GJ rice accessions 8 ; this differentiation between XI and GJ rice types and the presence of different varietal groups are well-documented at isozyme and DNA levels 6,9 . Two other distinct groups have also been recognized using molecular markers 10 ; one of these encompasses the Aus, Boro and Rayada ecotypes from Bangladesh and India (which we term the circum-Aus group (cA)) and the other comprises the famous Basmati and Sadri aromatic varieties (which we term the circum-Basmati group (cB)).Approximately 780,000 rice accessions are available in gene banks worldwide 11 . To enable the more efficient use of these accessions in future rice improvement, the Chinese Academy of Agricultural Sciences, BGI-Shenzhen and International Rice Research Institute sequenced over 3,000 rice genomes (3K-RG) as part of the 3,000 Rice Genomes Project 12. Here we present analyses of genetic variation in the 3K-RG that focus on important aspects of O. sativa diversity, single nucleotide polymorphisms (SNPs) and structural variation (deletions, duplications, inversions and translocations). We also construct a species pangenome consisting of 'core...
Raising the bar: The efficacy of bioorthogonal reactions for bioconjugation has been thoroughly evaluated in four different biological settings. Powered by the development of new biocompatible ligands, the copper‐catalyzed azide–alkyne cycloaddition (see picture) has brought about unsurpassed bioconjugation efficiency, and thus it holds great promise as a highly potent and adaptive tool for a broader spectrum of biological applications.
Dystroglycan is a highly glycosylated extracellular matrix receptor with essential functions in skeletal muscle and the nervous system. Reduced matrix binding by α-dystroglycan (α-DG) due to perturbed glycosylation is a pathological feature of several forms of muscular dystrophy. Like-acetylglucosaminyltransferase (LARGE) synthesizes the matrix-binding heteropolysaccharide [-glucuronic acid-β1,3-xylose-α1,3-]n. Using a dual exoglycosidase digestion, we confirm that this polysaccharide is present on native α-DG from skeletal muscle. The atomic details of matrix binding were revealed by a high-resolution crystal structure of laminin G-like (LG) domains 4–5 of laminin α2 bound to a LARGE-synthesized oligosaccharide. A single glucuronic acid-β1,3-xylose disaccharide repeat straddles a Ca2+ ion in the LG4 domain, with oxygen atoms from both sugars replacing Ca2+-bound water molecules. The chelating binding mode accounts for the high affinity of this protein-carbohydrate interaction. These results reveal a novel mechanism of carbohydrate recognition and provide a structural framework for elucidating the mechanisms underlying muscular dystrophy.
A pan-genome is the union of the gene sets of all the individuals of a clade or a species and it provides a new dimension of genome complexity with the presence/absence variations (PAVs) of genes among these genomes. With the progress of sequencing technologies, pan-genome study is becoming affordable for eukaryotes with large-sized genomes. The Asian cultivated rice, Oryza sativa L., is one of the major food sources for the world and a model organism in plant biology. Recently, the 3000 Rice Genome Project (3K RGP) sequenced more than 3000 rice genomes with a mean sequencing depth of 14.3×, which provided a tremendous resource for rice research. In this paper, we present a genome browser, Rice Pan-genome Browser (RPAN), as a tool to search and visualize the rice pan-genome derived from 3K RGP. RPAN contains a database of the basic information of 3010 rice accessions, including genomic sequences, gene annotations, PAV information and gene expression data of the rice pan-genome. At least 12 000 novel genes absent in the reference genome were included. RPAN also provides multiple search and visualization functions. RPAN can be a rich resource for rice biology and rice breeding. It is available at http://cgm.sjtu.edu.cn/3kricedb/ or http://www.rmbreeding.cn/pan3k.
Tremendous efforts have been taken worldwide to develop genome-wide genetic stocks for rice functional genomic (FG) research since the rice genome was completely sequenced. To facilitate FG research of complex polygenic phenotypes in rice, we report the development of over 20,000 introgression lines (ILs) in three elite rice genetic backgrounds for a wide range of complex traits, including resistances/tolerances to many biotic and abiotic stresses, morpho-agronomic traits, physiological traits, etc., by selective introgression. ILs within each genetic background are phenotypically similar to their recurrent parent but each carries one or a few traits introgressed from a known donor. Together, these ILs contain a significant portion of loci affecting the selected complex phenotypes at which allelic diversity exists in the primary gene pool of rice. A forward genetics strategy was proposed and demonstrated with examples on how to use these ILs for large-scale FG research. Complementary to the genome-wide insertional mutants, these ILs opens a new way for highly efficient discovery, candidate gene identification and cloning of important QTLs for specific phenotypes based on convergent evidence from QTL position, expression profiling, functional and molecular diversity analyses of candidate genes, highlights the importance of genetic networks underlying complex phenotypes in rice that may ultimately lead to more complete understanding of the genetic and molecular bases of quantitative trait variation in rice.
Lewis X (Le x )-containing glycans play important roles in numerous cellular processes. However, the absence of robust, facile, and cost-effective methods for the synthesis of Le x and its structurally related analogs has severely hampered the elucidation of the specific functions of these glycan epitopes. Here we demonstrate that chemically defined guanidine 5-diphosphate--L-fucose (GDPfucose), the universal fucosyl donor, the Le x trisaccharide, and their C-5 substituted derivatives can be synthesized on preparative scales, using a chemoenzymatic approach. This method exploits L-fucokinase/GDP-fucose pyrophosphorylase (FKP), a bifunctional enzyme isolated from Bacteroides fragilis 9343, which converts L-fucose into GDP-fucose via a fucose-1-phosphate (Fuc-1-P) intermediate. Combining the activities of FKP and a Helicobacter pylori ␣1,3 fucosyltransferase, we prepared a library of Le x trisaccharide glycans bearing a wide variety of functional groups at the fucose C-5 position. These neoglycoconjugates will be invaluable tools for studying Le x -mediated biological processes.glycobiology ͉ catalysis ͉ enzyme
To monitor the kinetics of biological processes that take place within the minute time scale, simple and fast analytical methods are required. In this article, we present our discovery of an azide with an internal Cu(I)-chelating motif that enabled the development of the fastest protocol for Cu(I)-catalyzed azide–alkyne cycloaddition (CuAAC) to date, and its application toward following the dynamic process of glycan biosynthesis. We discovered that an electron-donating picolyl azide boosted the efficiency of the ligand-accelerated CuAAC 20–38-fold in living systems with no apparent toxicity. With a combination of this azide and BTTPS, a tris(triazolylmethyl)amine-based ligand for Cu(I), we were able to detect newly synthesized cell-surface glycans by flow cytometry using as low as 1 nM of a metabolic precursor. This supersensitive chemistry enabled us to monitor the dynamic glycan biosynthesis in mammalian cells and in early zebrafish embryogenesis. In live mammalian cells, we discovered that it takes approximately 30–45 min for a monosaccharide building block to be metabolized and incorporated into cell-surface glycoconjugates. In zebrafish embryos, the labeled glycans could be detected as early as the two-cell stage. To our knowledge, this was the first time that newly synthesized glycans were detected at the cleavage period (0.75–2 hpf) in an animal model using bioorthogonal chemistry.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.