An annotated reference sequence representing the hexaploid bread wheat genome in 21 pseudomolecules has been analyzed to identify the distribution and genomic context of coding and noncoding elements across the A, B, and D subgenomes. With an estimated coverage of 94% of the genome and containing 107,891 high-confidence gene models, this assembly enabled the discovery of tissue- and developmental stage–related coexpression networks by providing a transcriptome atlas representing major stages of wheat development. Dynamics of complex gene families involved in environmental adaptation and end-use quality were revealed at subgenome resolution and contextualized to known agronomic single-gene or quantitative trait loci. This community resource establishes the foundation for accelerating wheat research and application through improved understanding of wheat biology and genomics-assisted breeding.
contributed to the design and execution of the overall study. M.P.P., M.J., J.R.T., G.S., L.E.M., L.A.K., X.W., V.G., K.B.J., J.D.M., N.R., S.J.C., and P Brennan contributed to the statistical analysis. M.P.P., M.J., S.J.C. and P. Brennan wrote the first draft of the manuscript. D. Zeleniak, E.P., L.A.K., X.W., K.B.J., S.H.V., S.L.M., Y.Y., A.M.M., E.S.B., N.N.C., M.F., D.L., I.G., S.H., H. Blanche, A.H., G.T., Z.W., M.Y., K.G.S., S.J.C., and M.L. supervised or conducted the genotyping. The remaining authors conducted the epidemiologic studies and contributed samples to the GWAS and/or replication. All authors contributed to the writing of the manuscript. NIH Public Access Author ManuscriptNat Genet. Author manuscript; available in PMC 2012 January 1. AbstractWe conducted a two-stage genome-wide association study of renal cell carcinoma (RCC) in 3,772 cases and 8,505 controls of European background from 11 studies, and followed up 6 SNPs in three replication studies of 2,198 cases and 4,918 controls. Two loci on the regions of 2p21 and 11q13.3 were associated with RCC susceptibility below genome-wide significance. Two correlated variants (r 2 = 0.99 in controls), rs11894252 (P = 1.8×10 −8 ) and rs7579899 (P = 2.3×10 −9 ), map to EPAS1 on 2p21, which encodes hypoxia-inducible-factor-2 alpha, a transcription factor previously implicated in RCC. The second locus, rs7105934, at 11q13, contains no characterized genes (P = 7.8×10 −14 ). In addition, we observed a promising association on 12q24.31 for rs4765623 which maps to the scavenger receptor class B, member 1 (SCARB1) gene (P = 2.6×10 −8 ). Our study reports novel genomic regions associated with RCC risk that may lead to new etiological insights. Table 1, Online Methods and Supplementary note). All subjects from the IARC/CNG study were genotyped at the CNG with the exception of 305 cases and 323 controls from Russia that were genotyped at the Center "Bioengineering" and at the "Kurchatov Institute" in Moscow. All subjects from the NCI study were scanned at the NCI Core Genotyping Facility. In addition, 1,438 controls from the Wellcome Trust Case-Control Consortium were genotyped at the Sanger Institute, UK 10 . All RCC cases were defined on the basis of the International Classification of Diseases for Oncology, Second Edition (ICD-O-2), and included all cancers that were coded as C64.Comparable quality control metrics were applied to the two scanned data sets and following sample and SNP exclusions, genotype data for up to 577,547 SNPs were available for 2,461 cases and 5,081 controls in the IARC/CNG scan, while data for 585,576 SNPs were available for 1,311 cases and 3,424 controls in the NCI scan (Online Methods). Primary analyses were conducted using unconditional logistic regression models for genotype trend effects (1 degree of freedom) and adjusted for sex, country, eigenvectors, and study for the USA (Online Methods). In order to compute summary findings across both scans, a metaanalysis was performed using a fixed effects model with inverse variance wei...
The incidence of renal cell carcinoma (RCC) is increasing worldwide, and its prevalence is particularly high in some parts of Central Europe. Here we undertake whole-genome and transcriptome sequencing of clear cell RCC (ccRCC), the most common form of the disease, in patients from four different European countries with contrasting disease incidence to explore the underlying genomic architecture of RCC. Our findings support previous reports on frequent aberrations in the epigenetic machinery and PI3K/mTOR signalling, and uncover novel pathways and genes affected by recurrent mutations and abnormal transcriptome patterns including focal adhesion, components of extracellular matrix (ECM) and genes encoding FAT cadherins. Furthermore, a large majority of patients from Romania have an unexpected high frequency of A:T4T:A transversions, consistent with exposure to aristolochic acid (AA). These results show that the processes underlying ccRCC tumorigenesis may vary in different populations and suggest that AA may be an important ccRCC carcinogen in Romania, a finding with major public health implications.
Mouse genome contains two major families of short interspersed repeats in more than 10(5) copies scattered throughout the whole genome. They are referred to as B1 and B2 sequences since they were first isolated from the genome library by means of a dsRNA-B probe /1/. In this work, two copies of the B2 family were sequenced and compared with the previously sequenced B1 repeat /2/. A B2 ubiquitous repeat is ca. 190 bp long. The members of the family deviate in 3-5% of nucleotides from the consensus sequence. B2 contains regions of homology to the RNA polymerase III split promoter and to 4.5S snRNA I. Both B1 and B2 contain regions which resemble junctions between exons and introns. In contrast to B1, B2 does not contain apparent homologies to papova viral replication origins and a human Alu sequence. One side of the B2 repeat is represented by a very AT-rich sequence (ca. 30 bp long) followed with an oligo (dA) stretch 10-15 nucleotides long. This region of the repeat is the most variable one. The whole unit is flanked with 15-16 bp direct repeats different in sequenced copies of B2. The same is true of some copies of the B1 family. The properties of B1 and B2 repeats suggest that they may represent a novel class of transposon-like elements in eukaryotic genome. A possible role of B-type repeats in genome reorganization, DNA replication and pre-mRNA processing is discussed.
Three copies of a highly repetitive DNA sequence B1 which is complementary to the most abundant class of mouse fold-back RNA have been cloned in pBR322 plasmid and sequenced by the method of Maxam and Gilbert. All the three have a length of about 130 base pairs and are very similar in their base sequence. The deviation from the average sequence is equal to 4% and the overall mismatch between each two is not higher than 8%. One of the recombinant clones used contained two copies of B1 oriented in the same direction. All of the B1 copies are flanked with sequences which possess nonidentical but very similar structure. They consist of a number of AmCn blocks (where m varies from 2 to 8 and n equals 1-2). These peculiar sequences in all cases are separated from B1 by non-homologous DNA stretches of 2-8 residues. In one case, a long polypurine stretch is located next to such a block. It consists of 74 residues most of which represent a reiteration of the basic sequence AAAAG. We have found two regions within the B1 sequence which are homologous to the intron-exon junctions, especially to those present in the large intron of the mouse beta-globin gene. It may indicate the involvement of the B1 sequence in pre-mRNA splicing.
Previous genome-wide association studies (GWAS) have identified six risk loci for renal cell carcinoma (RCC). We conducted a meta-analysis of two new scans of 5,198 cases and 7,331 controls together with four existing scans, totalling 10,784 cases and 20,406 controls of European ancestry. Twenty-four loci were tested in an additional 3,182 cases and 6,301 controls. We confirm the six known RCC risk loci and identify seven new loci at 1p32.3 (rs4381241, P=3.1 × 10−10), 3p22.1 (rs67311347, P=2.5 × 10−8), 3q26.2 (rs10936602, P=8.8 × 10−9), 8p21.3 (rs2241261, P=5.8 × 10−9), 10q24.33-q25.1 (rs11813268, P=3.9 × 10−8), 11q22.3 (rs74911261, P=2.1 × 10−10) and 14q24.2 (rs4903064, P=2.2 × 10−24). Expression quantitative trait analyses suggest plausible candidate genes at these regions that may contribute to RCC susceptibility.
The complete nucleotide sequence of the 25 S rRNA gene from one rDNA repeating unit of Saccharomyces cerevisiae has been determined. The corresponding 25 S rRNA molecule contains 3392 nucleotides and has an estimated relative molecular mass (Mr, Na-salt) or 1.17 x 10(6). Striking sequence homology is observed with known 5'- and 3'-end terminal segments of L-rRNA from other eukaryotes. Possible models of interaction with 5.8 S rRNA are discussed.
Four full-length MADS-box cDNAs from chrysanthemum, designated Chrysanthemum Dendrathema grandiflorum MADS (CDM) 8, CDM41, CDM111, and CDM44, have been isolated and further functionally characterized. Protein sequence alignment and expression patterns of the corresponding genes suggest that CDM8 and CDM41 belong to the FRUITFULL (FUL) clade, CDM111 is a member of the APETALA1 (AP1) subfamily, and CDM44 is a member of the SEPALLATA3 (SEP3) subfamily of MADS-box transcription factors. Overexpression of CDM111 in Arabidopsis plants resulted in an aberrant phenotype that is reminiscent of the phenotype obtained by ectopic expression of the AP1 gene. In addition, CDM111 was able to partially complement the ap1-1 mutant from Arabidopsis, illustrating that CDM111 is the functional equivalent to AP1. Yeast two-and three-hybrid studies were performed to investigate the potential protein interactions and complexes in which these chrysanthemum MADS-box proteins are involved. Based on these studies, we conclude that CDM44 is most likely the SEP3 functional equivalent, because the CDM44 protein interacts with CDM proteins of the AP1/FUL and AG subfamilies, and as a higher order complex with the heterodimer between the presumed B-type CDM proteins.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.