Ewing sarcoma, a pediatric tumor characterized by EWSR1-ETS fusions, is predominantly observed in populations of European ancestry. We performed a genome-wide association study (GWAS) of 401 French individuals with Ewing sarcoma, 684 unaffected French individuals and 3,668 unaffected individuals of European descent and living in the United States. We identified candidate risk loci at 1p36.22, 10q21 and 15q15. We replicated these loci in two independent sets of cases and controls. Joint analysis identified associations with rs9430161 (P = 1.4 × 10(-20); odds ratio (OR) = 2.2) located 25 kb upstream of TARDBP, rs224278 (P = 4.0 × 10(-17); OR = 1.7) located 5 kb upstream of EGR2 and, to a lesser extent, rs4924410 at 15q15 (P = 6.6 × 10(-9); OR = 1.5). The major risk haplotypes were less prevalent in Africans, suggesting that these loci could contribute to geographical differences in Ewing sarcoma incidence. TARDBP shares structural similarities with EWSR1 and FUS, which encode RNA binding proteins, and EGR2 is a target gene of EWSR1-ETS. Variants at these loci were associated with expression levels of TARDBP, ADO (encoding cysteamine dioxygenase) and EGR2.
Deciphering the ways in which somatic mutations and germline susceptibility variants cooperate to promote cancer is challenging. Ewing sarcoma is characterized by fusions between EWSR1 and members of the ETS gene family, usually EWSR1-FLI1, leading to the generation of oncogenic transcription factors that bind DNA at GGAA motifs1–3. A recent genome-wide association study4 identified susceptibility variants near EGR2. Here we found that EGR2 knockdown inhibited proliferation, clonogenicity and spheroidal growth in vitro and induced regression of Ewing sarcoma xenografts. Targeted germline deep sequencing of the EGR2 locus in affected subjects and controls revealed 291 Ewing-associated SNPs. At rs79965208, the A risk allele connected adjacent GGAA repeats by converting an interspaced GGAT motif into a GGAA motif, thereby increasing the number of consecutive GGAA motifs and thus the EWSR1-FLI1–dependent enhancer activity of this sequence, with epigenetic characteristics of an active regulatory element. EWSR1-FLI1 preferentially bound to the A risk allele, which increased global and allele-specific EGR2 expression. Collectively, our findings establish cooperation between a dominant oncogene and a susceptibility variant that regulates a major driver of Ewing sarcomagenesis.
Diffuse large B-cell lymphoma (DLBCL) is the most common lymphoma subtype and is clinically aggressive. To identify genetic susceptibility loci for DLBCL, we conducted a meta-analysis of three new genome-wide association studies (GWAS) and one prior scan, totaling 3,857 cases and 7,666 controls of European ancestry, with additional genotyping of nine promising SNPs in 1,359 cases and 4,557 controls. In our multi-stage analysis, five independent SNPs in four loci achieved genome-wide significance marked by rs116446171 at 6p25.3 (EXOC2; P=2.33×10 −21 ), rs2523607 at 6p21.33 (HLA-B; 2.40×10 −10 ), rs79480871 at 2p23.3 (NCOA1; P=4.23×10 −8 ), and two independent SNPs, rs13255292 and rs4733601, at 8q24.21 (PVT1; P=9.98×10 −13 and P=3.63×10 −11 , respectively). These data provide substantial new evidence for genetic susceptibility to this B-cell malignancy, and point towards pathways involved in immune recognition and immune function in the pathogenesis of DLBCL.Diffuse large B-cell lymphoma (DLBCL), the most common subtype of non-Hodgkin lymphoma (NHL) 1 , has an aggressive clinical course 2 . The risk of DLBCL is increased in individuals with a family history of NHL (odds ratio (OR)=1.4; 95%CI 1.1-2.0) 3 , supporting a genetic contribution. Also, relatives of DLBCL patients are at elevated risk for both DLBCL (RR=9.8, and Hodgkin lymphoma (HL, RR=2.0, 95%CI 1.05-Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms Correspondence should be addressed to: James R. Cerhan, M.D., Ph.D., Mayo Clinic, 200 First Street SW, Rochester, MN 55905, Phone: 507.538.0499, Fax: 507.226.2478, cerhan.james@mayo.edu. 94 These authors contributed equally to this work. 95 These authors jointly directed this work. AUTHORS CONTRIBUTIONSJ.R.C., S.I.B., S.S.W., A.N., A.R.B.-W., Q.L., G. Severi, M. Melbye, L.R.T., M.P.P., C.L., B.M.B., S.L.S., S.d.S., K.E.S., C.F.S., N.R. and S.J.C. organized and designed the study. J.R.C., L.C., L.B., A.H., P.M.B., E.A.H., S.L.S., G. Salles, C.F.S., N.R. and S.J.C. conducted and supervised the genotyping of samples. J.R.C., S.I.B., V.J., Z.W., M.Y., L.C., P.I.W.d.B., D.C., J.G., D. Zhi, Y.W.A., J.H., B.M., J.S., L.L., J.P., C.C.C., N.C., S.d.S., K.E.S., C.F.S., N.R. and S.J.C. contributed to the design and execution of statistical analysis. J.R.C., S.I.B., V.J., H.G., J.M., S.S.W., Z.W., M.Y., L.C., A.N., D.C., A.M., C.R.F., A.J.D.R., C.L., K.E.S., C.F.S., N.R. and S.J.C. wrote the first draft of the manuscript. J.R.C., V.J., H.G., J.M., S.S.W., L.C., A.N., L.B., A.M., A.R.B.-W., Q.L., G. Severi, M. Melbye, J.G., R.D.J., E.K., L.R.T., M.P.P., C.M.V., J.J.S., G.G.G., D.A., R.S.K., M.Z., K.A.B., A.Z.-J., T.M.H., B.K.L., A.J.N., A.D., Y.W.A., M.L., C.A.T., S.M.A., T.E.W., G.J.W., A.S.V., D. Zelenika, H.T., C.H., T.J.M., H.H., B.G., H.-O.A., P.M.B., J.R., M.T.S., E.A.H., W.C., P.H., L.M.M., R.K.S., L.F.T., K.E.N., N.B., Y...
Recent investigations on metazoan transcription factors (TFs) indicate that single-gene duplication events and the gain and loss of protein domains are 2 crucial factors in shaping their protein-protein interaction networks. Plant genomes, on the other hand, have a history of polyploidy and whole-genome duplications (WGDs), and thus, their study helps to understand whether WGDs have also had a significant influence on protein network evolution. Here we investigate the evolution of the interaction network in the well-studied MADS domain MIKC-type proteins, a TF family which plays an important role in both the vegetative and the reproductive phases of plant life. We combine phylogenetic reconstruction, protein domain analysis, and interaction data from different species. We show that, unlike previously analyzed interaction networks, the MIKC-type protein network displays a characteristic topology, with overall high inter-subfamily connectivity, shared interactors between paralogs, and conservation of interaction patterns across species. The evaluation of the number of MIKC-type proteins at key time points throughout the evolution of land plants in the lineage leading to Arabidopsis suggested that most duplicates were retained after each round of WGD. We provide evidence that an initial network, formed by 9-11 homodimerizing proteins interacting with each other, existed in the common ancestor of all seed plants. This basic structure has been conserved after each round of WGD, adding layers of paralogs with similar interaction patterns. We thus present the first model where we can show that a network of eukaryotic TFs has evolved via rounds of WGD. Furthermore, we found that in subfamilies in which the K domain is most diverged, the interactions with other subfamilies have been largely lost. We discuss the possibility that such a high proportion of genes were retained after each WGD because of their capacity to form higher order complexes involving proteins from different subfamilies. The simultaneous duplications allowed for the conservation of the quantitative balance between the constituents and facilitated sub- and neofunctionalization through differential expression of whole units.
In recent years, genomes from an increasing number of organisms have been sequenced, but their annotation remains a time-consuming process. The BioCyc databases offer a framework for the integrated analysis of metabolic networks. The Pathway tool software suite allows the automated construction of a database starting from an annotated genome, but it requires prior integration of all annotations into a specific summary file or into a GenBank file. To allow the easy creation and update of a BioCyc database starting from the multiple genome annotation resources available over time, we have developed an ad hoc data management system that we called Cyc Annotation Database System (CycADS). CycADS is centred on a specific database model and on a set of Java programs to import, filter and export relevant information. Data from GenBank and other annotation sources (including for example: KAAS, PRIAM, Blast2GO and PhylomeDB) are collected into a database to be subsequently filtered and extracted to generate a complete annotation file. This file is then used to build an enriched BioCyc database using the PathoLogic program of Pathway Tools. The CycADS pipeline for annotation management was used to build the AcypiCyc database for the pea aphid (Acyrthosiphon pisum) whose genome was recently sequenced. The AcypiCyc database webpage includes also, for comparative analyses, two other metabolic reconstruction BioCyc databases generated using CycADS: TricaCyc for Tribolium castaneum and DromeCyc for Drosophila melanogaster. Linked to its flexible design, CycADS offers a powerful software tool for the generation and regular updating of enriched BioCyc databases. The CycADS system is particularly suited for metabolic gene annotation and network reconstruction in newly sequenced genomes. Because of the uniform annotation used for metabolic network reconstruction, CycADS is particularly useful for comparative analysis of the metabolism of different organisms.Database URL: http://www.cycadsys.org
BackgroundFolding and intermingling of chromosomes has the potential of bringing close to each other loci that are very distant genomically or even on different chromosomes. On the other hand, genomic rearrangements also play a major role in the reorganisation of loci proximities. Whether the same loci are involved in both mechanisms has been studied in the case of somatic rearrangements, but never from an evolutionary standpoint.ResultsIn this paper, we analysed the correlation between two datasets: (i) whole-genome chromatin contact data obtained in human cells using the Hi-C protocol; and (ii) a set of breakpoint regions resulting from evolutionary rearrangements which occurred since the split of the human and mouse lineages. Surprisingly, we found that two loci distant in the human genome but adjacent in the mouse genome are significantly more often observed in close proximity in the human nucleus than expected. Importantly, we show that this result holds for loci located on the same chromosome regardless of the genomic distance separating them, and the signal is stronger in gene-rich and open-chromatin regions.ConclusionsThese findings strongly suggest that part of the 3D organisation of chromosomes may be conserved across very large evolutionary distances. To characterise this phenomenon, we propose to use the notion of spatial synteny which generalises the notion of genomic synteny to the 3D case.
Global warming is associated with increasing stress and mortality on temperate seagrass beds, in particular during periods of high sea surface temperatures during summer months, adding to existing anthropogenic impacts, such as eutrophication and habitat destruction. We compare several expressed sequence tag (EST) in the ecologically important seagrass Zostera marina (eelgrass) to elucidate the molecular genetic basis of adaptation to environmental extremes. We compared the tentative unigene (TUG) frequencies of libraries derived from leaf and meristematic tissue from a control situation with two experimentally imposed temperature stress conditions and found that TUG composition is markedly different among these conditions (all P<0.0001). Under heat stress, we find that 63 TUGs are differentially expressed (d.e.) at 25°C compared with lower, no-stress condition temperatures (4°C and 17°C). Approximately one-third of d.e. eelgrass genes were characteristic for the stress response of the terrestrial plant model Arabidopsis thaliana. The changes in gene expression suggest complex photosynthetic adjustments among light-harvesting complexes, reaction center subunits of photosystem I and II, and components of the dark reaction. Heat shock encoding proteins and reactive oxygen scavengers also were identified, but their overall frequency was too low to perform statistical tests. In all conditions, the most abundant transcript (3-15%) was a putative metallothionein gene with unknown function. We also find evidence that heat stress may translate to enhanced infection by protists. A total of 210 TUGs contain one or more microsatellites as potential candidates for gene-linked genetic markers. Data are publicly available in a userfriendly database at http://www.uni-muenster.de/Evolution/ ebb/Services/zostera.
Our study provides encouraging evidence for associations between loci at 5q23.2 and 6q21 with EFS and OS in patients with diffuse large B-cell lymphoma treated with immunochemotherapy, suggesting novel biology and the potential contribution of host genetics to the prognosis of this aggressive malignancy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.