SUMMARY Gene fusions represent an important class of somatic alterations in cancer. We systematically investigated fusions in 9,624 tumors across 33 cancer types using multiple fusion calling tools. We identified a total of 25,664 fusions, with a 63% validation rate. Integration of gene expression, copy number, and fusion annotation data revealed that fusions involving oncogenes tend to exhibit increased expression, whereas fusions involving tumor suppressors have the opposite effect. For fusions involving kinases, we found 1,275 with an intact kinase domain, the proportion of which varied significantly across cancer types. Our study suggests that fusions drive the development of 16.5% of cancer cases and function as the sole driver in more than 1% of them. Finally, we identified druggable fusions involving genes such as TMPRSS2, RET, FGFR3, ALK, and ESR1 in 6.0% of cases, and we predicted immunogenic peptides, suggesting that fusions may provide leads for targeted drug and immune therapy.
Summary Global DNA demethylation in humans is a fundamental process that occurs in pre-implantation embryos and during reversion to naïve ground state pluripotent stem cells (PSCs). However, the extent of DNA methylation reprogramming in human germline cells is unknown. Here we performed whole-genome bisulfite sequencing (WGBS) and RNA-sequencing (RNA-Seq) of human prenatal germline cells from 53–137 days of development. We discovered that the transcriptome and methylome of the human germline are distinct from those of both human PSCs and the inner cell mass (ICM) of human blastocysts. Using this resource to monitor the outcome of global DNA demethylation with reversion of primed PSCs to the naïve ground state, we uncovered hotspots of ultralow methylation at transposons that are protected from demethylation in the germline and ICM. Taken together, the human germline serves as a valuable in vivo tool for monitoring the epigenome of cells that have emerged from a global DNA demethylation event.
Amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration (FTLD) are two incurable neurodegenerative disorders that exist on a symptomological spectrum and share both genetic underpinnings and pathophysiological hallmarks. Functional abnormality of TAR DNA-binding protein 43 (TDP-43), an aggregation-prone RNA and DNA binding protein, is observed in the vast majority of both familial and sporadic ALS cases and in ~40% of FTLD cases, but the cascade of events leading to cell death is not understood. We have expressed human TDP-43 (hTDP-43) in Drosophila neurons and glia, a model that recapitulates many of the characteristics of TDP-43-linked human disease including protein aggregation pathology, locomotor impairment, and premature death. We report that such expression of hTDP-43 impairs small interfering RNA (siRNA) silencing, which is the major post-transcriptional mechanism of retrotransposable element (RTE) control in somatic tissue. This is accompanied by de-repression of a panel of both LINE and LTR families of RTEs, with somewhat different elements being active in response to hTDP-43 expression in glia versus neurons. hTDP-43 expression in glia causes an early and severe loss of control of a specific RTE, the endogenous retrovirus (ERV) gypsy. We demonstrate that gypsy causes the degenerative phenotypes in these flies because we are able to rescue the toxicity of glial hTDP-43 either by genetically blocking expression of this RTE or by pharmacologically inhibiting RTE reverse transcriptase activity. Moreover, we provide evidence that activation of DNA damage-mediated programmed cell death underlies both neuronal and glial hTDP-43 toxicity, consistent with RTE-mediated effects in both cell types. Our findings suggest a novel mechanism in which RTE activity contributes to neurodegeneration in TDP-43-mediated diseases such as ALS and FTLD.
The Human Pangenome Reference Consortium (HPRC) presents a first draft human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals. These assemblies cover more than 99% of the expected sequence and are more than 99% accurate at the structural and base-pair levels. Based on alignments of the assemblies, we generated a draft pangenome that captures known variants and haplotypes, reveals novel alleles at structurally complex loci, and adds 119 million base pairs of euchromatic polymorphic sequence and 1,529 gene duplications relative to the existing reference, GRCh38. Roughly 90 million of the additional base pairs derive from structural variation. Using our draft pangenome to analyze short-read data reduces errors when discovering small variants by 34% and boosts the detected structural variants per haplotype by 104% compared to GRCh38-based workflows, and by 34% compared to using previous diversity sets of genome assemblies.
Alternative splicing is prevalent in plants, but little is known about its regulation in the context of developmental and signaling pathways. We describe here a new factor that influences pre-messenger RNA (pre-mRNA) splicing and is essential for embryonic development in Arabidopsis thaliana. This factor was retrieved in a genetic screen that identified mutants impaired in expression of an alternatively spliced GFP reporter gene. In addition to the known spliceosomal component PRP8, the screen recovered Arabidopsis RTF2 (AtRTF2), a previously uncharacterized, evolutionarily conserved protein containing a replication termination factor 2 (Rtf2) domain. A homozygous null mutation in AtRTF2 is embryo lethal, indicating that AtRTF2 is an essential protein. Quantitative RT-PCR demonstrated that impaired expression of GFP in atrtf2 and prp8 mutants is due to inefficient splicing of the GFP pre-mRNA. A genome-wide analysis using RNA sequencing indicated that 13-16% of total introns are retained to a significant degree in atrtf2 mutants. Considering these results and previous suggestions that Rtf2 represents a ubiquitin-related domain, we discuss the possible role of AtRTF2 in ubiquitin-based regulation of pre-mRNA splicing.
KEYWORDS alternative splicing; C2HC2 zinc finger; intron retention; Rtf2 domain; ubiquitin ligase
It is increasingly recognized that cotranscriptional and posttranscriptional gene regulation is comparable to transcriptional regulation in intricacy and importance. Pre-mRNA splicing is a cotranscriptional process and a major determinant of transcript abundance and complexity (Reddy et al. 2013). Constitutive splicing refers to the use of only one set of splice sites to generate a single mature mRNA. By contrast, alternative splicing occurs when variable splice sites are selected, leading to the generation of more than one processed RNA product from a single pre-messenger RNA (pre-mRNA).
An individual gene can thus potentially encode multiple proteins, leading to a substantial increase in proteomic diversity (Chen and Manley 2009; Syed et al. 2012; Reddy et al. 2013). Recent work has established that alternative splicing is common in plants, affecting 60% of intron-containing genes. Alternative splicing has important roles in plant growth, development, abiotic stress tolerance, circadian rhythms, and pathogen defense (Staiger and Brown 2013). The most common outcome of alternative splicing in plants is intron retention (Lan et al. 2013), which occurs when an intron fails to be spliced out of the pre-mRNA. Retained introns frequently contain premature termination codons (PTCs) that can channel the transcript into the nonsense-mediated decay (NMD) pathway. Intron retention provides a means for "transcriptome tuning" (Braunschweig et al. 2014) and contributes to the post-transcriptional regulation of gene expression by reducing levels of inappropriately expressed transcripts (Ge and Porse 2013). Alternative splicing is subject to elaborate regulation that relies on general and specific trans-acting factors as ...
Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals1. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.
In computerized adaptive testing (CAT), aberrant responses such as careless errors and lucky guesses may cause significant ability estimation biases in the dynamic administration of test items. We investigated the robustness of the 4-parameter logistic item response theory (4PL IRT; Barton & Lord, 1981) model in comparison with the 3-parameter logistic (3PL) IRT model (Birnbaum, 1968). We applied additional precision and efficiency measures to evaluate the 4PL IRT model. We measured the precision of CAT with respect to the estimation bias and mean absolute differences (MAD) between estimated and actual abilities. An improvement in administrative efficiency is reflected in fewer items being required for satisfying the stopping rule. Our results indicate that the 4PL IRT model provides a more efficient and robust ability estimation than the 3PL model.
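As a rough illustration of the quantities named in this abstract, the sketch below writes out the standard 4PL item response function and the two precision measures (bias and MAD). The parameter names `a`, `b`, `c`, `d` (discrimination, difficulty, lower asymptote, upper asymptote) and all function names are assumptions chosen for illustration, as is the omission of the optional 1.7 scaling constant; none of this is taken from the authors' implementation.

```python
import math


def p_4pl(theta, a, b, c, d):
    """Probability of a correct response under the 4PL model.

    theta: examinee ability; a: discrimination; b: difficulty;
    c: lower asymptote (lucky guesses); d: upper asymptote (< 1 allows
    for careless errors by high-ability examinees).
    """
    return c + (d - c) / (1.0 + math.exp(-a * (theta - b)))


def p_3pl(theta, a, b, c):
    """The 3PL model is the special case of the 4PL with d fixed at 1."""
    return p_4pl(theta, a, b, c, 1.0)


def bias(estimated, actual):
    """Mean signed difference between estimated and actual abilities."""
    diffs = [e - t for e, t in zip(estimated, actual)]
    return sum(diffs) / len(diffs)


def mad(estimated, actual):
    """Mean absolute difference between estimated and actual abilities."""
    diffs = [abs(e - t) for e, t in zip(estimated, actual)]
    return sum(diffs) / len(diffs)


if __name__ == "__main__":
    # Hypothetical item and examinee values, for illustration only.
    print(p_3pl(3.0, a=1.5, b=0.0, c=0.2))
    print(p_4pl(3.0, a=1.5, b=0.0, c=0.2, d=0.95))
    print(bias([0.4, -0.1], [0.5, 0.0]), mad([0.4, -0.1], [0.5, 0.0]))
```

With an upper asymptote below 1, a single careless error by a high-ability examinee is less improbable under the model, which is one plausible reading of why the 4PL estimate would be less affected by such responses.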
Inherent genetic programming and environmental factors affect fetal growth in utero. Epidemiologic data in growth-altered fetuses, either intrauterine growth restricted (IUGR) or large for gestational age (LGA), demonstrate that these newborns are at increased risk of cardiometabolic disease in adulthood. There is growing evidence that the in utero environment leads to epigenetic modification, contributing to eventual risk of developing heart disease or diabetes. In this study, we used reduced representation bisulfite sequencing to examine genome-wide DNA methylation variation in placental samples from offspring born IUGR, LGA, and appropriate for gestational age (AGA) and to identify differential methylation of genes important for conferring risk of cardiometabolic disease. We found that there were distinct methylation signatures for IUGR, LGA, and AGA groups and identified over 500 differentially methylated genes (DMGs) among these group comparisons. Functional and gene network analyses revealed expected relationships of DMGs to placental physiology and transport, but also identified novel pathways with biologic plausibility and potential clinical importance to cardiometabolic disease. Specific loci for DMGs of interest had methylation patterns that were strongly associated with anthropometric presentations. We further validated altered gene expression of these specific DMGs contributing to vascular and metabolic diseases (SLC36A1, PTPRN2, CASZ1, IL10), thereby establishing transcriptional effects toward assigning functional significance. Our results suggest that the gene expression and methylation state of the human placenta are related and sensitive to the intrauterine environment, as it affects fetal growth patterns. We speculate that these observed changes may affect risk for offspring in developing adult cardiometabolic disease.