Abstract:BackgroundRNA sequencing has been proposed as a means of increasing diagnostic rates in studies of undiagnosed rare inherited disease. Recent studies have reported diagnostic improvements in the range of 7.5–35% by profiling splicing, gene expression quantification and allele specific expression. To-date however, no study has systematically assessed the presence of gene-fusion transcripts in cases of germline disease. Fusion transcripts are routinely identified in cancer studies and are increasingly recognized… Show more
“…Thus, detection algorithms primarily trained with oncogenic fusions may be biased by these and not optimized to account for different expression levels and patterns of read support. Such difficulties were demonstrated in our study where TopHat Fusion (Kim and Salzberg, 2011) using default parameters succeeded in detecting only one of eight fusion events detected and laboratory-validated in our rare disease cohort (Oliver et al, 2019b). To address this, we implemented a series of filtering and classification steps to detect fusions potentially linked to rare genetic constitutive disease.…”
Section: Adapting Fusion Detection To Rare Diseasementioning
confidence: 94%
“…Discussion of fusion transcripts detected in normal tissues centered on apparently benign events resulting from co-transcription of neighboring genes or more controversially from trans-splicing (Akiva et al, 2006;Peng et al, 2015;Babiceanu et al, 2016;Yuan et al, 2017;He et al, 2018). Reports of fusions in the context of inherited disease existed only in isolated case studies and were not systematically reported on until 2019 (Oliver et al, 2019b). The formulation of computational fusion detection software reflected the field's focus on oncology-related fusion events and algorithms were primarily trained using incompletely characterized tumors or cancer cell-lines (Kumar et al, 2016).…”
Section: Adapting Fusion Detection To Rare Diseasementioning
confidence: 99%
“…Our GTEx fusion database was queried for exon to exon fusions involving the gene pairs comprising eleven fusion candidates reported in our prior study (Oliver et al, 2019b; Table 1 rows FIGURE 1 | Dot plots illustrating the number of observations of selected exon-exon fusion transcripts in the GTEx RNA-Seq data by tissue type. Fusion analysis was performed using RNA-Seq data from 8187 samples passing QC, representing 549 individuals and 52 tissue types, extracted from GTEx (version 6p).…”
Section: Pathogenic Fusions In Normal Tissuesmentioning
confidence: 99%
“…Fusion transcript identification was performed using STAR-Fusion (Haas et al, 2017) with default settings following STAR (v2.5.2b) two-pass alignment (Dobin et al, 2013). Similar to our previously described methods, preliminary fusion calls were used to maximize sensitivity by avoiding default filters encoded in the callers (Oliver et al, 2019b). Fusion-supporting junction and spanning reads identified by STAR Fusion were combined into a single supporting read count for each event.…”
Section: Pathogenic Fusions In Normal Tissuesmentioning
confidence: 99%
“…Fusion-supporting junction and spanning reads identified by STAR Fusion were combined into a single supporting read count for each event. Fusions (A)-(F) are fusion candidates originating from a cohort analysis of rare disease patients previously published by the authors (Oliver et al, 2019b). Five fusions experimentally validated in the authors' cohort analysis were not observed in the GTEx database and are not displayed in the figure.…”
Section: Pathogenic Fusions In Normal Tissuesmentioning
Several recent studies have demonstrated the utility of RNA-Seq in the diagnosis of rare inherited disease. Diagnostic rates 35% higher than those previously achievable with DNA-Seq alone have been attained. These studies have primarily profiled gene expression and splicing defects, however, some have also shown that fusion transcripts are diagnostic or phenotypically relevant in patients with constitutional disorders. Fusion transcripts have traditionally been studied as oncogenic phenomena, with relevance only to cancer testing. Consequently, fusion detection algorithms were biased toward the detection of well-known oncogenic fusions, hindering their application to rare Mendelian genetic disease studies. A recent methodology published by the authors successfully tailored a traditional algorithm to the detection of pathogenic fusion events in inherited disease. A key mechanism of decreasing false positive or biologically benign events was comparison to a database of events detected in normal tissues. This approach is akin to population frequency-based filtering of genetic variants. It is predicated on the idea that pathogenic fusion transcripts are absent from normal tissue. We report on an analysis of RNA-Seq data from the genotype-tissue expression (GTEx) project in which known pathogenic fusions are computationally detected at low levels in normal tissues unassociated with the disease phenotype. Examples include archetypal cancer fusion transcripts, as well as fusions responsible for rare inherited disease. We consider potential explanations for the detectability of such transcripts and discuss the bearing such results have on the future profiling of genetic disease patients for pathogenic gene fusions.
“…Thus, detection algorithms primarily trained with oncogenic fusions may be biased by these and not optimized to account for different expression levels and patterns of read support. Such difficulties were demonstrated in our study where TopHat Fusion (Kim and Salzberg, 2011) using default parameters succeeded in detecting only one of eight fusion events detected and laboratory-validated in our rare disease cohort (Oliver et al, 2019b). To address this, we implemented a series of filtering and classification steps to detect fusions potentially linked to rare genetic constitutive disease.…”
Section: Adapting Fusion Detection To Rare Diseasementioning
confidence: 94%
“…Discussion of fusion transcripts detected in normal tissues centered on apparently benign events resulting from co-transcription of neighboring genes or more controversially from trans-splicing (Akiva et al, 2006;Peng et al, 2015;Babiceanu et al, 2016;Yuan et al, 2017;He et al, 2018). Reports of fusions in the context of inherited disease existed only in isolated case studies and were not systematically reported on until 2019 (Oliver et al, 2019b). The formulation of computational fusion detection software reflected the field's focus on oncology-related fusion events and algorithms were primarily trained using incompletely characterized tumors or cancer cell-lines (Kumar et al, 2016).…”
Section: Adapting Fusion Detection To Rare Diseasementioning
confidence: 99%
“…Our GTEx fusion database was queried for exon to exon fusions involving the gene pairs comprising eleven fusion candidates reported in our prior study (Oliver et al, 2019b; Table 1 rows FIGURE 1 | Dot plots illustrating the number of observations of selected exon-exon fusion transcripts in the GTEx RNA-Seq data by tissue type. Fusion analysis was performed using RNA-Seq data from 8187 samples passing QC, representing 549 individuals and 52 tissue types, extracted from GTEx (version 6p).…”
Section: Pathogenic Fusions In Normal Tissuesmentioning
confidence: 99%
“…Fusion transcript identification was performed using STAR-Fusion (Haas et al, 2017) with default settings following STAR (v2.5.2b) two-pass alignment (Dobin et al, 2013). Similar to our previously described methods, preliminary fusion calls were used to maximize sensitivity by avoiding default filters encoded in the callers (Oliver et al, 2019b). Fusion-supporting junction and spanning reads identified by STAR Fusion were combined into a single supporting read count for each event.…”
Section: Pathogenic Fusions In Normal Tissuesmentioning
confidence: 99%
“…Fusion-supporting junction and spanning reads identified by STAR Fusion were combined into a single supporting read count for each event. Fusions (A)-(F) are fusion candidates originating from a cohort analysis of rare disease patients previously published by the authors (Oliver et al, 2019b). Five fusions experimentally validated in the authors' cohort analysis were not observed in the GTEx database and are not displayed in the figure.…”
Section: Pathogenic Fusions In Normal Tissuesmentioning
Several recent studies have demonstrated the utility of RNA-Seq in the diagnosis of rare inherited disease. Diagnostic rates 35% higher than those previously achievable with DNA-Seq alone have been attained. These studies have primarily profiled gene expression and splicing defects, however, some have also shown that fusion transcripts are diagnostic or phenotypically relevant in patients with constitutional disorders. Fusion transcripts have traditionally been studied as oncogenic phenomena, with relevance only to cancer testing. Consequently, fusion detection algorithms were biased toward the detection of well-known oncogenic fusions, hindering their application to rare Mendelian genetic disease studies. A recent methodology published by the authors successfully tailored a traditional algorithm to the detection of pathogenic fusion events in inherited disease. A key mechanism of decreasing false positive or biologically benign events was comparison to a database of events detected in normal tissues. This approach is akin to population frequency-based filtering of genetic variants. It is predicated on the idea that pathogenic fusion transcripts are absent from normal tissue. We report on an analysis of RNA-Seq data from the genotype-tissue expression (GTEx) project in which known pathogenic fusions are computationally detected at low levels in normal tissues unassociated with the disease phenotype. Examples include archetypal cancer fusion transcripts, as well as fusions responsible for rare inherited disease. We consider potential explanations for the detectability of such transcripts and discuss the bearing such results have on the future profiling of genetic disease patients for pathogenic gene fusions.
Deletions, duplications, insertions, inversions, and translocations, collectively called structural variations (SVs), affect more base pairs of the genome than any other sequence variant. The recent technological advancements in genome sequencing have enabled the discovery of tens of thousands of SVs per human genome. These SVs primarily affect non‐coding DNA sequences, but the difficulties in interpreting their impact limit our understanding of human disease etiology. The functional annotation of non‐coding DNA sequences and methodologies to characterize their three‐dimensional (3D) organization in the nucleus have greatly expanded our understanding of the basic mechanisms underlying gene regulation, thereby improving the interpretation of SVs for their pathogenic impact. Here, we discuss the various mechanisms by which SVs can result in altered gene regulation and how these mechanisms can result in rare genetic disorders. Beyond changing gene expression, SVs can produce novel gene‐intergenic fusion transcripts at the SV breakpoints.
Over the past decade high‐throughput DNA sequencing approaches, namely whole exome and whole genome sequencing became a standard procedure in Mendelian disease diagnostics. Implementation of these technologies greatly facilitated diagnostics and shifted the analysis paradigm from variant identification to prioritisation and evaluation. The diagnostic rates vary widely depending on the cohort size, heterogeneity, and disease and range from around 30% to 50% leaving the majority of patients undiagnosed. Advances in omics technologies and computational analysis provide an opportunity to increase these unfavourable rates by providing evidence for disease‐causing variant validation and prioritisation. This review aims to provide an overview of the current application of several omics technologies including RNA‐sequencing, proteomics, metabolomics and DNA‐methylation profiling for diagnostics of rare genetic diseases in general and inborn errors of metabolism in particular.This article is protected by copyright. All rights reserved.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.