2023
DOI: 10.1038/s41588-023-01373-3
|View full text |Cite
|
Sign up to set email alerts
|

Aberrant splicing prediction across human tissues

Abstract: Aberrant splicing is a major cause of genetic disorders but its direct detection in transcriptomes is limited to clinically accessible tissues such as skin or body fluids. While DNA-based machine learning models can prioritize rare variants for affecting splicing, their performance in predicting tissue-specific aberrant splicing remains unassessed. Here we generated an aberrant splicing benchmark dataset, spanning over 8.8 million rare variants in 49 human tissues from the Genotype-Tissue Expression (GTEx) dat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
17
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 27 publications
(26 citation statements)
references
References 69 publications
0
17
0
Order By: Relevance
“…We next annotated the genes using the seven features from the software intOGen, which include positional recurrence of mutations in genome sequence (OncodriveCLUSTL), positional recurrence of mutations in protein conformation (HotMAPS), enrichment of mutations in functional domains (smRegions), three alternative measures of selection strength inferred from synonymous and nonsynonymous mutations (CBaSE, MutPanning, and dNdScv), and OncodriveFML, a method identifying excess of somatic mutations across tumors in both coding and non-coding genomic regions (37,(49)(50)(51)(52)(53)(54)(55). Moreover, we annotated genetic variants falling into gene bodies, including deep intronic variants, with AbSplice-DNA, a tool predicting variants causing aberrant splicing (35). On the RNA-seq data, we used OUTRIDER on a total of 12,966 protein-coding genes commonly expressed across the dataset to call high or low expression outliers, and FRASER to call splicing outliers (39,41).…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…We next annotated the genes using the seven features from the software intOGen, which include positional recurrence of mutations in genome sequence (OncodriveCLUSTL), positional recurrence of mutations in protein conformation (HotMAPS), enrichment of mutations in functional domains (smRegions), three alternative measures of selection strength inferred from synonymous and nonsynonymous mutations (CBaSE, MutPanning, and dNdScv), and OncodriveFML, a method identifying excess of somatic mutations across tumors in both coding and non-coding genomic regions (37,(49)(50)(51)(52)(53)(54)(55). Moreover, we annotated genetic variants falling into gene bodies, including deep intronic variants, with AbSplice-DNA, a tool predicting variants causing aberrant splicing (35). On the RNA-seq data, we used OUTRIDER on a total of 12,966 protein-coding genes commonly expressed across the dataset to call high or low expression outliers, and FRASER to call splicing outliers (39,41).…”
Section: Resultsmentioning
confidence: 99%
“…Using FRASER, which accurately captured the consequence of this initially discarded structural variant as a splicing outlier, we were now able to recognize its significant transcriptomic impact and rescue it. As a complementary approach to RNA-seq-based splicing outlier calling, we considered AbSplice (35), a recently published algorithm that predicts whether a rare variant causes aberrant splicing. Here, we applied it for the first time to hematologic malignancy samples.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations