2016
DOI: 10.1093/bioinformatics/btw430
|View full text |Cite
|
Sign up to set email alerts
|

Complementary feature selection from alternative splicing events and gene expression for phenotype prediction

Abstract: MotivationA central task of bioinformatics is to develop sensitive and specific means of providing medical prognoses from biomarker patterns. Common methods to predict phenotypes in RNA-Seq datasets utilize machine learning algorithms trained via gene expression. Isoforms, however, generated from alternative splicing, may provide a novel and complementary set of transcripts for phenotype prediction. In contrast to gene expression, the number of isoforms increases significantly due to numerous alternative splic… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…However, these studies utilized RNA-seq data by leveraging only gene-level expression data rather than more detailed transcript-level data available for the alternative splicing transcripts (Chen and Manley 2009). Most recently, a study analyzed the utility of RNA-seq transcriptlevel data for the disease/nondisease phenotype classification of the samples, showing the advantage of the transcript expression data for the disease phenotype prediction task (Labuzzetta et al 2016). However, the question of whether or not the utility of transcript-level expression presents a general trend across all main biological and biomedical classification tasks remains open.…”
Section: Introductionmentioning
confidence: 99%
“…However, these studies utilized RNA-seq data by leveraging only gene-level expression data rather than more detailed transcript-level data available for the alternative splicing transcripts (Chen and Manley 2009). Most recently, a study analyzed the utility of RNA-seq transcriptlevel data for the disease/nondisease phenotype classification of the samples, showing the advantage of the transcript expression data for the disease phenotype prediction task (Labuzzetta et al 2016). However, the question of whether or not the utility of transcript-level expression presents a general trend across all main biological and biomedical classification tasks remains open.…”
Section: Introductionmentioning
confidence: 99%
“…These methods include various network topology analyses, dimensionality reduction methods, anomaly detection, supervised and unsupervised machine learning algorithms, as well as summarization and visualization techniques for complex high-dimensional data [439]. These methods have been used for feature selection on big data sets directly [440,441].…”
Section: Current and Emerging Technologymentioning
confidence: 99%
“…Several methods have been developed to automatically predict or generate genotype-phenotype associations. To predict phenotype associations, these methods use different sources such as literature [15][16][17], functional annotations [15,18,19], protein-protein interactions (PPIs) [15,[20][21][22], expression profiles [23,24], genetic variations [15,25], or their combinations [15]. The general idea behind most of these methods is to find genetic similarities, or interactions, and transfer phenotypes between genes based on the assumption that similar or interacting genes are involved in similar or related phenotypes [26].…”
Section: Introductionmentioning
confidence: 99%