2018
DOI: 10.1016/j.csbj.2018.02.005
|View full text |Cite
|
Sign up to set email alerts
|

A Review of Matched-pairs Feature Selection Methods for Gene Expression Data Analysis

Abstract: With the rapid accumulation of gene expression data from various technologies, e.g., microarray, RNA-sequencing (RNA-seq), and single-cell RNA-seq, it is necessary to carry out dimensional reduction and feature (signature genes) selection in support of making sense out of such high dimensional data. These computational methods significantly facilitate further data analysis and interpretation, such as gene function enrichment analysis, cancer biomarker detection, and drug targeting identification in precision m… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
34
0
1

Year Published

2020
2020
2021
2021

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 56 publications
(35 citation statements)
references
References 88 publications
0
34
0
1
Order By: Relevance
“…Data normalization is a method to standardize the range of features without reducing the dimension of the data [19][20][21][22]41]. Data normalization process is important since it is important to select the best features without eliminating useful information from the preprocessed data [19][20][21][22]. Conventional single stage feature selection having the drawback of possibly selecting data after eliminating useful data during feature extraction stage.…”
Section: Stage 1: Data Normalization Methods and Data Dimension Reductionmentioning
confidence: 99%
See 3 more Smart Citations
“…Data normalization is a method to standardize the range of features without reducing the dimension of the data [19][20][21][22]41]. Data normalization process is important since it is important to select the best features without eliminating useful information from the preprocessed data [19][20][21][22]. Conventional single stage feature selection having the drawback of possibly selecting data after eliminating useful data during feature extraction stage.…”
Section: Stage 1: Data Normalization Methods and Data Dimension Reductionmentioning
confidence: 99%
“…Thus, for this work, raw data samples are normalized using ten different data normalization methods. Based on the comprehensive review done on the previous researches, five data normalization methods are chosen from the commonly used methods, namely, Decimal Scaling (DS), Z-score (ZS), Linear Scaling (LS), Min-Max (MM) and Mean & Standard Deviation (MSD) methods [19][20][21][22]. The other five data normalization methods are newly introduced in early breast cancer detection application, namely, Relative Logarithmic Sum Squared Voltage (RLSSV), Relative Logarithmic Voltage (RLV), Relative Voltage (RV), Fractional Voltage Change (FVC) and Relative Sum Squared Voltage (RSSV) [8][9].…”
Section: Stage 1: Data Normalization Methods and Data Dimension Reductionmentioning
confidence: 99%
See 2 more Smart Citations
“…There are currently four methods available in literature to identify 4mC sites, including iDNA4mC (Chen et al, 2017), 4mCPred (Su et al, 2018), 4mcPred-SVM (Wei et al, 2018a), and 4mcPred-IFL (Wei et al, 2019a). iDNA4mC, as the first machine learning predictor, encodes sequences by nucleotide chemical properties and nucleotide frequency to features and trains support vector machine (SVM) models for prediction (Liang et al, 2018). Although this method has the ability to distinguish between 4mC and non-4mC sites, the prediction accuracy is relatively low overall.…”
Section: Introductionmentioning
confidence: 99%