2013
DOI: 10.2478/itms-2013-0015
|View full text |Cite
|
Sign up to set email alerts
|

Feature Ranking by Classification Accuracy Estimation of Multiple Data Samples

Abstract: -This article considers the gene ranking algorithm for the microarray data. The rank vector is estimated by classifications of the random data samples. At each iteration, the ranks of genes participating in the successful classification become higher. Unlike other methods of feature selection, the proposed algorithm allows increasing the generality of the classification models by construction of the balanced training samples and taking into account the descriptiveness of the gene combinations by the subset est… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 17 publications
(19 reference statements)
0
1
0
Order By: Relevance
“…In one of the numerical experiments the real data contained information on the gene expression of cancer patients (143 observations of 60,483 features) obtained from the Cancer Genome Atlas (TCGA). From these data by means of the simplified method of ranking of features proposed by Novoselova [30] 23 most informative features concerning the forecast of a vital status of patients having diagnosed glioblastoma were identified. This approach substantially simplifies numerical difficulties in following data processing.…”
Section: Figure Simplified Diagram Of Combining Two Data Sources Into An Ensemblementioning
confidence: 99%
“…In one of the numerical experiments the real data contained information on the gene expression of cancer patients (143 observations of 60,483 features) obtained from the Cancer Genome Atlas (TCGA). From these data by means of the simplified method of ranking of features proposed by Novoselova [30] 23 most informative features concerning the forecast of a vital status of patients having diagnosed glioblastoma were identified. This approach substantially simplifies numerical difficulties in following data processing.…”
Section: Figure Simplified Diagram Of Combining Two Data Sources Into An Ensemblementioning
confidence: 99%