2019
DOI: 10.3390/app9173589
|View full text |Cite
|
Sign up to set email alerts
|

MLW-gcForest: A Multi-Weighted gcForest Model for Cancer Subtype Classification by Methylation Data

Abstract: Effective cancer treatment requires a clear subtype. Due to the small sample size, high dimensionality, and class imbalances of cancer gene data, classifying cancer subtypes by traditional machine learning methods remains challenging. The gcForest algorithm is a combination of machine learning methods and a deep neural network and has been indicated to achieve better classification of small samples of data. However, the gcForest algorithm still faces many challenges when this method is applied to the classific… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
8
2

Relationship

1
9

Authors

Journals

citations
Cited by 14 publications
(8 citation statements)
references
References 44 publications
(75 reference statements)
0
8
0
Order By: Relevance
“…The gcForest also has been applied in the medical field. Dong et al (2019) [23] proposed a multi-weighted gcForest algorithm to build a lung adenocarcinoma staging prediction model. The weights of random forests (RF) in the standard gcForest are the same, which might be not reasonable.…”
Section: B the Studies Of Gcforestmentioning
confidence: 99%
“…The gcForest also has been applied in the medical field. Dong et al (2019) [23] proposed a multi-weighted gcForest algorithm to build a lung adenocarcinoma staging prediction model. The weights of random forests (RF) in the standard gcForest are the same, which might be not reasonable.…”
Section: B the Studies Of Gcforestmentioning
confidence: 99%
“…At the same time, the deep forest algorithm can be used to extract multi-level amplitude features and multi-level dense scale-invariant feature transformation features, amplitude features, and realize fault recognition in multi-feature mode [17][18][19]. In addition, some scholars used machine learning methods to improve the deep forest model, solving the problems of the long characteristics of single-sample data of mechanical equipment vibration signals and the high cost of deep forest model data processing, and realized the fault diagnosis of mechanical equipment under small training samples [20]. The bearing vibration signal data samples of coal mine main fans cover a large number of sample points, mostly 1000-4000 data points.…”
Section: Introductionmentioning
confidence: 99%
“…The small data size and imbalanced data problems have been addressed in [22], where an enhanced algorithm to handle the data was proposed, combining traditional techniques with deep learning methods. The results obtained confirmed that deep learning enhances performance; in addition to that, methylation data were suggested to be effectively used to improve diagnosis of cancer.…”
Section: Introductionmentioning
confidence: 99%