2020
DOI: 10.1186/s12859-020-03653-9
|View full text |Cite
|
Sign up to set email alerts
|

Comparison of methods for the detection of outliers and associated biomarkers in mislabeled omics data

Abstract: Background Previous studies have reported that labeling errors are not uncommon in omics data. Potential outliers may severely undermine the correct classification of patients and the identification of reliable biomarkers for a particular disease. Three methods have been proposed to address the problem: sparse label-noise-robust logistic regression (Rlogreg), robust elastic net based on the least trimmed square (enetLTS), and Ensemble. Ensemble is an ensembled classification based on distinct feat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
14
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(15 citation statements)
references
References 50 publications
1
14
0
Order By: Relevance
“…MTL-EN variable selection accuracy was very similar to enetLTS with high PSR and FDR. As also shown in our previous study [ 9 ], Ensemble had the highest variable selection accuracy with much low FDR; however, Ensemble missed some associated variables when the proportion of outliers was 10% or 15%.…”
Section: Simulation Studysupporting
confidence: 80%
See 4 more Smart Citations
“…MTL-EN variable selection accuracy was very similar to enetLTS with high PSR and FDR. As also shown in our previous study [ 9 ], Ensemble had the highest variable selection accuracy with much low FDR; however, Ensemble missed some associated variables when the proportion of outliers was 10% or 15%.…”
Section: Simulation Studysupporting
confidence: 80%
“…In the simulation experiment, we compared the two methods enetLTS and MTL-EN using C-step and AR-Cstep algorithms, respectively. Through our previous research [ 9 ] and subsequent simulation experiments, we can see that enetLTS is good at identifying outliers. However, the FDR of its variable selection is high, and many unrelated variables are identified.…”
Section: Simulation Studymentioning
confidence: 92%
See 3 more Smart Citations