2020
DOI: 10.1214/19-ba1182
|View full text |Cite
|
Sign up to set email alerts
|

Theory of Optimal Bayesian Feature Filtering

Abstract: Optimal Bayesian feature filtering (OBF) is a supervised screening method designed for biomarker discovery. In this article, we prove two major theoretical properties of OBF. First, optimal Bayesian feature selection under a general family of Bayesian models reduces to filtering if and only if the underlying Bayesian model assumes all features are mutually independent. Therefore, OBF is optimal if and only if one assumes all features are mutually independent, and OBF is the only filter method that is optimal u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
14
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(14 citation statements)
references
References 83 publications
0
14
0
Order By: Relevance
“…In each cancer, at least 40% of mones were statistically significant irrespective of the statistical test used (FDR= 5%, see Supplementary File 1), and 75.7% of mones significant by at least one method were significant by all tests (see methods, see Figure 1b). A smaller subset of these mones showed strong effect sizes, as identified by optimal Bayesian filter (OBF) [17] statistics (see Methods). 22% of all mone-cancer pairs met this criterion based on distributional differences between tumor and normal slides (see Supplementary Figure 2).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…In each cancer, at least 40% of mones were statistically significant irrespective of the statistical test used (FDR= 5%, see Supplementary File 1), and 75.7% of mones significant by at least one method were significant by all tests (see methods, see Figure 1b). A smaller subset of these mones showed strong effect sizes, as identified by optimal Bayesian filter (OBF) [17] statistics (see Methods). 22% of all mone-cancer pairs met this criterion based on distributional differences between tumor and normal slides (see Supplementary Figure 2).…”
Section: Resultsmentioning
confidence: 99%
“…Differential mone analysis identifies mones with statistically significant distributional differences across classes. Welch's t-test, Kolmogorov Smirnov (KS) test, Wilcoxon Rank Sum (WRS) test, and optimal Bayesian Filter (OBF) (see [17] for details) were used for statistical analysis. t-test, WRS test, and KS test use the Benjamini-Hochberg procedure [38] for FDR correction.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…In each cancer, at least 40% of mones were statistically significant irrespective of the statistical test used (FDR= 5%, see Supplementary File 1), and 75.7% of mones significant by at least one method were significant by all tests (see methods, see Figure 1b). A smaller subset of these mones showed strong effect sizes, as identified by optimal Bayesian filter (OBF) [19] statistics (see Methods). 22% of all mone-cancer pairs met this criterion based on distributional differences between tumor and normal slides (see Supplementary Figure 3).…”
Section: Resultsmentioning
confidence: 99%
“…Structured multi-class OBF (see [45] for details) considers the four possible relations (known as structures by OBF) between frozen normal, frozen tumor, and FFPE tumor slides: (A) a mone does not differentiate between slides (prior probability=0.5), (B) a mone has one distribution for frozen slides (both tumor and adjacent normal slides) and another distribution for FFPE slides (prior probability=0.5/3), (C) a mone has one distribution for tumor slides (both frozen and FFPE) and another distribution for frozen normal slides (prior probability=0.5/3), and (D) a mone has one distribution for FFPE tumor slides and frozen adjacent normal slides and another distribution for frozen tumor slides. Mones with structure B for which frozen tumor and FFPE tumors lie on both sides of frozen adjacent normal slides (based on mean values) are considered ineffective due to FFPE/frozen differences.…”
Section: Methodsmentioning
confidence: 99%