2013
DOI: 10.1016/j.patcog.2013.02.017
|View full text |Cite
|
Sign up to set email alerts
|

Classifier design given an uncertainty class of feature distributions via regularized maximum likelihood and the incorporation of biological pathway knowledge in steady-state phenotype classification

Abstract: Contemporary high-throughput technologies provide measurements of very large numbers of variables but often with very small sample sizes. This paper proposes an optimization-based paradigm for utilizing prior knowledge to design better performing classifiers when sample sizes are limited. We derive approximate expressions for the first and second moments of the true error rate of the proposed classifier under the assumption of two widely-used models for the uncertainty classes; ε-contamination and p-point clas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2013
2013
2020
2020

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 9 publications
(4 citation statements)
references
References 44 publications
0
4
0
Order By: Relevance
“…The more a prior is constrained by scientific knowledge, the more confident one can be that the prior distribution is concentrated around the correct model; however, as noted previously, one must be prudent, since concentrating the prior away from the true model can result in very poor results. With optimal Bayesian classification in the context of phenotype classification, knowledge concerning genetic signaling pathways has been integrated into prior construction [29, 30]. In [17, 18], a hierarchical Poisson prior is employed that models cellular mRNA concentrations using a log-normal distribution and then models the sequencing as sampling the RNA concentrations through a Poisson process.…”
Section: Prior Constructionmentioning
confidence: 99%
“…The more a prior is constrained by scientific knowledge, the more confident one can be that the prior distribution is concentrated around the correct model; however, as noted previously, one must be prudent, since concentrating the prior away from the true model can result in very poor results. With optimal Bayesian classification in the context of phenotype classification, knowledge concerning genetic signaling pathways has been integrated into prior construction [29, 30]. In [17, 18], a hierarchical Poisson prior is employed that models cellular mRNA concentrations using a log-normal distribution and then models the sequencing as sampling the RNA concentrations through a Poisson process.…”
Section: Prior Constructionmentioning
confidence: 99%
“…At present, the target recognition and classification methods of remote sensing images can be divided into traditional statistical methods [1][2][3][4][5][6][7][8], machine learning methods [9][10][11], and deep learning algorithms [12][13][14]. The traditional statistical methods realize the classification by using statistical analysis of the gray value and texture of image.…”
Section: Introductionmentioning
confidence: 99%
“…While the authors demonstrate that some of the best methods for prognosis prediction incorporate molecular features selected by expert prior knowledge along with both molecular and clinical data, all methods used are based on data-driven machine learning rather than optimal prediction and error estimation and do not take full advantage of network structure to improve prediction. In [ 11 , 12 ], the authors present methods of constructing uncertainty classes of gene expression distributions in the OBC framework that are consistent with available pathway information to improve classification. However, the focus is on diagnosis rather than prognosis, and these works treat network uncertainty as stemming from ignorance.…”
Section: Introductionmentioning
confidence: 99%