2013
DOI: 10.1080/10618600.2012.681213

Automatic Feature Selection via Weighted Kernels and Regularization

Abstract: Selecting important features in nonlinear kernel spaces is a difficult challenge in both classification and regression problems. This article proposes to achieve feature selection by optimizing a simple criterion: a feature-regularized loss function. Features within the kernel are weighted, and a lasso penalty is placed on these weights to encourage sparsity. This feature-regularized loss function is minimized by estimating the weights in conjunction with the coefficients of the original classification or regression…
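The abstract sketches KNIFE's core idea: place a weight on each feature inside the kernel, add a lasso penalty on those weights, and minimize the resulting feature-regularized loss jointly over the weights and the kernel coefficients. Below is a minimal sketch of that idea for kernel ridge regression with a weighted radial kernel. It is an illustration under assumptions, not Allen's implementation: the names weighted_rbf and knife_sketch, the finite-difference gradient, and the proximal-gradient update on nonnegative weights are all simplifications introduced here.

```python
import numpy as np

def weighted_rbf(X, Z, w, gamma):
    """Radial kernel on feature-weighted inputs: exp(-gamma * ||w * (x - z)||^2)."""
    diff = (X * w)[:, None, :] - (Z * w)[None, :, :]
    return np.exp(-gamma * (diff ** 2).sum(axis=-1))

def objective(alpha, w, X, y, lam1, lam2, gamma):
    """Feature-regularized loss: squared error, a ridge penalty on alpha,
    and a lasso penalty on the feature weights w."""
    K = weighted_rbf(X, X, w, gamma)
    resid = y - K @ alpha
    return resid @ resid + lam2 * alpha @ K @ alpha + lam1 * np.abs(w).sum()

def knife_sketch(X, y, lam1=1.0, lam2=1.0, gamma=None, n_outer=15,
                 step=1e-3, eps=1e-5):
    """Alternate a closed-form solve for alpha with one proximal-gradient step
    on nonnegative feature weights (gradient taken by finite differences)."""
    n, p = X.shape
    gamma = 1.0 / p if gamma is None else gamma
    w = np.ones(p)
    for _ in range(n_outer):
        # alpha-step: kernel ridge solution with the weights w held fixed
        K = weighted_rbf(X, X, w, gamma)
        alpha = np.linalg.solve(K + lam2 * np.eye(n), y)
        # w-step: finite-difference gradient of the smooth part of the loss
        base = objective(alpha, w, X, y, 0.0, lam2, gamma)
        grad = np.empty(p)
        for j in range(p):
            w_eps = w.copy()
            w_eps[j] += eps
            grad[j] = (objective(alpha, w_eps, X, y, 0.0, lam2, gamma) - base) / eps
        # lasso prox restricted to nonnegative weights: shift and clip at zero
        w = np.maximum(w - step * grad - step * lam1, 0.0)
    return alpha, w
```

The alternating structure mirrors the abstract's description of estimating the weights in conjunction with the coefficients of the original problem; a zero weight removes its feature from the kernel entirely.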

Cited by 56 publications (69 citation statements)
References 25 publications
“…SPAM (sparse additive models) is similar to COSSO in that it truncates complexity, but it allows p ≫ n (Ravikumar et al., 2009). Kernel iterative feature extraction (KNIFE) by Allen (2013) imposes L1-regularization on L2-penalized splines.…”
Section: Methods Comparison and Numerical Results (mentioning)
Confidence: 99%
“…We use the default GCV criterion for MARS. For KNIFE, we fix λ1 = 1 and use a radial kernel with γ = 1/p as suggested in Allen (2013). The weight power for the ACOSSO is fixed at γ = 2, as suggested by Storlie et al. (2011).…”
Section: Methods Comparison and Numerical Results (mentioning)
Confidence: 99%
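For concreteness, the settings quoted above (λ1 = 1 and a radial kernel with γ = 1/p) drop straight into the sketch given after the abstract. The toy data below is an assumed example, not the cited study's benchmark, so the output is only indicative.

```python
# Toy run with the quoted settings: lambda_1 = 1 and a radial kernel with gamma = 1/p.
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 10))                    # n = 100, p = 10
y = np.sin(2 * X[:, 0]) + X[:, 1] + 0.1 * rng.standard_normal(100)  # 2 active features
alpha, w = knife_sketch(X, y, lam1=1.0, gamma=1.0 / X.shape[1])
print(np.round(w, 2))  # weights on the eight inactive features should shrink toward zero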
“…Note that this is not the case in classical kernel-based approaches, where prediction is the only goal and feature selection is not addressed. Previously demonstrated methods for feature selection using kernel machines [52] lack the probabilistic model required by our approach. Further extension of our model to those cases is possible but beyond the scope of this paper.…”
Section: Discussion (mentioning)
Confidence: 99%
“…To achieve that goal, one may consider more flexible forms of kernel functions to allow the method to automatically remove variables. Existing literature in nonlinear variable selection, for example, the COSSO (Lin and Zhang, 2007) and KNIFE (Allen, 2013), can be useful here. In that case, the corresponding computational algorithm can be much more challenging.…”
Section: Discussion (mentioning)
Confidence: 99%