2010
DOI: 10.1198/jasa.2010.tm10130
|View full text |Cite
|
Sign up to set email alerts
|

Variable Selection Using Adaptive Nonlinear Interaction Structures in High Dimensions

Abstract: Numerous penalization based methods have been proposed for fitting a traditional linear regression model in which the number of predictors, p, is large relative to the number of observations, n. Most of these approaches assume sparsity in the underlying coefficients and perform some form of variable selection. Recently, some of this work has been extended to non-linear additive regression models. However, in many contexts one wishes to allow for the possibility of interactions among the predictors. This poses … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
124
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 104 publications
(127 citation statements)
references
References 28 publications
2
124
0
Order By: Relevance
“…The CAP can also be seen as a part of the composite penalties (5), so we added the CAP to Table 1. Choi et al (2010), Radchenko and James (2010), Bach et al (2012) and Bien et al (2013) also considered methods for hierarchical selection. This paper focuses on the setting where there are grouped structures without any overlap.…”
Section: Penalties For Other Settingsmentioning
confidence: 99%
“…The CAP can also be seen as a part of the composite penalties (5), so we added the CAP to Table 1. Choi et al (2010), Radchenko and James (2010), Bach et al (2012) and Bien et al (2013) also considered methods for hierarchical selection. This paper focuses on the setting where there are grouped structures without any overlap.…”
Section: Penalties For Other Settingsmentioning
confidence: 99%
“…) and h (j) is the vector of the bandwidths except h j , with a(x, K , h (j) ) and b j (x, K ) defined in (13). This assumption has a similar role as the assumption (4) in [19].…”
Section: Assumption (B)mentioning
confidence: 92%
“…1(a). Now consider the bias functional in (13). As a function of h j , its behavior is depicted in Fig.…”
Section: The Optimal Bandwidth Matrixmentioning
confidence: 99%
See 1 more Smart Citation
“…However, the all-pairs Lasso estimator does not account for any structural information which has been shown to be important for prediction and interpretation of the high dimensional interaction regression model [2,30,25,29,6]. In statistics, a hierarchical structure between main effects and interaction effects has been shown to be very effective in constraining the search space and identifying important individual features and interactions [2,30,25,29,6]. Specifically, the hierarchical constraint requires that an interaction term xixj is selected in the model only if the main effects xi and/or xj are included.…”
Section: Introductionmentioning
confidence: 99%