2020
DOI: 10.1093/jssam/smaa028
|View full text |Cite
|
Sign up to set email alerts
|

Boosted Kernel Weighting – Using Statistical Learning to Improve Inference from Nonprobability Samples

Abstract: Given the growing popularity of nonprobability samples as a cost- and time-efficient alternative to probability sampling, a variety of adjustment approaches have been proposed to correct for self-selection bias in nonrandom samples. Popular methods such as inverse propensity-score weighting (IPSW) and propensity-score (PS) adjustment by subclassification (PSAS) utilize a probability sample as a reference to estimate pseudo-weights for the nonprobability sample based on PSs. A recent contribution, kernel weight… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
8
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 38 publications
0
8
0
Order By: Relevance
“…Logistic models are often used to estimate the propensity to participate in the survey of each individual. In recent decades, numerous machine‐learning (ML) methods have been considered in the literature for the treatment of nonprobability samples and have proved to be more suitable for regression and classification than linear regression methods (Castro‐Martín et al., 2020 ; Chu & Beaumont, 2019 ; Ferri‐García & Rueda, 2020 ; Kern et al., 2020 ).…”
Section: Methodsmentioning
confidence: 99%
“…Logistic models are often used to estimate the propensity to participate in the survey of each individual. In recent decades, numerous machine‐learning (ML) methods have been considered in the literature for the treatment of nonprobability samples and have proved to be more suitable for regression and classification than linear regression methods (Castro‐Martín et al., 2020 ; Chu & Beaumont, 2019 ; Ferri‐García & Rueda, 2020 ; Kern et al., 2020 ).…”
Section: Methodsmentioning
confidence: 99%
“…Many techniques, varying in sophistication, can be used to estimate the propensity score ( 22 ). Concretely, logistic regression is the most commonly used method for fitting the propensity score; in this case, Σ is taken to be the class of linear functions passed through the logistic activation.…”
Section: Propensity Score Reweightingmentioning
confidence: 99%
“…Very recently, a kernel weighting approach has been proposed by , where the weighted estimator is proved to be consistent under a weak exchangeability condition. To further weaken the modeling assumptions, Kern et al (2020) propose to use algorithmic tree-based methods, including random forests and gradient tree boosting, for estimating the PS in kernel weighting.…”
Section: Introductionmentioning
confidence: 99%