2020
DOI: 10.1093/jssam/smz051
|View full text |Cite
|
Sign up to set email alerts
|

Integrating Probability and Nonprobability Samples for Survey Inference

Abstract: Survey data collection costs have risen to a point where many survey researchers and polling companies are abandoning large, expensive probability-based samples in favor of less expensive nonprobability samples. The empirical literature suggests this strategy may be suboptimal for multiple reasons, among them that probability samples tend to outperform nonprobability samples on accuracy when assessed against population benchmarks. However, nonprobability samples are often preferred due to convenience and costs… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
26
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 48 publications
(34 citation statements)
references
References 20 publications
0
26
0
1
Order By: Relevance
“…More generally, there is an extensive literature on approaches for making inferences from data collected from nonprobability samples 50 52 . Other promising approaches include integrating surveys of varying quality 53 , 54 , and leveraging the estimated ddc in one outcome to correct bias in others under several scenarios ( Supplementary Information D ).…”
Section: Discussionmentioning
confidence: 99%
“…More generally, there is an extensive literature on approaches for making inferences from data collected from nonprobability samples 50 52 . Other promising approaches include integrating surveys of varying quality 53 , 54 , and leveraging the estimated ddc in one outcome to correct bias in others under several scenarios ( Supplementary Information D ).…”
Section: Discussionmentioning
confidence: 99%
“…To address the limitations of non-probability sampling, we applied innovative strategies for sensitivity analyses to strengthen conclusions. 16 17 Additionally, we adjusted for non-response in both samples by using sample weights based on several sociodemographic characteristics (ie, sex, race/ethnicity, age and educational attainment), a standard procedure for addressing non-response in surveys.…”
Section: Discussionmentioning
confidence: 99%
“…In a sensitivity analysis, we conducted linear regression modelling using Bayesian data integration with responses from the RDD and online samples. 16 17 We retained the five-level response options for each of the three variables measuring perceptions about COVID-19 for these analyses. The Bayesian framework is well suited for integrating multiple data sources of varying quality, such as probability and non-probability samples.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…For our 𝑆 1 data, 𝜌 ̂𝑟𝑦 =0.006, which means that it has a large selection bias or defect (Meng, 2018) especially if the sample size is large; in our case, it is just about 1500. Survey organizations are trying to move away from probability sampling to reduce high cost (Sakshaug et al 2019 andWisniowski et al 2020). Instead, they use nonprobability sample (for example, web samples) which is less costly and easily available, but possibly brings in biases into the sample.…”
Section: Introductionmentioning
confidence: 99%