2022
DOI: 10.1007/s00362-022-01296-x

Variable selection in Propensity Score Adjustment to mitigate selection bias in online surveys

Abstract: The development of new survey data collection methods such as online surveys has been particularly advantageous for social studies in terms of reduced costs, immediacy and enhanced questionnaire possibilities. However, many such methods are strongly affected by selection bias, leading to unreliable estimates. Calibration and Propensity Score Adjustment (PSA) have been proposed as methods to remove selection bias in online nonprobability surveys. Calibration requires population totals to be known for the auxili…
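The PSA idea the abstract refers to can be summarised in a few lines. The sketch below is a generic illustration, not the paper's variable-selection procedure: it assumes a nonprobability sample and a reference probability sample sharing the same covariates, fits a logistic sample-membership model, and weights volunteer units by (1 − p)/p, one common PSA weight choice. All names (psa_weights, X_np, X_ref) are illustrative.

```python
# Minimal PSA sketch (generic method, not the paper's exact procedure).
import numpy as np
from sklearn.linear_model import LogisticRegression

def psa_weights(X_np, X_ref):
    """Return PSA weights for the nonprobability sample X_np, using a
    reference probability sample X_ref with the same covariates."""
    X = np.vstack([X_np, X_ref])
    # 1 = nonprobability (volunteer) unit, 0 = reference-sample unit
    s = np.r_[np.ones(len(X_np)), np.zeros(len(X_ref))]
    p = LogisticRegression(max_iter=1000).fit(X, s).predict_proba(X_np)[:, 1]
    return (1.0 - p) / p  # one common inverse-propensity weight choice

# Hypothetical usage: estimate a mean observed only in the volunteer sample.
# rng = np.random.default_rng(0)
# X_np, X_ref = rng.normal(1, 1, (500, 3)), rng.normal(0, 1, (800, 3))
# y_np = X_np[:, 0] + rng.normal(size=500)
# w = psa_weights(X_np, X_ref)
# y_hat = (w * y_np).sum() / w.sum()
```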

Cited by 7 publications (6 citation statements) · References 56 publications
“…The inclusion of these response types could potentially be achieved through various types of hierarchical modelling to account for pseudo‐replication, propensity score weighting to calibrate for nonprobability (Ferri‐García & Rueda, 2022), or multilevel regressions with poststratification adjustments (Mercer et al., 2017). However, such techniques may only be viable with larger data sets, may require extensive knowledge of model assumptions, are imperfect in reducing bias and still risk inflating covariance‐based estimates even with small proportions of invalid respondents (Copas et al., 2020; Dever et al., 2008; Guo et al., 2020; King et al., 2018).…”
Section: Results From Suspicion Variable Analysis
Citation type: mentioning; confidence: 99%
“…Although the elimination of possible careless responders would result in the loss of some valuable information from our study, we deemed it appropriate to eliminate all possible fraudulent responses due to the relatively small sample size and implications that their inclusion could have on correlational statistics. Various statistical modelling techniques can assist researchers wanting to include careless responders or other fraudulent responses (Copas et al., 2020; Dever et al., 2008; Ferri-García & Rueda, 2022; Mercer et al., 2017); however, a similar level of concern should be granted to analyses considering the use of statistical adjustments. Furthermore, researchers who use these techniques should still report the presence of fraud in their surveys and the extent to which the modelling adjustments may differ with the exclusion of that potential fraud.…”
Section: Removing Survey Responses
Citation type: mentioning; confidence: 99%
“…[24] Groups were compared to assess the significance of differences before and after propensity score-based IPTW (inverse probability of treatment weighting), using measures of association: Cramér's V for categorical variables and R-squared for continuous variables. [25] Mann–Whitney tests were used for continuous variables, whereas Pearson χ² tests were used for categorical variables, including endpoints.…”
Section: Discussion
Citation type: mentioning; confidence: 99%
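The balance check the quoted study describes can be illustrated with a short sketch: Cramér's V between group membership and a categorical covariate, computed from an unweighted and an IPTW-weighted contingency table. The data layout and the weighted-table construction here are assumptions for illustration, not the study's code.

```python
# Hedged sketch of a covariate-balance diagnostic before/after IPTW.
import numpy as np
from scipy.stats import chi2_contingency

def cramers_v(table):
    """Cramér's V from a (possibly weighted) contingency table."""
    chi2 = chi2_contingency(table)[0]
    n = table.sum()
    r, c = table.shape
    return np.sqrt(chi2 / (n * (min(r, c) - 1)))

def weighted_crosstab(group, covariate, weights):
    """Contingency table whose cells are sums of unit weights."""
    groups, levels = np.unique(group), np.unique(covariate)
    table = np.zeros((len(groups), len(levels)))
    for i, g in enumerate(groups):
        for j, v in enumerate(levels):
            table[i, j] = weights[(group == g) & (covariate == v)].sum()
    return table

# v_before = cramers_v(weighted_crosstab(group, x_cat, np.ones(len(group))))
# v_after  = cramers_v(weighted_crosstab(group, x_cat, iptw_weights))  # hypothetical weights
```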
“…Therefore, although weighting within classes is a commonly used procedure for non-response cross-sectional and longitudinal weighting in panels, a more pragmatic alternative is to use a regression-based approach, all the more so when numerous auxiliary variables are available [18]. For this we are going to use the popular Propensity Score Adjustment (PSA) method [20,29,30] to model the probability that a unit $k$ of the new theoretical sample $s^{(j)}$ responds to $M_j$, where $j = 1, \ldots, t$, or that another unit $k$ of the effective sample $s_r^{(i)}$ responds to $M_j$, where $i = 1, \ldots, j-1$, $j = 2, \ldots, t$, and $i < j$.…”
Section: Weight Adjustment Based On Propensities
Citation type: mentioning; confidence: 99%
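A minimal sketch of the wave-by-wave propensity weighting described in the quote, under assumed data structures (it is not the cited implementation): for each occasion $M_j$, a response-propensity model is fitted among units still in the panel and their weights are divided by the estimated propensity. The column names ('w', 'responded_{j}') and the logistic model are illustrative choices.

```python
# Illustrative sequential response-propensity weighting for a panel.
import pandas as pd
from sklearn.linear_model import LogisticRegression

def wave_adjusted_weights(df, covariates, waves):
    """df has one row per panel unit; column 'w' holds the starting design
    weight and 'responded_{j}' flags response to occasion M_j (0/1)."""
    w = df["w"].astype(float).copy()
    alive = pd.Series(True, index=df.index)  # units still responding
    for j in waves:
        X = df.loc[alive, covariates]
        resp = df.loc[alive, f"responded_{j}"]
        p = LogisticRegression(max_iter=1000).fit(X, resp).predict_proba(X)[:, 1]
        w.loc[alive] = w.loc[alive] / p      # inverse-propensity update
        alive = alive & (df[f"responded_{j}"] == 1)
    return w[alive]  # adjusted weights for units observed at every wave
```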
“…
• Learning rate ∈ [0.001, 0.9]: the weight shrinkage applied after each boosting step.
• Maximum depth ∈ [1, 30]: the maximum number of splits that each tree can contain.
• Minimum child weight ∈ [0, 10]: the minimum total of instance weights needed to consider a new partition.…”
Section: Modelling Non-response
Citation type: mentioning; confidence: 99%
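These ranges read like a hyperparameter search space for gradient boosting used in non-response modelling. Below is a minimal sketch of how such a space might be searched, assuming XGBoost's scikit-learn interface; n_estimators, n_iter, cv and the scoring rule are arbitrary illustration choices not taken from the cited paper.

```python
# Randomized search over the quoted hyperparameter ranges (assumed XGBoost setup).
from scipy.stats import randint, uniform
from sklearn.model_selection import RandomizedSearchCV
from xgboost import XGBClassifier

# Ranges taken from the quote; distributions are uniform over them.
param_space = {
    "learning_rate": uniform(0.001, 0.899),  # [0.001, 0.9]
    "max_depth": randint(1, 31),             # integers 1..30
    "min_child_weight": uniform(0, 10),      # [0, 10]
}
search = RandomizedSearchCV(
    XGBClassifier(n_estimators=200, eval_metric="logloss"),
    param_space, n_iter=50, cv=5, scoring="neg_log_loss", random_state=0,
)
# search.fit(X, y)  # X: auxiliary covariates; y: response indicator (placeholders)
```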