Matthias Neumair scite author profile

Contact tracing is one of several strategies employed in many countries to curb the spread of SARS-CoV-2. Digital contact tracing (DCT) uses tools such as cell-phone applications to improve tracing speed and reach. We model the impact of DCT on the spread of the virus for a large epidemiological parameter space consistent with current literature on SARS-CoV-2. We also model DCT in combination with random testing (RT) and social distancing (SD). Modelling is done with two independently developed individual-based (stochastic) models that use the Monte Carlo technique, benchmarked against each other and against two types of deterministic models. For current best estimates of the number of asymptomatic SARS-CoV-2 carriers (approximately 40\%), their contagiousness (similar to that of symptomatic carriers), the reproductive number before interventions (R0 at least 3) we find that DCT must be combined with other interventions such as SD and/or RT to push the reproductive number below one. At least 60\% of the population would have to use the DCT system for its effect to become significant. On its own, DCT cannot bring the reproductive number below 1 unless nearly the entire population uses the DCT system and follows quarantining and testing protocols strictly. For lower uptake of the DCT system, DCT still reduces the number of people that become infected. When DCT is deployed in a population with an ongoing outbreak where O(0.1\%) of the population have already been infected, the gains of the DCT intervention come at the cost of requiring up to 15% of the population to be quarantined (in response to being traced) on average each day for the duration of the epidemic, even when there is sufficient testing capability to test every traced person.

show abstract

The impact of digital contact tracing on the SARS-CoV-2 pandemic—a comprehensive modelling study

Pollmann

Schönert

Müller

et al. 2021

EPJ Data Sci.

View full text Add to dashboard Cite

Contact tracing is one of several strategies employed in many countries to curb the spread of SARS-CoV-2. Digital contact tracing (DCT) uses tools such as cell-phone applications to improve tracing speed and reach. We model the impact of DCT on the spread of the virus for a large epidemiological parameter space consistent with current literature on SARS-CoV-2. We also model DCT in combination with random testing (RT) and social distancing (SD).Modelling is done with two independently developed individual-based (stochastic) models that use the Monte Carlo technique, benchmarked against each other and against two types of deterministic models.For current best estimates of the number of asymptomatic SARS-CoV-2 carriers (approximately 40%), their contagiousness (similar to that of symptomatic carriers), the reproductive number before interventions (${R_{0}}$ R 0 at least 3) we find that DCT must be combined with other interventions such as SD and/or RT to push the reproductive number below one. At least 60% of the population would have to use the DCT system for its effect to become significant. On its own, DCT cannot bring the reproductive number below 1 unless nearly the entire population uses the DCT system and follows quarantining and testing protocols strictly. For lower uptake of the DCT system, DCT still reduces the number of people that become infected.When DCT is deployed in a population with an ongoing outbreak where $\mathcal{O}$ O (0.1%) of the population have already been infected, the gains of the DCT intervention come at the cost of requiring up to 15% of the population to be quarantined (in response to being traced) on average each day for the duration of the epidemic, even when there is sufficient testing capability to test every traced person.

show abstract

Accommodating heterogeneous missing data patterns for prostate cancer risk prediction

Neumair

Kattan

Freedland

et al. 2022

BMC Med Res Methodol

View full text Add to dashboard Cite

Background We compared six commonly used logistic regression methods for accommodating missing risk factor data from multiple heterogeneous cohorts, in which some cohorts do not collect some risk factors at all, and developed an online risk prediction tool that accommodates missing risk factors from the end-user. Methods Ten North American and European cohorts from the Prostate Biopsy Collaborative Group (PBCG) were used for fitting a risk prediction tool for clinically significant prostate cancer, defined as Gleason grade group ≥ 2 on standard TRUS prostate biopsy. One large European PBCG cohort was withheld for external validation, where calibration-in-the-large (CIL), calibration curves, and area-underneath-the-receiver-operating characteristic curve (AUC) were evaluated. Ten-fold leave-one-cohort-internal validation further validated the optimal missing data approach. Results Among 12,703 biopsies from 10 training cohorts, 3,597 (28%) had clinically significant prostate cancer, compared to 1,757 of 5,540 (32%) in the external validation cohort. In external validation, the available cases method that pooled individual patient data containing all risk factors input by an end-user had best CIL, under-predicting risks as percentages by 2.9% on average, and obtained an AUC of 75.7%. Imputation had the worst CIL (-13.3%). The available cases method was further validated as optimal in internal cross-validation and thus used for development of an online risk tool. For end-users of the risk tool, two risk factors were mandatory: serum prostate-specific antigen (PSA) and age, and ten were optional: digital rectal exam, prostate volume, prior negative biopsy, 5-alpha-reductase-inhibitor use, prior PSA screen, African ancestry, Hispanic ethnicity, first-degree prostate-, breast-, and second-degree prostate-cancer family history. Conclusion Developers of clinical risk prediction tools should optimize use of available data and sources even in the presence of high amounts of missing data and offer options for users with missing risk factors.

show abstract

Active Data Science for Improving Clinical Risk Prediction

Ankerst¹,

Neumair²

2022

View full text Add to dashboard Cite

Clinical risk prediction models are commonly developed in a post-hoc and passive fashion, capitalizing on convenient data from completed clinical trials or retrospective cohorts. Impacts of the models often end at their publication rather than with the patients. The field of clinical risk prediction is rapidly improving in a progressively more transparent data science era. Based on collective experience over the past decade by the Prostate Biopsy Collaborative Group (PBCG), this paper proposes the following four data science-driven strategies for improving clinical risk prediction to the benefit of clinical practice and research. The first proposed strategy is to actively design prospective data collection, monitoring, analysis and validation of risk tools following the same standards as for clinical trials in order to elevate the quality of training data. The second suggestion is to make risk tools and model formulas available online. User-friendly risk tools will bring quantitative information to patients and their clinicians for improved knowledge-based decision-making. As past experience testifies, online tools expedite independent validation, providing helpful information as to whether the tools are generalizable to new populations. The third proposal is to dynamically update and localize risk tools to adapt to changing demographic and clinical landscapes. The fourth strategy is to accommodate systematic missing data patterns across cohorts in order to maximize the statistical power in model training, as well as to accommodate missing information on the end-user side too, in order to maximize utility for the public.

show abstract

Globally accessible end-user-friendly prostate cancer risk prediction tools based on contemporary cohorts with heterogeneous missing risk factors

Neumair

Kattan

Freedland

et al. 2022

Preprint

View full text Add to dashboard Cite

Background: Missing risk factors, whether random or not measured at all, across different hospitals and patients provide challenges, both for the developers of online clinical risk tools and the patients trying to use such tools. This paper provides a development and end-user solution to the commonly encountered limitations of clinical risk tool based decision making due to missing information.Methods: Six state-of-the-art logistic regression approaches accommodating missing data were compared using prostate cancer data from ten North American and European cohorts from the Prostate Biopsy Collaborative Group (PBCG). An additional large European PBCG cohort was withheld for external validation, where calibration-in-the-large (CIL), calibration curves, and area-underneath-the-receiver-operating characteristic curve (AUC) were evaluated. Ten-fold leave-one-cohort-internal validation further validated the optimal missing data approach.Results: Among 12,703 biopsies from 10 training cohorts, 3,597 (28%) had clinically significant prostate cancer, compared to 1,757 of 5,540 (32%) in the external validation cohort. In external validation, the available cases method that pooled individual patient data containing all risk factors input by an end-user had best CIL, under-predicting risks as percentages by 2.9% on average, and obtained an AUC of 75.7%. Imputation had the worst CIL (-13.3%). The available cases method was further validated as optimal in internal cross-validation and thus used for development of an online risk tool. For end-users of the risk tool, two risk factors were mandatory: serum prostate-specific antigen (PSA) and age, and ten were optional: digital rectal exam, prostate volume, prior negative biopsy, 5-alpha-reductase-inhibitor use, prior PSA screen, African ancestry, Hispanic ethnicity, first-degree prostate-, breast-, and second-degree prostate-cancer family history.Conclusion: Developers of clinical risk prediction tools should optimize use of available data and sources even in the presence of high amounts of missing data and offer options for users with missing risk factors.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.