2022
DOI: 10.1080/07357907.2022.2084621
|View full text |Cite
|
Sign up to set email alerts
|

The Big Data Paradox in Clinical Practice

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 25 publications
(24 citation statements)
references
References 43 publications
1
7
0
Order By: Relevance
“…If biobanks are not representative of the target population, they are vulnerable to selection bias, 3,[19][20][21][22] a naïve analysis is not expected to align with the population truth. [23][24][25] Handling selection bias is particularly challenging because the magnitude and direction of its impact on estimates are hard to determine, 26 its effect cannot be mitigated by increasing sample size, 27,28 and it can be coupled with other data imperfections. 3,[29][30][31][32] Moreover, contrary to previous arguments, 33,34 recent evidence suggests that even genetic association analyses with inherited germline susceptibility factors can also be prone to selection bias.…”
Section: Background and Significancementioning
confidence: 99%
“…If biobanks are not representative of the target population, they are vulnerable to selection bias, 3,[19][20][21][22] a naïve analysis is not expected to align with the population truth. [23][24][25] Handling selection bias is particularly challenging because the magnitude and direction of its impact on estimates are hard to determine, 26 its effect cannot be mitigated by increasing sample size, 27,28 and it can be coupled with other data imperfections. 3,[29][30][31][32] Moreover, contrary to previous arguments, 33,34 recent evidence suggests that even genetic association analyses with inherited germline susceptibility factors can also be prone to selection bias.…”
Section: Background and Significancementioning
confidence: 99%
“…Recently, I coined the term "data confession" [26] to encourage more disclosures in research publications about defects in data conceptualization, collection or pre-processing, as another component in enhancing the replicability and ultimately the reliability of published scientific studies, since data quality matters far more than data quantity [23,4,27,3]. The retrospective introspection summarized above suggests a more general data science confession (DSC) in our publications, where we can benefit from each others' mistakes and lessons learned, especially how we reason with ourselves, where we can engage in a pure intellectual dialogue without being distracted by suspicions of impure motivations.…”
Section: Taking a Lead In Data (Science) Confessionmentioning
confidence: 99%
“…As more variables are added, a causal graph may become too complex to interpret usefully. An important practical goal thus may be to reduce an overly complex causal graph to a more coarse-grained causal description that can be used to deal with the problem at hand [ 28 , 29 , 30 , 31 ]. For example, the causal diagram in Figure 2 C may be appropriate if a drug that modifies EMT is being studied.…”
Section: Causal Diagramsmentioning
confidence: 99%