2022
DOI: 10.3389/fgene.2022.899523
|View full text |Cite
|
Sign up to set email alerts
|

A Principal Component Informed Approach to Address Polygenic Risk Score Transferability Across European Cohorts

Abstract: One important confounder in genome-wide association studies (GWASs) is population genetic structure, which may generate spurious associations if not properly accounted for. This may ultimately result in a biased polygenic risk score (PRS) prediction, especially when applied to another population. To explore this matter, we focused on principal component analysis (PCA) and asked whether a population genetics informed strategy focused on PCs derived from an external reference population helps in mitigating this … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 45 publications
(68 reference statements)
1
2
0
Order By: Relevance
“…In other words: although we took measures to control for confounders by including multiple geographic and socio-economic variables into our models, only by enriching for biologically-relevant regions, and by controlling for genome-wide covA stratification, we could expose robust and consistent signals. This finding mirrors the known difficulty in discerning functional gene-trait associations from spurious correlations mediated by genetic structure 1921,25,27,28 , which are indeed independent between different populations and different biobanks with different recruitment strategies. In addition, while combining GWAS-derived effect sizes at genome scale is sensitive to subtle correction-surviving biases 26 , our approach does not rely on such methodology.…”
Section: Discussionsupporting
confidence: 65%
See 1 more Smart Citation
“…In other words: although we took measures to control for confounders by including multiple geographic and socio-economic variables into our models, only by enriching for biologically-relevant regions, and by controlling for genome-wide covA stratification, we could expose robust and consistent signals. This finding mirrors the known difficulty in discerning functional gene-trait associations from spurious correlations mediated by genetic structure 1921,25,27,28 , which are indeed independent between different populations and different biobanks with different recruitment strategies. In addition, while combining GWAS-derived effect sizes at genome scale is sensitive to subtle correction-surviving biases 26 , our approach does not rely on such methodology.…”
Section: Discussionsupporting
confidence: 65%
“…A plethora of methods [11][12][13][14] have been developed to correct for these unwanted sources of variance that might bias GWAS discovery. Indeed even the finer cases of population structure present in national Biobanks [15][16][17] have been demonstrated to affect GWAS [18][19][20][21][22][23] and, if not carefully addressed, hamper analyses following up on these results, such as Polygenic Risk Scoring 19,21,22,[24][25][26] and polygenic selection testing 27,28 .…”
Section: Introductionmentioning
confidence: 99%
“…The current model is designed for single ancestry analysis. While a large portion of GWAS data currently comes from individuals of European ancestry, it is well-known that polygenic risk scores do not transfer well between individuals of different ancestries, which can impact their utility for patients of non-European ancestry [64][65][66][67][68][69] . Many methods that utilize summary data from multiple populations have already been proposed and demonstrate improved prediction in under-represented populations 25,29,30,[70][71][72][73][74] .…”
Section: Discussionmentioning
confidence: 99%