Multi-trait GWAS for diverse ancestries: Mapping the knowledge gap
Lucie Troubat,
Deniz Fettahoglu,
Léo Henches
et al.
Abstract:Background: Approximately 95% of samples analyzed in univariate genome-wide association studies (GWAS) are of European ancestry. This bias toward European ancestry populations in association screening also exists for other analyses and methods that are often developed and tested on European ancestry only. However, existing data in non-European populations, which are often of modest sample size, could benefit from innovative approaches as recently illustrated in the context of polygenic risk scores. Methods: He… Show more
“…Figure 3 presents the correlation between each feature and the number of univariate and multi-trait GWAS associated loci. As expected, the number of univariate associated loci was positively correlated with the number of traits, mean Neff, the mean h can lead to a lack of robustness of the omnibus test 10 . We checked how the condition number 𝜅 * was related to JASS gain (Fig.…”
Section: Multi-trait Versus Univariate Gwas Across 19266 Random Setssupporting
confidence: 71%
“…Overall, this suggests that the multi-trait test can be highly complementary to the univariate test, performing better in situations where the univariate tests display low power. We noted in a recent study 10 that a high multicollinearity of the matrix underlying the null hypothesis (Σ r ) can lead to a lack of robustness of the omnibus test 10 . We checked how the condition number Σ r was related to JASS gain ( Fig.…”
Section: Resultsmentioning
confidence: 99%
“…Selecting clinically homogenous traits is the strategy most commonly used 4,8, 10,14–19 . In our previous large-scale analysis 8 , an heterogeneous set yielded the largest number of new association as compared to clinically homogenous sets.…”
Section: Discussionmentioning
confidence: 99%
“…This focus should not lead the reader to think that multi-trait GWAS is useful only on large sample studies of European ancestry. We actually recently updated the JASS pipeline to run a Multi-ancestry Multi-trait GWAS, which was able to detect 367 new association loci, despite the modest sample size of the non-European cohorts used 10 . Future work might leverage non-European existing 28 and upcoming biobanks 29 to investigate the validity of our results for non-European ancestries.…”
Since the first Genome-Wide Association Studies (GWAS), thousands of variant-trait associations have been discovered. However, the sample size required to detect additional variants using standard univariate association screening is increasingly prohibitive. Multi-trait GWAS offers a relevant alternative: it can improve statistical power and lead to new insights about gene function and the joint genetic architecture of human phenotypes. Although many methodological hurdles of multi-trait testing have been discussed, the strategy to select trait, among overwhelming possibilities, has been overlooked. In this study, we conducted extensive multi-trait tests using JASS (Joint Analysis of Summary Statistics) and assessed which genetic features of the analysed sets were associated with an increased detection of variants as compared to univariate screening. Our analyses identified multiple factors associated with the gain in the association detection in multi-trait tests. Together, these factors of the analysed sets are predictive of the gain of the multi-trait test (Pearson's rho equal to 0.43 between the observed and predicted gain, P < 1.6 x 10-60). Applying an alternative multi-trait approach (MTAG, multi-trait analysis of GWAS), we found that in most scenarios but particularly those with larger numbers of traits, JASS outperformed MTAG. Finally, we benchmark several strategies to select set of traits including the prevalent strategy of selecting clinically similar traits, which systematically underperformed selecting clinically heterogenous traits or selecting sets that issued from our data-driven models. This work provides a unique picture of the determinant of multi-trait GWAS statistical power and outline practical strategies for multi-trait testing.
“…Figure 3 presents the correlation between each feature and the number of univariate and multi-trait GWAS associated loci. As expected, the number of univariate associated loci was positively correlated with the number of traits, mean Neff, the mean h can lead to a lack of robustness of the omnibus test 10 . We checked how the condition number 𝜅 * was related to JASS gain (Fig.…”
Section: Multi-trait Versus Univariate Gwas Across 19266 Random Setssupporting
confidence: 71%
“…Overall, this suggests that the multi-trait test can be highly complementary to the univariate test, performing better in situations where the univariate tests display low power. We noted in a recent study 10 that a high multicollinearity of the matrix underlying the null hypothesis (Σ r ) can lead to a lack of robustness of the omnibus test 10 . We checked how the condition number Σ r was related to JASS gain ( Fig.…”
Section: Resultsmentioning
confidence: 99%
“…Selecting clinically homogenous traits is the strategy most commonly used 4,8, 10,14–19 . In our previous large-scale analysis 8 , an heterogeneous set yielded the largest number of new association as compared to clinically homogenous sets.…”
Section: Discussionmentioning
confidence: 99%
“…This focus should not lead the reader to think that multi-trait GWAS is useful only on large sample studies of European ancestry. We actually recently updated the JASS pipeline to run a Multi-ancestry Multi-trait GWAS, which was able to detect 367 new association loci, despite the modest sample size of the non-European cohorts used 10 . Future work might leverage non-European existing 28 and upcoming biobanks 29 to investigate the validity of our results for non-European ancestries.…”
Since the first Genome-Wide Association Studies (GWAS), thousands of variant-trait associations have been discovered. However, the sample size required to detect additional variants using standard univariate association screening is increasingly prohibitive. Multi-trait GWAS offers a relevant alternative: it can improve statistical power and lead to new insights about gene function and the joint genetic architecture of human phenotypes. Although many methodological hurdles of multi-trait testing have been discussed, the strategy to select trait, among overwhelming possibilities, has been overlooked. In this study, we conducted extensive multi-trait tests using JASS (Joint Analysis of Summary Statistics) and assessed which genetic features of the analysed sets were associated with an increased detection of variants as compared to univariate screening. Our analyses identified multiple factors associated with the gain in the association detection in multi-trait tests. Together, these factors of the analysed sets are predictive of the gain of the multi-trait test (Pearson's rho equal to 0.43 between the observed and predicted gain, P < 1.6 x 10-60). Applying an alternative multi-trait approach (MTAG, multi-trait analysis of GWAS), we found that in most scenarios but particularly those with larger numbers of traits, JASS outperformed MTAG. Finally, we benchmark several strategies to select set of traits including the prevalent strategy of selecting clinically similar traits, which systematically underperformed selecting clinically heterogenous traits or selecting sets that issued from our data-driven models. This work provides a unique picture of the determinant of multi-trait GWAS statistical power and outline practical strategies for multi-trait testing.
“…One of the pitfalls of GWAS is that the result is easily confounded by different ancestries of selected samples, case-control imbalance, various sequencing methods, and linkage disequilibrium, leading to poor stability and transferability across different study groups [37][38][39]. However, with the progression of GWAS analysis tools, these confounding factors could be minimized to a lower level.…”
Section: Genome-wide Association Study (Gwas) Of Nscl/p Worldwidementioning
Non-syndromic cleft lip with or without palate (NSCL/P) is a prevalent birth defect that affects 1/500–1/1400 live births globally. The genetic basis of NSCL/P is intricate and involves both genetic and environmental factors. In the past few years, various genetic inheritance models have been proposed to elucidate the underlying mechanisms of NSCL/P. These models range from simple monogenic inheritance to more complex polygenic inheritance. Here, we present a comprehensive overview of the genetic inheritance model of NSCL/P exemplified by representative genes and regions from both monogenic and polygenic perspectives. We also summarize existing association studies and corresponding loci of NSCL/P within the Chinese population and highlight the potential of utilizing polygenic risk scores for risk stratification of NSCL/P. The potential application of polygenic models offers promising avenues for improved risk assessment and personalized approaches in the prevention and management of NSCL/P individuals.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.