The African continent is regarded as the cradle of modern humans and African genomes contain more genetic variation than those from any other continent, yet only a fraction of the genetic diversity among African individuals has been surveyed1. Here we performed whole-genome sequencing analyses of 426 individuals—comprising 50 ethnolinguistic groups, including previously unsampled populations—to explore the breadth of genomic diversity across Africa. We uncovered more than 3 million previously undescribed variants, most of which were found among individuals from newly sampled ethnolinguistic groups, as well as 62 previously unreported loci that are under strong selection, which were predominantly found in genes that are involved in viral immunity, DNA repair and metabolism. We observed complex patterns of ancestral admixture and putative-damaging and novel variation, both within and between populations, alongside evidence that Zambia was a likely intermediate site along the routes of expansion of Bantu-speaking populations. Pathogenic variants in genes that are currently characterized as medically relevant were uncommon—but in other genes, variants denoted as ‘likely pathogenic’ in the ClinVar database were commonly observed. Collectively, these findings refine our current understanding of continental migration, identify gene flow and the response to human disease as strong drivers of genome-level population variation, and underscore the scientific imperative for a broader characterization of the genomic diversity of African individuals to understand human ancestry and improve health.
A Correction to this paper has been published: https://doi.org/10.1038/s41586-021-03286-9.
Genome-wide association studies (GWAS) provide huge information on statistically significant single-nucleotide polymorphisms (SNPs) associated with various human complex traits and diseases. By performing GWAS studies, scientists have successfully identified the association of hundreds of thousands to millions of SNPs to a single phenotype. Moreover, the association of some SNPs with rare diseases has been intensively tested. However, classic GWAS studies have not yet provided solid, knowledgeable insight into functional and biological mechanisms underlying phenotypes or mechanisms of diseases. Therefore, several post-GWAS (pGWAS) methods have been recommended. Currently, there is no simple scientific document to provide a quick guide for performing pGWAS analysis. pGWAS is a crucial step for a better understanding of the biological machinery beyond the SNPs. Here, we provide an overview to performing pGWAS analysis and demonstrate the challenges behind each method. Furthermore, we direct readers to key articles for each pGWAS method and present the overall issues in pGWAS analysis. Finally, we include a custom pGWAS pipeline to guide new users when performing their research.
BackgroundSchizophrenia is a severe mental disorder affecting >21 million people worldwide. Some genetic studies reported that single nucleotide polymorphism (SNP) involving variant rs1344706 from the ZNF804A gene in human beings is associated with the risk of schizophrenia in several populations. Similar results tend to conflict with other reports in literature, indicating that no true significant association exists between rs1344706 and schizophrenia. We seek to determine the level of association of this SNP with schizophrenia in the Asian population using more recent genome-wide association study (GWAS) datasets.MethodsApplying a computational approach with inclusion of more recent GWAS datasets, we conducted a meta-analysis to examine the level of association of SNP rs1344706 and the risk of schizophrenia disorder among the Asian population constituting Chinese, Indonesians, Japanese, Kazakhs and Singaporeans. For a total of 21 genetic studies, including a total of 28,842 cases and 35,630 controls, regression analysis, publication bias, Cochran’s Q and I2 tests were performed. The DerSimonian and Laird random-effects model was used to assess the association of the genetic variant to schizophrenia. Leave-one-out sensitivity analysis was also conducted to determine the influence of each study on the final outcome of the association study.ResultsOur summarized analysis for Asian population revealed a pooled odds ratio of 1.06, 95% confidence interval of 1.01–1.11 and two-tailed P-value of 0.0228. Our test for heterogeneity showed the presence of large heterogeneity (I2=53.44%, P =0.00207) and Egger’s regression test (P =0.8763) and Begg’s test (P =0.8347), indicating no presence of publication bias among our selected studies. In our sensitivity analysis, 10 different studies comprising of ~50% of the entire study had an impact on our final results as each leave-one-out test became insignificant. Our result suggests that genetic variant rs1344706 might be associated with the development of schizophrenia in Asians.
Motivation Post-GWAS analysis is designed to decipher the functional consequences of significant Single Nucleotide Polymorphisms (SNP) in the era of Genome Wide Association Studies (GWAS). This can be translated into research insights and clinical benefits such as effectiveness of strategies for disease screening, treatment and prevention. However, the setup of post-GWAS (pGWAS) tools can be quite complicated and it mostly requires big data. The challenge however is, scientists are required to have sufficient experience with several of these technically complex and complicated tools in order to complete the pGWAS analysis. Results We present SysBiolPGWAS, a post-GWAS web application that provides a comprehensive functionality for biologists and non-bioinformaticians to conduct several post-GWAS analyses to overcome the above challenges. It provides unique functionalities for analysis involving multi-omics datasets and visualization using various bioinformatics tools. SysBiolPGWAS provides access to individual post-GWAS tools and a novel custom post-GWAS pipeline which integrates several individual post-GWAS tools and data. The SysBiolPGWAS app was developed to be a one-stop shop for post-GWAS analysis. It targets researchers in the area of the human genome and performs its analysis mainly in the autosomal chromosomes. Availability SysBiolPGWAS web app was developed using JavaScript/TypeScript web frameworks and is available at: https://spgwas.waslitbre.org/. Supplementary information All codes are available in this GitHub repository https://github.com/covenant-university-bioinformatics.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.