Xiaoyi Gao scite author profile

Chronic kidney disease (CKD) is a significant public health problem, and recent genetic studies have identified common CKD susceptibility variants. The CKDGen consortium performed a meta-analysis of genome-wide association data in 67,093 Caucasian individuals from 20 population-based studies to identify new susceptibility loci for reduced renal function, estimated by serum creatinine (eGFRcrea), cystatin C (eGFRcys), and CKD (eGFRcrea <60 ml/min/1.73m2; n = 5,807 CKD cases). Follow-up of the 23 genome-wide significant loci (p<5×10−8) in 22,982 replication samples identified 13 novel loci for renal function and CKD (in or near LASS2, GCKR, ALMS1, TFDP2, DAB2, SLC34A1, VEGFA, PRKAG2, PIP5K1B, ATXN2, DACH1, UBE2Q2, and SLC7A9) and 7 creatinine production and secretion loci (CPS1, SLC22A2, TMEM60, WDR37, SLC6A13, WDR72, BCAS3). These results further our understanding of biologic mechanisms of kidney function by identifying loci potentially influencing nephrogenesis, podocyte function, angiogenesis, solute transport, and metabolic functions of the kidney.

show abstract

A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms

Gao

Starmer

Martin

2008

Genetic Epidemiology

658

710

View full text Add to dashboard Cite

Multiple testing is a challenging issue in genetic association studies using large numbers of single nucleotide polymorphism (SNP) markers, many of which exhibit linkage disequilibrium (LD). Failure to adjust for multiple testing appropriately may produce excessive false positives or overlook true positive signals. The Bonferroni method of adjusting for multiple comparisons is easy to compute, but is well known to be conservative in the presence of LD. On the other hand, permutation-based corrections can correctly account for LD among SNPs, but are computationally intensive. In this work, we propose a new multiple testing correction method for association studies using SNP markers. We show that it is simple, fast and more accurate than the recently developed methods and is comparable to permutation-based corrections using both simulated and real data. We also demonstrate how it might be used in whole-genome association studies to control type I error. The efficiency and accuracy of the proposed method make it an attractive choice for multiple testing adjustment when there is high intermarker LD in the SNP data set. Genet. Epidemiol. 32:361-369, 2008.r 2008 Wiley-Liss, Inc.Key words: single nucleotide polymorphism; composite linkage disequilibrium; multiple testing correction; principal component analysis; eigenvalues INTRODUCTIONMultiple testing is a challenging issue for genetic data analysis. Candidate gene and genome-wide association studies involve statistical testing of not just a single hypothesis, but many. Even when the point-wise error rate (PWER, a p ) is set to a low level, the experiment-wise error rate (EWER, a e ) increases with the number of tests carried out. For this reason, strict significance thresholds have been recommended to control EWER [Risch and Merikangas, 1996]. However, an overly conservative approach may result in overlooking true positive signals, while an overly liberal criterion could produce excessive false positives. Šidák and Bonferroni corrections are popular approaches for controlling a e by specifying what a p values should be used for each individual test. The Šidák correction is calculated as a p ¼ 1 À ð1 À a e Þ 1=N , where N is the number of individual hypotheses to be tested [Šidák, 1967]. This correction assumes that the hypothesis tests are independent. Noting that ð1 À a p Þ N % 1 À Na p for small a p , we obtain the Bonferroni correction as Bonferroni, 1935Bonferroni, , 1936, which is an approximation to the Šidák correction.Recently, single nucleotide polymorphisms (SNPs), which are often densely genotyped, have become popular markers for genetic association studies. The closely spaced SNPs frequently yield high correlation because of extensive linkage disequilibrium (LD) among them [Wall and Pritchard, 2003]. Therefore, when association studies are conducted with many SNPs, the tests performed on each SNP are usually not independent, depending on the correlation structure among the SNPs. This violation of the independence assumption limits the Šidák and Bonferron...

show abstract

Meta-analysis identifies multiple loci associated with kidney function–related traits in east Asian populations

Okada¹,

Sim²,

Go³

et al. 2012

Nat Genet

271

280

View full text Add to dashboard Cite

Chronic kidney disease (CKD), impairment of kidney function, is a serious public health problem, and the assessment of genetic factors influencing kidney function has substantial clinical relevance. Here, we report a meta-analysis of genome-wide association studies for kidney function–related traits, including 71,149 east Asian individuals from 18 studies in 11 population-, hospital- or family-based cohorts, conducted as part of the Asian Genetic Epidemiology Network (AGEN). Our meta-analysis identified 17 loci newly associated with kidney function–related traits, including the concentrations of blood urea nitrogen, uric acid and serum creatinine and estimated glomerular filtration rate based on serum creatinine levels (eGFRcrea) (P < 5.0 × 10−8). We further examined these loci with in silico replication in individuals of European ancestry from the KidneyGen, CKDGen and GUGC consortia, including a combined total of ~110,347 individuals. We identify pleiotropic associations among these loci with kidney function–related traits and risk of CKD. These findings provide new insights into the genetics of kidney function.

show abstract

Avoiding the high Bonferroni penalty in genome‐wide association studies

Gao¹,

Becker

et al. 2009

Genetic Epidemiology

283

263

View full text Add to dashboard Cite

A major challenge in genome-wide association studies (GWASs) is to derive the multiple testing threshold when hypothesis tests are conducted using a large number of single nucleotide polymorphisms. Permutation tests are considered the gold standard in multiple testing adjustment in genetic association studies. However, it is computationally intensive, especially for GWASs, and can be impractical if a large number of random shuffles are used to ensure accuracy. Many researchers have developed approximation algorithms to relieve the computing burden imposed by permutation. One particularly attractive alternative to permutation is to calculate the effective number of independent tests, Meff, which has been shown to be promising in genetic association studies. In this study, we compare recently developed Meff methods and validate them by the permutation test with 10,000 random shuffles using two real GWAS data sets: an Illumina 1M BeadChip and an Affymetrix GeneChip® Human Mapping 500K Array Set. Our results show that the simpleM method produces the best approximation of the permutation threshold, and it does so in the shortest amount of time. We also show that Meff is indeed valid on a genome-wide scale in these data sets based on statistical theory and significance tests. The significance thresholds derived can provide practical guidelines for other studies using similar population samples and genotyping platforms.

show abstract

NRXN3 Is a Novel Locus for Waist Circumference: A Genome-Wide Association Study from the CHARGE Consortium

et al. 2009

View full text Add to dashboard Cite

Central abdominal fat is a strong risk factor for diabetes and cardiovascular disease. To identify common variants influencing central abdominal fat, we conducted a two-stage genome-wide association analysis for waist circumference (WC). In total, three loci reached genome-wide significance. In stage 1, 31,373 individuals of Caucasian descent from eight cohort studies confirmed the role of FTO and MC4R and identified one novel locus associated with WC in the neurexin 3 gene [NRXN3 (rs10146997, p = 6.4×10−7)]. The association with NRXN3 was confirmed in stage 2 by combining stage 1 results with those from 38,641 participants in the GIANT consortium (p = 0.009 in GIANT only, p = 5.3×10−8 for combined analysis, n = 70,014). Mean WC increase per copy of the G allele was 0.0498 z-score units (0.65 cm). This SNP was also associated with body mass index (BMI) [p = 7.4×10−6, 0.024 z-score units (0.10 kg/m2) per copy of the G allele] and the risk of obesity (odds ratio 1.13, 95% CI 1.07–1.19; p = 3.2×10−5 per copy of the G allele). The NRXN3 gene has been previously implicated in addiction and reward behavior, lending further evidence that common forms of obesity may be a central nervous system-mediated disorder. Our findings establish that common variants in NRXN3 are associated with WC, BMI, and obesity.

show abstract

Genome-Wide Association and Functional Follow-Up Reveals New Loci for Kidney Function

et al. 2012

View full text Add to dashboard Cite

Chronic kidney disease (CKD) is an important public health problem with a genetic component. We performed genome-wide association studies in up to 130,600 European ancestry participants overall, and stratified for key CKD risk factors. We uncovered 6 new loci in association with estimated glomerular filtration rate (eGFR), the primary clinical measure of CKD, in or near MPPED2, DDX1, SLC47A1, CDK12, CASP9, and INO80. Morpholino knockdown of mpped2 and casp9 in zebrafish embryos revealed podocyte and tubular abnormalities with altered dextran clearance, suggesting a role for these genes in renal function. By providing new insights into genes that regulate renal function, these results could further our understanding of the pathogenesis of CKD.

show abstract

Multiple testing corrections for imputed SNPs

Gao

2011

Genet. Epidemiol.

103

View full text Add to dashboard Cite

Multiple testing corrections are an active research topic in genetic association studies, especially for genome-wide association studies (GWAS), where tests of association with traits are conducted at millions of imputed SNPs with estimated allelic dosages now. Failure to address multiple comparisons appropriately can introduce excess false positive results and make subsequent studies following up those results inefficient. Permutation tests are considered the gold standard in multiple testing adjustment; however, this procedure is computationally demanding, especially for GWAS. Notably, the permutation thresholds for the huge number of estimated allelic dosages in real data sets have not been reported. Although many researchers have recently developed algorithms to rapidly approximate the permutation thresholds with accuracy similar to the permutation test, these methods have not been verified with estimated allelic dosages. In this study, we compare recently published multiple testing correction methods using 2.5M estimated allelic dosages. We also derive permutation significance levels based on 10,000 GWAS results under the null hypothesis of no association. Our results show that the simpleM method works well with estimated allelic dosages and gives the closest approximation to the permutation threshold while requiring the least computation time.

show abstract

AWclust: point-and-click software for non-parametric population structure analysis

Gao

Starmer

2008

BMC Bioinformatics

View full text Add to dashboard Cite

Background: Population structure analysis is important to genetic association studies and evolutionary investigations. Parametric approaches, e.g. STRUCTURE and L-POP, usually assume Hardy-Weinberg equilibrium (HWE) and linkage equilibrium among loci in sample population individuals. However, the assumptions may not hold and allele frequency estimation may not be accurate in some data sets. The improved version of STRUCTURE (version 2.1) can incorporate linkage information among loci but is still sensitive to high background linkage disequilibrium. Nowadays, large-scale single nucleotide polymorphisms (SNPs) are becoming popular in genetic studies. Therefore, it is imperative to have software that makes full use of these genetic data to generate inference even when model assumptions do not hold or allele frequency estimation suffers from high variation.

show abstract

12 3 4

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xiaoyi Gao

New loci associated with kidney function and chronic kidney disease

A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms

Meta-analysis identifies multiple loci associated with kidney function–related traits in east Asian populations

Avoiding the high Bonferroni penalty in genome‐wide association studies

NRXN3 Is a Novel Locus for Waist Circumference: A Genome-Wide Association Study from the CHARGE Consortium

Genome-Wide Association and Functional Follow-Up Reveals New Loci for Kidney Function

Multiple testing corrections for imputed SNPs

AWclust: point-and-click software for non-parametric population structure analysis

Contact Info

Product

Resources

About