Latchezar Dimitrov scite author profile

Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10–20% (14–24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.

show abstract

Evidence for two independent prostate cancer risk–associated loci in the HNF1B gene at 17q12

Sun

et al. 2008

View full text Add to dashboard Cite

A fine mapping study in the HNF1B gene at 17q12 among two study populations revealed a second prostate cancer locus, ~26 kb centromeric to the first known locus (rs4430796); these are separated by a recombination hotspot. A SNP in the second locus (rs11649743) was confirmed in five additional populations, and P=1.7×10−9 for an allelic test in the seven combined studies. The association at each SNP remains significant after adjusting for the other SNP.

show abstract

A Combined Genomewide Linkage Scan of 1,233 Families for Prostate Cancer–Susceptibility Genes Conducted by the International Consortium for Prostate Cancer Genetics

Dimitrov

Chang

et al. 2005

The American Journal of Human Genetics

130

View full text Add to dashboard Cite

Evidence of the existence of major prostate cancer (PC)-susceptibility genes has been provided by multiple segregation analyses. Although genomewide screens have been performed in over a dozen independent studies, few chromosomal regions have been consistently identified as regions of interest. One of the major difficulties is genetic heterogeneity, possibly due to multiple, incompletely penetrant PC-susceptibility genes. In this study, we explored two approaches to overcome this difficulty, in an analysis of a large number of families with PC in the International Consortium for Prostate Cancer Genetics (ICPCG). One approach was to combine linkage data from a total of 1,233 families to increase the statistical power for detecting linkage. Using parametric (dominant and recessive) and nonparametric analyses, we identified five regions with "suggestive" linkage (LOD score >1.86): 5q12, 8p21, 15q11, 17q21, and 22q12. The second approach was to focus on subsets of families that are more likely to segregate highly penetrant mutations, including families with large numbers of affected individuals or early age at diagnosis. Stronger evidence of linkage in several regions was identified, including a "significant" linkage at 22q12, with a LOD score of 3.57, and five suggestive linkages (1q25, 8q13, 13q14, 16p13, and 17q21) in 269 families with at least five affected members. In addition, four additional suggestive linkages (3p24, 5q35, 11q22, and Xq12) were found in 606 families with mean age at diagnosis of < or = 65 years. Although it is difficult to determine the true statistical significance of these findings, a conservative interpretation of these results would be that if major PC-susceptibility genes do exist, they are most likely located in the regions generating suggestive or significant linkage signals in this large study.

show abstract

A support vector machine approach for detecting gene‐gene interaction

Chen¹,

Sun²,

Dimitrov³

et al. 2007

Genetic Epidemiology

View full text Add to dashboard Cite

Although genetic factors play an important role in most human diseases, multiple genes or genes and environmental factors may influence individual risk. In order to understand the underlying biological mechanisms of complex diseases, it is important to understand the complex relationships that control the process. In this paper, we consider different perspectives, from each optimization, complexity analysis, and algorithmic design, which allows us to describe a reasonable and applicable computational framework for detecting gene-gene interactions. Accordingly, support vector machine and combinatorial optimization techniques (local search and genetic algorithm) were tailored to fit within this framework. Although the proposed approach is computationally expensive, our results indicate this is a promising tool for the identification and characterization of high order gene-gene and gene-environment interactions. We have demonstrated several advantages of this method, including the strong power for classification, less concern for overfitting, and the ability to handle unbalanced data and achieve more stable models. We would like to make the support vector machine and combinatorial optimization techniques more accessible to genetic epidemiologists, and to promote the use and extension of these powerful approaches.

show abstract

Discovery and fine-mapping of adiposity loci using high density imputation of genome-wide association studies in individuals of African ancestry: African Ancestry Anthropometry Genetics Consortium

Ng¹,

Graff²,

Lu³

et al. 2017

PLoS Genet

102

View full text Add to dashboard Cite

Genome-wide association studies (GWAS) have identified >300 loci associated with measures of adiposity including body mass index (BMI) and waist-to-hip ratio (adjusted for BMI, WHRadjBMI), but few have been identified through screening of the African ancestry genomes. We performed large scale meta-analyses and replications in up to 52,895 individuals for BMI and up to 23,095 individuals for WHRadjBMI from the African Ancestry Anthropometry Genetics Consortium (AAAGC) using 1000 Genomes phase 1 imputed GWAS to improve coverage of both common and low frequency variants in the low linkage disequilibrium African ancestry genomes. In the sex-combined analyses, we identified one novel locus (TCF7L2/HABP2) for WHRadjBMI and eight previously established loci at P < 5×10−8: seven for BMI, and one for WHRadjBMI in African ancestry individuals. An additional novel locus (SPRYD7/DLEU2) was identified for WHRadjBMI when combined with European GWAS. In the sex-stratified analyses, we identified three novel loci for BMI (INTS10/LPL and MLC1 in men, IRX4/IRX2 in women) and four for WHRadjBMI (SSX2IP, CASC8, PDE3B and ZDHHC1/HSD11B2 in women) in individuals of African ancestry or both African and European ancestry. For four of the novel variants, the minor allele frequency was low (<5%). In the trans-ethnic fine mapping of 47 BMI loci and 27 WHRadjBMI loci that were locus-wide significant (P < 0.05 adjusted for effective number of variants per locus) from the African ancestry sex-combined and sex-stratified analyses, 26 BMI loci and 17 WHRadjBMI loci contained ≤ 20 variants in the credible sets that jointly account for 99% posterior probability of driving the associations. The lead variants in 13 of these loci had a high probability of being causal. As compared to our previous HapMap imputed GWAS for BMI and WHRadjBMI including up to 71,412 and 27,350 African ancestry individuals, respectively, our results suggest that 1000 Genomes imputation showed modest improvement in identifying GWAS loci including low frequency variants. Trans-ethnic meta-analyses further improved fine mapping of putative causal variants in loci shared between the African and European ancestry populations.

show abstract

The Interaction of Four Genes in the Inflammation Pathway Significantly Predicts Prostate Cancer Risk

Lowey

Wiklund

et al. 2005

View full text Add to dashboard Cite

It is widely hypothesized that the interactions of multiple genes influence individual risk to prostate cancer. However, current efforts at identifying prostate cancer risk genes primarily rely on single-gene approaches. In an attempt to fill this gap, we carried out a study to explore the joint effect of multiple genes in the inflammation pathway on prostate cancer risk. We studied 20 genes in the Toll-like receptor signaling pathway as well as several cytokines. For each of these genes, we selected and genotyped haplotype-tagging single nucleotide polymorphisms (SNP) among 1,383 cases and 780 controls from the CAPS (CAncer Prostate in Sweden) study population. A total of 57 SNPs were included in the final analysis. A data mining method, multifactor dimensionality reduction, was used to explore the interaction effects of SNPs on prostate cancer risk. Interaction effects were assessed for all possible n SNP combinations, where n = 2, 3, or 4. For each n SNP combination, the model providing lowest prediction error among 100 cross-validations was chosen. The statistical significance levels of the best models in each n SNP combination were determined using permutation tests. A four-SNP interaction (one SNP each from IL-10, IL-1RN, TIRAP, and TLR5) had the lowest prediction error (43.28%, P = 0.019). Our ability to analyze a large number of SNPs in a large sample size is one of the first efforts in exploring the effect of high-order gene-gene interactions on prostate cancer risk, and this is an important contribution to this new and quickly evolving field. (Cancer Epidemiol Biomarkers Prev 2005;14(11):2563 -8)

show abstract

12 3 4 5

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.