Increasing the Power to Detect Causal Associations by Combining Genotypic and Expression Data in Segregating Populations

Zhu, Jun; Wiener, Matthew C.; Zhang, Chunsheng; Fridman, Arthur; Minch, Eric; Lum, Pek Yee; Sachs, Jeffrey R.; Schadt, Eric E.

doi:10.1371/journal.pcbi.0030069

Cited by 197 publications

(211 citation statements)

References 39 publications


“…Phenomics has a chance of changing how we view heritable diseases first by defining latent phenotypes that underlie genetically similar phenotype categories and second by revealing unexpected genetic links among disease entities. (75) The incorporation of genetic data clearly improves the quality of the predictions over those derived solely from trait correlations, (76) although phenotypic overlap is often a very good predictor of functional relatedness of the underlying genes. (77) Thus van Driel and colleagues (78) found that similarity between phenotypes did correlate positively with several measures of gene function, including relatedness at the level of protein sequence, protein motifs, functional annotation, and direct protein-protein interaction.…”

Section: Discussion (mentioning)

confidence: 99%

Identification of homogeneous genetic architecture of multiple genetically correlated traits by block clustering of genome-wide associations

Gupta, Cheung, Hsu et al. 2011

Journal of Bone and Mineral Research


Genome-wide association studies (GWAS) using high-density genotyping platforms offer an unbiased strategy to identify new candidate genes for osteoporosis. It is imperative to be able to clearly distinguish signal from noise by focusing on the best phenotype in a genetic study. We performed GWAS of multiple phenotypes associated with fractures [bone mineral density (BMD), bone quantitative ultrasound (QUS), bone geometry, and muscle mass] with approximately 433,000 single-nucleotide polymorphisms (SNPs) and created a database of resulting associations. We performed analysis of GWAS data from 23 phenotypes by a novel modification of a block clustering algorithm followed by gene-set enrichment analysis. A data matrix of standardized regression coefficients was partitioned along both axes: SNPs and phenotypes. Each partition represents a distinct cluster of SNPs that have similar effects over a particular set of phenotypes. Application of this method to our data shows several SNP-phenotype connections. We found a strong cluster of association coefficients of high magnitude for 10 traits (BMD at several skeletal sites, ultrasound measures, cross-sectional bone area, and section modulus of femoral neck and shaft). These clustered traits were highly genetically correlated. Gene-set enrichment analyses indicated the augmentation of genes that cluster with the 10 osteoporosis-related traits in pathways such as aldosterone signaling in epithelial cells, role of osteoblasts, osteoclasts, and chondrocytes in rheumatoid arthritis, and Parkinson signaling. In addition to several known candidate genes, we also identified PRKCH and SCNN1B as potential candidate genes for multiple bone traits. In conclusion, our mining of GWAS results revealed the similarity of association results between bone strength phenotypes that may be attributed to pleiotropic effects of genes. This knowledge may prove helpful in identifying novel genes and pathways that underlie several correlated phenotypes, as well as in deciphering genetic and phenotypic modularity underlying osteoporosis risk.


“…From a statistical perspective, one can use the results of an eQTL study to prioritize a list of disease-associated loci to follow up on; that is, one can use the existence of a SNP-gene-expression association as prior evidence that variation at the locus is more likely to have disease consequences (2, 6). Furthermore, eQTL studies can infuse causal information into gene-gene and protein-protein correlation networks by making use of the fact that DNA can affect gene expression, but not the other way around (1, 7, 8, 9). Finally, the utility of eQTL studies is likely to increase as larger and more diverse datasets are amassed, and with the advent of new technologies such as RNA sequencing and exon arrays (2).…”

mentioning

confidence: 99%

Correction for hidden confounders in the genetic analysis of gene expression

Listgarten, Kadie, Schadt et al. 2010

Proc. Natl. Acad. Sci. U.S.A.


Understanding the genetic underpinnings of disease is important for screening, treatment, drug development, and basic biological insight. One way of getting at such an understanding is to find out which parts of our DNA, such as single-nucleotide polymorphisms, affect particular intermediary processes such as gene expression. Naively, such associations can be identified using a simple statistical test on all paired combinations of genetic variants and gene transcripts. However, a wide variety of confounders lie hidden in the data, leading to both spurious associations and missed associations if not properly addressed. We present a statistical model that jointly corrects for two particular kinds of hidden structure: population structure (e.g., race, family-relatedness), and microarray expression artifacts (e.g., batch effects), when these confounders are unknown. Applying our method to both real and synthetic, human and mouse data, we demonstrate the need for such a joint correction of confounders, and also the disadvantages of other possible approaches based on those in the current literature. In particular, we show that our class of models has maximum power to detect eQTL on synthetic data, and has the best performance on a bronze standard applied to real data. Lastly, our software and the associations we found with it are available at http://www.microsoft.com/science.

Keywords: differential expression | genome wide association | microarray | population structure | expression heterogeneity
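The "simple statistical test on all paired combinations" mentioned in this abstract can be illustrated with a minimal sketch. It assumes Pearson correlation as the test statistic and uses synthetic data; the function name `naive_eqtl_scan`, the array shapes, and the planted effect size are all hypothetical choices for illustration, not the authors' method, and the sketch applies none of the confounder correction the abstract argues is needed.

```python
import numpy as np

def naive_eqtl_scan(genotypes, expression):
    """Pearson correlation of every SNP column with every transcript column.

    genotypes:  (n_samples, n_snps) array of allele counts (0, 1, 2)
    expression: (n_samples, n_genes) array of expression levels
    Returns an (n_snps, n_genes) matrix of correlation coefficients.
    """
    g = np.asarray(genotypes, dtype=float)
    e = np.asarray(expression, dtype=float)
    # Standardize each column so that a scaled dot product gives Pearson r.
    g = (g - g.mean(axis=0)) / g.std(axis=0)
    e = (e - e.mean(axis=0)) / e.std(axis=0)
    return g.T @ e / g.shape[0]

# Synthetic data (assumption): 200 samples, 5 SNPs, 4 transcripts.
rng = np.random.default_rng(0)
n = 200
genotypes = rng.integers(0, 3, size=(n, 5))
expression = rng.normal(size=(n, 4))
# Plant one true association: SNP 3 shifts the expression of transcript 2.
expression[:, 2] += 1.5 * genotypes[:, 3]

r = naive_eqtl_scan(genotypes, expression)
# Locate the strongest SNP-transcript pair by absolute correlation.
snp, gene = np.unravel_index(np.abs(r).argmax(), r.shape)
print(f"strongest pair: SNP {snp}, transcript {gene}, r = {r[snp, gene]:.2f}")
```

With independent samples and no batch effects, as simulated here, the planted pair stands out clearly. Population structure or expression artifacts would add correlations that this scan cannot tell apart from true associations, which is the problem the abstract goes on to address.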


“…Bayesian network reconstruction (11, 12) is a powerful approach for simultaneously considering thousands of molecular or clinical variables and for identifying patterns of causal relationships between these variables in a completely data-driven fashion. We developed a way to overcome the chief limitation of this approach, deriving predictive models from correlation data (11, 12, 35), by leveraging DNA variation as a systematic source of perturbation (32). The resulting probabilistic causal networks are critical for understanding the behavior of any one gene in the context of human disease, because individual genes operate in molecular networks that define disease-associated biological and pathological events.…”

Section: New Medicine (mentioning)

confidence: 99%

NEW: Network-Enabled Wisdom in Biology, Medicine, and Health Care

2012


Complete repertoires of molecular activity in and between tissues provided by new high-dimensional "omics" technologies hold great promise for characterizing human physiology at all levels of biological hierarchies. The combined effects of genetic and environmental perturbations at any level of these hierarchies can lead to vicious cycles of pathology and complex systemic diseases. The challenge lies in extracting all relevant information from the rapidly increasing volumes of omics data and translating this information first into knowledge and ultimately into wisdom that can yield clinically actionable results. Here, we discuss how molecular networks are central to the implementation of this new biology in medicine and translation to preventive and personalized health care.
