Helian Feng scite author profile

The breast cancer risk variants identified in genome-wide association studies explain only a small fraction of the familial relative risk, and the genes responsible for these associations remain largely unknown. To identify novel risk loci and likely causal genes, we performed a transcriptome-wide association study evaluating associations of genetically predicted gene expression with breast cancer risk in 122,977 cases and 105,974 controls of European ancestry. We used data from the Genotype-Tissue Expression Project to establish genetic models to predict gene expression in breast tissue and evaluated model performance using data from The Cancer Genome Atlas. Of the 8,597 genes evaluated, significant associations were identified for 48 at a Bonferroni-corrected threshold of P < 5.82 × 10, including 14 genes at loci not yet reported for breast cancer. We silenced 13 genes and showed an effect for 11 on cell proliferation and/or colony-forming efficiency. Our study provides new insights into breast cancer genetics and biology.

show abstract

Transcriptome-wide association studies accounting for colocalization using Egger regression

Barfield

Feng

Gusev

et al. 2017

Preprint

View full text Add to dashboard Cite

Integrating genome-wide association (GWAS) and expression quantitative trait locus (eQTL) data into transcriptome-wide association studies (TWAS) based on predicted expression can boost power to detect novel disease loci or pinpoint the susceptibility gene at a known disease locus. However, it is often the case that multiple eQTL genes colocalize at disease loci, making the identification of the true susceptibility gene challenging, due to confounding through linkage disequilibrium (LD). To distinguish between true susceptibility genes (where the genetic effect on phenotype is mediated through expression) and colocalization due to LD, we examine an extension of the Mendelian Randomization Egger regression method that allows for LD while only requiring summary association data for both GWAS and eQTL. We derive the standard TWAS approach in the context of Mendelian Randomization and show in simulations that the standard TWAS does not control Type I error for causal gene identification when eQTLs have pleiotropic or LD-confounded effects on disease. In contrast, LD Aware MR-Egger regression can control Type I error in this case while attaining similar power as other methods in situations where these provide valid tests. However, when the direct effects of genetic variants on traits are correlated with the eQTL associations, all of the methods we examined including LD Aware MR-Egger regression can have inflated Type I error. We illustrate these methods by integrating gene expression within a recent large-scale breast cancer GWAS to provide guidance on susceptibility gene identification.

show abstract

Transcriptome‐wide association studies accounting for colocalization using Egger regression

Barfield

Feng

Gusev

et al. 2018

Genetic Epidemiology

View full text Add to dashboard Cite

Integrating genome-wide association (GWAS) and expression quantitative trait locus (eQTL) data into transcriptome-wide association studies (TWAS) based on predicted expression can boost power to detect novel disease loci or pinpoint the susceptibility gene at a known disease locus. However, it is often the case that multiple eQTL genes colocalize at disease loci, making the identification of the true susceptibility gene challenging, due to confounding through linkage disequilibrium (LD). To distinguish between true susceptibility genes (where the genetic effect on phenotype is mediated through expression) and colocalization due to LD, we examine an extension of the Mendelian randomization (MR) egger regression method that allows for LD while only requiring summary association data for both GWAS and eQTL. We derive the standard TWAS approach in the context of MR and show in simulations that the standard TWAS does not control type I error for causal gene identification when eQTLs have pleiotropic or LD-confounded effects on disease. In contrast, LD-aware MR-Egger (LDA MR-Egger) regression can control type I error in this case while attaining similar power as other methods in situations where these provide valid tests. However, when the direct effects of genetic variants on traits are correlated with the eQTL associations, all of the methods we examined including LDA MR-Egger regression can have inflated type I error. We illustrate these methods by integrating gene expression within a recent large-scale breast cancer GWAS to provide guidance on susceptibility gene identification.

show abstract

Leveraging expression from multiple tissues using sparse canonical correlation analysis and aggregate tests improves the power of transcriptome-wide association studies

et al. 2021

View full text Add to dashboard Cite

Transcriptome-wide association studies (TWAS) test the association between traits and genetically predicted gene expression levels. The power of a TWAS depends in part on the strength of the correlation between a genetic predictor of gene expression and the causally relevant gene expression values. Consequently, TWAS power can be low when expression quantitative trait locus (eQTL) data used to train the genetic predictors have small sample sizes, or when data from causally relevant tissues are not available. Here, we propose to address these issues by integrating multiple tissues in the TWAS using sparse canonical correlation analysis (sCCA). We show that sCCA-TWAS combined with single-tissue TWAS using an aggregate Cauchy association test (ACAT) outperforms traditional single-tissue TWAS. In empirically motivated simulations, the sCCA+ACAT approach yielded the highest power to detect a gene associated with phenotype, even when expression in the causal tissue was not directly measured, while controlling the Type I error when there is no association between gene expression and phenotype. For example, when gene expression explains 2% of the variability in outcome, and the GWAS sample size is 20,000, the average power difference between the ACAT combined test of sCCA features and single-tissue, versus single-tissue combined with Generalized Berk-Jones (GBJ) method, single-tissue combined with S-MultiXcan, UTMOST, or summarizing cross-tissue expression patterns using Principal Component Analysis (PCA) approaches was 5%, 8%, 5% and 38%, respectively. The gain in power is likely due to sCCA cross-tissue features being more likely to be detectably heritable. When applied to publicly available summary statistics from 10 complex traits, the sCCA+ACAT test was able to increase the number of testable genes and identify on average an additional 400 additional gene-trait associations that single-trait TWAS missed. Our results suggest that aggregating eQTL data across multiple tissues using sCCA can improve the sensitivity of TWAS while controlling for the false positive rate.

show abstract

AAV-Txnip prolongs cone survival and vision in mouse models of retinitis pigmentosa

Xue

Wang

Rana

et al. 2021

View full text Add to dashboard Cite

Retinitis pigmentosa (RP) is an inherited retinal disease, affecting >20 million people worldwide. Loss of daylight vision typically occurs due to the dysfunction/loss of cone photoreceptors, the cell type that initiates our color and high acuity vision. Currently, there is no effective treatment for RP, other than gene therapy for a limited number of specific disease genes. To develop a disease gene-agnostic therapy, we screened 20 genes for their ability to prolong cone photoreceptor survival in vivo. Here, we report an adeno-associated virus (AAV) vector expressing Txnip, which prolongs the survival of cone photoreceptors and improves visual acuity in RP mouse models. A Txnip allele, C247S, which blocks the association of Txnip with thioredoxin, provides an even greater benefit. Additionally, the rescue effect of Txnip depends on lactate dehydrogenase b (Ldhb), and correlates with the presence of healthier mitochondria, suggesting that Txnip saves RP cones by enhancing their lactate catabolism.

show abstract

Transcriptome‐wide association study of breast cancer risk by estrogen‐receptor status

Feng

Gusev

Paşaniuc

et al. 2020

Genetic Epidemiology

View full text Add to dashboard Cite

Previous transcriptome‐wide association studies (TWAS) have identified breast cancer risk genes by integrating data from expression quantitative loci and genome‐wide association studies (GWAS), but analyses of breast cancer subtype‐specific associations have been limited. In this study, we conducted a TWAS using gene expression data from GTEx and summary statistics from the hitherto largest GWAS meta‐analysis conducted for breast cancer overall, and by estrogen receptor subtypes (ER+ and ER−). We further compared associations with ER+ and ER− subtypes, using a case‐only TWAS approach. We also conducted multigene conditional analyses in regions with multiple TWAS associations. Two genes, STXBP4 and HIST2H2BA, were specifically associated with ER+ but not with ER– breast cancer. We further identified 30 TWAS‐significant genes associated with overall breast cancer risk, including four that were not identified in previous studies. Conditional analyses identified single independent breast‐cancer gene in three of six regions harboring multiple TWAS‐significant genes. Our study provides new information on breast cancer genetics and biology, particularly about genomic differences between ER+ and ER− breast cancer.

show abstract

Leveraging expression from multiple tissues using sparse canonical correlation analysis and aggregate tests improve the power of transcriptome-wide association studies

Feng

Mancuso

Gusev

et al. 2020

Preprint

View full text Add to dashboard Cite

AbstractTranscriptome-wide association studies (TWAS) test the association between traits and genetically predicted gene expression levels. The power of a TWAS depends in part on the strength of the correlation between a genetic predictor of gene expression and the causally relevant gene expression values. Consequently, TWAS power can be low when expression quantitative trait locus (eQTL) data used to train the genetic predictors have small sample sizes, or when data from causally relevant tissues are not available. Here, we propose to address these issues by integrating multiple tissues in the TWAS using sparse canonical correlation analysis (sCCA). We show that sCCA-TWAS combined with single-tissue TWAS using an aggregate Cauchy association test (ACAT) outperforms traditional single-tissue TWAS. In empirically motivated simulations, the sCCA+ACAT approach yielded the highest power to detect a gene associated with phenotype, even when expression in the causal tissue was not directly measured, while controlling the Type I error when there is no association between gene expression and phenotype. For example, when gene expression explains 2% of the variability in outcome, and the GWAS sample size is 20,000, the average power difference between the ACAT combined test of sCCA features and single-tissue, versus single-tissue combined with Generalized Berk-Jones (GBJ) method, single-tissue combined with S-MultiXcan or summarizing cross-tissue expression patterns using Principal Component Analysis (PCA) approaches was 5%, 8%, and 38%, respectively. The gain in power is likely due to sCCA cross-tissue features being more likely to be detectably heritable. When applied to publicly available summary statistics from 10 complex traits, the sCCA+ACAT test was able to increase the number of testable genes and identify on average an additional 400 additional gene-trait associations that single-trait TWAS missed. Our results suggest that aggregating eQTL data across multiple tissues using sCCA can improve the sensitivity of TWAS while controlling for the false positive rate.Author summaryTranscriptome-wide association studies (TWAS) can improve the statistical power of genetic association studies by leveraging the relationship between genetically predicted transcript expression levels and an outcome. We propose a new TWAS pipeline that integrates data on the genetic regulation of expression levels across multiple tissues. We generate cross-tissue expression features using sparse canonical correlation analysis and then combine evidence for expression-outcome association across cross- and single-tissue features using the aggregate Cauchy association test. We show that this approach has substantially higher power than traditional single-tissue TWAS methods. Application of these methods to publicly available summary statistics for ten complex traits also identifies associations missed by single-tissue methods.

show abstract

Endocrine resistance in breast cancer

Zheng¹,

Zhao

Feng

et al. 2013

Climacteric

View full text Add to dashboard Cite

Selective estrogen receptor modulators (SERMs) are synthetic molecules which bind to estrogen receptors (ER) and can modulate their transcriptional capabilities in different ways in diverse estrogen target tissues. Unfortunately, the use of resistant therapy is associated with acquired resistance. Several molecular mechanisms have been proposed to be responsible for endocrine resistance in breast cancer, including MIR-451, FGF and FGFR, ADAM12, fibronectin and other soluble stromal factors, PELP1-KDM1, HER2, NOTCH, δEF1, mTOR, AKT/mTOR, Pi3K/AKT, Pi3K/AKT/mTOR, NFκB, LMTK3, IGF1R, cyclin E2, IRF1, Tab2, and SRC-1. Further research is needed to know more about endocrine resistance.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Helian Feng

A transcriptome-wide association study of 229,000 women identifies new candidate susceptibility genes for breast cancer

Transcriptome-wide association studies accounting for colocalization using Egger regression

Transcriptome‐wide association studies accounting for colocalization using Egger regression

Leveraging expression from multiple tissues using sparse canonical correlation analysis and aggregate tests improves the power of transcriptome-wide association studies

AAV-Txnip prolongs cone survival and vision in mouse models of retinitis pigmentosa

Transcriptome‐wide association study of breast cancer risk by estrogen‐receptor status

Leveraging expression from multiple tissues using sparse canonical correlation analysis and aggregate tests improve the power of transcriptome-wide association studies

Endocrine resistance in breast cancer

Contact Info

Product

Resources

About