Tianyi Zhao scite author profile

Identification of new drug–target interactions (DTIs) is an important but a time-consuming and costly step in drug discovery. In recent years, to mitigate these drawbacks, researchers have sought to identify DTIs using computational approaches. However, most existing methods construct drug networks and target networks separately, and then predict novel DTIs based on known associations between the drugs and targets without accounting for associations between drug–protein pairs (DPPs). To incorporate the associations between DPPs into DTI modeling, we built a DPP network based on multiple drugs and proteins in which DPPs are the nodes and the associations between DPPs are the edges of the network. We then propose a novel learning-based framework, ‘graph convolutional network (GCN)-DTI’, for DTI identification. The model first uses a graph convolutional network to learn the features for each DPP. Second, using the feature representation as an input, it uses a deep neural network to predict the final label. The results of our analysis show that the proposed framework outperforms some state-of-the-art approaches by a large margin.

show abstract

DeepLGP: a novel deep learning method for prioritizing lncRNA target genes

Zhao

Peng

et al. 2020

114

View full text Add to dashboard Cite

Motivation Although long non-coding RNAs (lncRNAs) have limited capacity for encoding proteins, they have been verified as biomarkers in the occurrence and development of complex diseases. Recent wet-lab experiments have shown that lncRNAs function by regulating the expression of protein-coding genes (PCGs), which could also be the mechanism responsible for causing diseases. Currently, lncRNA-related biological data is increasing rapidly. Whereas, no computational methods have been designed for predicting the novel target genes of lncRNA. Results In this study, we present a graph convolutional network (GCN) based method, named DeepLGP, for prioritizing target PCGs of lncRNA. First, gene and lncRNA features were selected, these included their location in the genome, expression in 13 tissues, and miRNA-mediated lncRNA-gene pairs. Next, GCN was applied to convolve a gene interaction network for encoding the features of genes and lncRNAs. Then, these features were used by the convolutional neural network (CNN) for prioritizing target genes of lncRNAs. In 10-cross validations on two independent datasets, DeepLGP obtained high AUCs (0.90, 0.98) and AUPRs (0.91, 0.98). We found that lncRNA pairs with high similarity had more overlapped target genes. Further experiments showed that genes targeted by the same lncRNA sets had a strong likelihood of causing the same diseases, which could help in identifying disease-causing PCGs. Availability and implementation https://github.com/zty2009/LncRNA-target-gene Supplementary information Supplementary data are available at Bioinformatics online.

show abstract

Integrate GWAS, eQTL, and mQTL Data to Identify Alzheimer’s Disease-Related Genes

Zhao

Zang

et al. 2019

Front. Genet.

View full text Add to dashboard Cite

It is estimated that the impact of related genes on the risk of Alzheimer’s disease (AD) is nearly 70%. Identifying candidate causal genes can help treatment and diagnosis. The maturity of sequencing technology and the reduction of cost make genome-wide association study (GWAS) become an important means to find disease-related mutation sites. Because of linkage disequilibrium (LD), neither the gene regulated by SNP nor the specific SNP can be determined. Because GWAS is affected by sample size and interaction, we introduced empirical Bayes (EB) to make a meta-analysis of GWAS to greatly eliminate the bias caused by sample and the interaction of SNP. In addition, most SNPs are in the noncoding region, so it is not clear how they relate to phenotype. In this paper, expression quantitative trait locus (eQTL) studies and methylation quantitative trait locus (mQTL) studies are combined with GWAS to find the genes associated with Alzheimer disease in expression levels by pleiotropy. Summary data-based Mendelian randomization (SMR) is introduced to integrate GWAS and eQTL/mQTL data. Finally, we prioritized 274 significant SNPs, which belong to 20 genes by eQTL analysis and 379 significant SNPs, which belong to seven known genes by mQTL. Among them, 93 SNPs and 2 genes are overlapped. Finally, we did 10 case studies to prove the effectiveness of our method.

show abstract

SC2disease: a manually curated database of single-cell transcriptome for human diseases

Zhao

Lyu

et al. 2020

View full text Add to dashboard Cite

SC2disease (http://easybioai.com/sc2disease/) is a manually curated database that aims to provide a comprehensive and accurate resource of gene expression profiles in various cell types for different diseases. With the development of single-cell RNA sequencing (scRNA-seq) technologies, uncovering cellular heterogeneity of different tissues for different diseases has become feasible by profiling transcriptomes across cell types at the cellular level. In particular, comparing gene expression profiles between different cell types and identifying cell-type-specific genes in various diseases offers new possibilities to address biological and medical questions. However, systematic, hierarchical and vast databases of gene expression profiles in human diseases at the cellular level are lacking. Thus, we reviewed the literature prior to March 2020 for studies which used scRNA-seq to study diseases with human samples, and developed the SC2disease database to summarize all the data by different diseases, tissues and cell types. SC2disease documents 946 481 entries, corresponding to 341 cell types, 29 tissues and 25 diseases. Each entry in the SC2disease database contains comparisons of differentially expressed genes between different cell types, tissues and disease-related health status. Furthermore, we reanalyzed gene expression matrix by unified pipeline to improve the comparability between different studies. For each disease, we also compare cell-type-specific genes with the corresponding genes of lead single nucleotide polymorphisms (SNPs) identified in genome-wide association studies (GWAS) to implicate cell type specificity of the traits.

show abstract

Metabolomic Profiles and Heart Failure Risk in Black Adults: Insights From the Jackson Heart Study

Tahir

Katz

Zhao

et al. 2021

Circ: Heart Failure

View full text Add to dashboard Cite

Background: Heart failure (HF) is a heterogeneous disease characterized by significant metabolic disturbances; however, the breadth of metabolic dysfunction before the onset of overt disease is not well understood. The purpose of this study was to determine the association of circulating metabolites with incident HF to uncover novel metabolic pathways to disease. Methods: We performed targeted plasma metabolomic profiling in a deeply phenotyped group of Black adults from the JHS (Jackson Heart Study; n=2199). We related metabolites associated with incident HF to established etiological mechanisms, including increased left ventricular mass index and incident coronary heart disease. Furthermore, we evaluated differential associations of metabolites with HF with preserved ejection fraction versus HF with reduced ejection fraction. Results: Metabolites associated with incident HF included products of posttranscriptional modifications of RNA, as well as polyamine and nitric oxide metabolism. A subset of metabolite-HF associations was independent of well-established HF pathways such as increased left ventricular mass index and incident coronary heart disease and included homoarginine (per 1 SD increase in metabolite level, hazard ratio, 0.77; P =1.2×10 −3 ), diacetylspermine (hazard ratio, 1.34; P =3.4×10 −3 ), and uridine (hazard ratio, 0.79; P , 3×10 −4 ). Furthermore, metabolites involved in pyrimidine metabolism (orotic acid) and collagen turnover ( N -methylproline) among others were part of a distinct metabolic signature that differentiated individuals with HF with preserved ejection fraction versus HF with reduced ejection fraction. Conclusions: The integration of clinical phenotyping with plasma metabolomic profiling uncovered novel metabolic processes in nontraditional disease pathways underlying the heterogeneity of HF development in Black adults.

show abstract

Identifying diseases-related metabolites using random walk

Zhao

Zhang

et al. 2018

BMC Bioinformatics

View full text Add to dashboard Cite

BackgroundMetabolites disrupted by abnormal state of human body are deemed as the effect of diseases. In comparison with the cause of diseases like genes, these markers are easier to be captured for the prevention and diagnosis of metabolic diseases. Currently, a large number of metabolic markers of diseases need to be explored, which drive us to do this work.MethodsThe existing metabolite-disease associations were extracted from Human Metabolome Database (HMDB) using a text mining tool NCBO annotator as priori knowledge. Next we calculated the similarity of a pair-wise metabolites based on the similarity of disease sets of them. Then, all the similarities of metabolite pairs were utilized for constructing a weighted metabolite association network (WMAN). Subsequently, the network was utilized for predicting novel metabolic markers of diseases using random walk.ResultsTotally, 604 metabolites and 228 diseases were extracted from HMDB. From 604 metabolites, 453 metabolites are selected to construct the WMAN, where each metabolite is deemed as a node, and the similarity of two metabolites as the weight of the edge linking them. The performance of the network is validated using the leave one out method. As a result, the high area under the receiver operating characteristic curve (AUC) (0.7048) is achieved. The further case studies for identifying novel metabolites of diabetes mellitus were validated in the recent studies.ConclusionIn this paper, we presented a novel method for prioritizing metabolite-disease pairs. The superior performance validates its reliability for exploring novel metabolic markers of diseases.

show abstract

HD-MTL: Hierarchical Deep Multi-Task Learning for Large-Scale Visual Recognition

Fan

Zhao

Kuang

et al. 2017

IEEE Trans. on Image Process.

View full text Add to dashboard Cite

In this paper, a hierarchical deep multi-task learning (HD-MTL) algorithm is developed to support large-scale visual recognition (e.g., recognizing thousands or even tens of thousands of atomic object classes automatically). First, multiple sets of multi-level deep features are extracted from different layers of deep convolutional neural networks (deep CNNs), and they are used to achieve more effective accomplishment of the coarseto- fine tasks for hierarchical visual recognition. A visual tree is then learned by assigning the visually-similar atomic object classes with similar learning complexities into the same group, which can provide a good environment for determining the interrelated learning tasks automatically. By leveraging the inter-task relatedness (inter-class similarities) to learn more discriminative group-specific deep representations, our deep multi-task learning algorithm can train more discriminative node classifiers for distinguishing the visually-similar atomic object classes effectively. Our hierarchical deep multi-task learning (HD-MTL) algorithm can integrate two discriminative regularization terms to control the inter-level error propagation effectively, and it can provide an end-to-end approach for jointly learning more representative deep CNNs (for image representation) and more discriminative tree classifier (for large-scale visual recognition) and updating them simultaneously. Our incremental deep learning algorithms can effectively adapt both the deep CNNs and the tree classifier to the new training images and the new object classes. Our experimental results have demonstrated that our HD-MTL algorithm can achieve very competitive results on improving the accuracy rates for large-scale visual recognition.

show abstract

Deep-DRM: a computational method for identifying disease-related metabolites based on graph deep learning approaches

Zhao¹,

Hu²,

Liang

2020

View full text Add to dashboard Cite

Motivation: The functional changes of the genes, RNAs and proteins will eventually be reflected in the metabolic level. Increasing number of researchers have researched mechanism, biomarkers and targeted drugs by metabolites. However, compared with our knowledge about genes, RNAs, and proteins, we still know few about diseases-related metabolites. All the few existed methods for identifying diseases-related metabolites ignore the chemical structure of metabolites, fail to recognize the association pattern between metabolites and diseases, and fail to apply to isolated diseases and metabolites. Results: In this study, we present a graph deep learning based method, named Deep-DRM, for identifying diseases-related metabolites. First, chemical structures of metabolites were used to calculate similarities of metabolites. The similarities of diseases were obtained based on their functional gene network and semantic associations. Therefore, both metabolites and diseases network could be built. Next, Graph Convolutional Network (GCN) was applied to encode the features of metabolites and diseases, respectively. Then, the dimension of these features was reduced by Principal components analysis (PCA) with retainment 99% information. Finally, Deep neural network was built for identifying true metabolite-disease pairs (MDPs) based on these features. The 10-cross validations on three testing setups showed outstanding AUC (0.952) and AUPR (0.939) of Deep-DRM compared with previous methods and similar approaches. Ten of top 15 predicted associations between diseases and metabolites got support by other studies, which suggests that Deep-DRM is an efficient method to identify MDPs. Contact: liangcheng@hrbmu.edu.cn. Availability and implementation: https://github.com/zty2009/GPDNN-for-Identify-ing-Disease-related-Metabolites.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Tianyi Zhao

Identifying drug–target interactions based on graph convolutional network and deep neural network

DeepLGP: a novel deep learning method for prioritizing lncRNA target genes

Integrate GWAS, eQTL, and mQTL Data to Identify Alzheimer’s Disease-Related Genes

SC2disease: a manually curated database of single-cell transcriptome for human diseases

Metabolomic Profiles and Heart Failure Risk in Black Adults: Insights From the Jackson Heart Study

Identifying diseases-related metabolites using random walk

HD-MTL: Hierarchical Deep Multi-Task Learning for Large-Scale Visual Recognition

Deep-DRM: a computational method for identifying disease-related metabolites based on graph deep learning approaches

Contact Info

Product

Resources

About