DeepCDA: deep cross-domain compound–protein affinity prediction through LSTM and convolutional neural networks

Abbasi, Karim; Razzaghi, Parvin; Poso, Antti; Amanlou, Massoud; Ghasemi, Jahan B.; Masoudi‐Nejad, Ali

doi:10.1093/bioinformatics/btaa544

Cited by 145 publications

(106 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…(i) Convolutional layer: The convolutional layer is a major building block of CNN, which contains a set of learnable filters where each filter is convolved with the input of the layer to encode the local knowledge of the small receptive field. This process helps conserve the dimensional relationship between numeric values in the vectors [ 33 ]. Thus, a 1D convolutional layer was used to construct a convolution kernel and then derive features encoded in the embedding layer [ 34 ].…”

Section: Methodsmentioning

confidence: 99%

DeepIPs: comprehensive assessment and computational identification of phosphorylation sites of SARS-CoV-2 infection using a deep learning-based approach

Dao

Zulfiqar

et al. 2021

Briefings in Bioinformatics

View full text Add to dashboard Cite

The rapid spread of SARS-CoV-2 infection around the globe has caused a massive health and socioeconomic crisis. Identification of phosphorylation sites is an important step for understanding the molecular mechanisms of SARS-CoV-2 infection and the changes within the host cells pathways. In this study, we present DeepIPs, a first specific deep-learning architecture to identify phosphorylation sites in host cells infected with SARS-CoV-2. DeepIPs consists of the most popular word embedding method and convolutional neural network-long short-term memory network architecture to make the final prediction. The independent test demonstrates that DeepIPs improves the prediction performance compared with other existing tools for general phosphorylation sites prediction. Based on the proposed model, a web-server called DeepIPs was established and is freely accessible at http://lin-group.cn/server/DeepIPs . The source code of DeepIPs is freely available at the repository https://github.com/linDing-group/DeepIPs .

show abstract

Section: Methodsmentioning

confidence: 99%

DeepIPs: comprehensive assessment and computational identification of phosphorylation sites of SARS-CoV-2 infection using a deep learning-based approach

Dao

Zulfiqar

et al. 2021

Briefings in Bioinformatics

View full text Add to dashboard Cite

show abstract

“…The protein is commonly encoded through sequence-based models. DeepDTA [11], DeepConv-DTI [9], GraphDTA [10], Tsubaki et al [5], MT-DTI [12] and TransformerCPI [3] apply September 29, 2021 2/15 1D-CNN layers to encode protein sequences, while DeepAffinity [1] and DeepCDA [7] combine 1D-CNN layers with recurrent neural network (RNN) or long short-term memory (LSTM) layers, respectively. The compound is encoded with sequence-based or graph-based models, depending on the input information.…”

Section: Some Of the Most Readily Available Data Representations In Cpi Datasets Arementioning

confidence: 99%

“…Given a compound-protein pair, CPI prediction methods aim to predict a binary value indicating whether the compound and the protein interact [3][4][5][6], a numeric value indicating their binding affinity [1,[7][8][9][10][11][12], or identify binding sites for a specific compound within the protein [13][14][15][16]. Existing CPI prediction methods are diverse in September 29, 2021 1/15 terms of feature engineering and machine learning models.…”

Section: Introductionmentioning

confidence: 99%

Predicting compound-protein interaction using hierarchical graph convolutional networks

Bui-Thi

Rivière

Meysman

et al. 2021

Preprint

View full text Add to dashboard Cite

Motivation: Convolutional neural networks have enabled unprecedented breakthroughs in a variety of computer vision tasks. They have also drawn much attention from other domains, including drug discovery and drug development. In this study, we develop a computational method based on convolutional neural networks to tackle a fundamental question in drug discovery and development, i.e. the prediction of compound-protein interactions based on compound structure and protein sequence. We propose a hierarchical graph convolutional network (HGCN) to encode small molecules. The HGCN aggregates a molecule embedding from substructure embeddings, which are synthesized from atom embeddings. As small molecules usually share substructures, computing a molecule embedding from those common substructures allows us to learn better generic models. We then combined the HGCN with a one-dimensional convolutional network to construct a complete model for predicting compound-protein interactions. Furthermore we apply an explanation technique, Grad-CAM, to visualize the contribution of each amino acid into the prediction. Results: Experiments using different datasets show the improvement of our model compared to other GCN-based methods and a sequence based method, DeepDTA, in predicting compound-protein interactions. Each prediction made by the model is also explainable and can be used to identify critical residues mediating the interaction. Availability and implementation: https://github.com/banhdzui/cpi_hgcn.git

show abstract

“…Overall, until recently, three types of machine learning methods, including supervised, semi-supervised, and unsupervised have been applied to the scope of drug discovery processes [15]. It has been also shown that some modified and improved versions of the present approaches such as deep neural networks could yield better predictive models [16], and overfitting and insufficient amounts of data have been the main challenges in front of the researchers who have been engaged in generating an appropriate predictive model [17][18][19]. ii) The theory-based researches: normally, researchers formulate the relationship among the biological entities based on the numerical and experimental experiences [20,21].…”

Section: Introductionmentioning

confidence: 99%

Synthetic repurposing of drugs against hypertension: a datamining method based on association rules and a novel discrete algorithm

Masoudi-Sobhanzadeh

Masoudi‐Nejad

2020

BMC Bioinformatics

Self Cite

View full text Add to dashboard Cite

Background: Drug repurposing aims to detect the new therapeutic benefits of the existing drugs and reduce the spent time and cost of the drug development projects. The synthetic repurposing of drugs may prove to be more useful than the single repurposing in terms of reducing toxicity and enhancing efficacy. However, the researchers have not given it serious consideration. To address the issue, a novel datamining method is introduced and applied to repositioning of drugs for hypertension (HT) which is a serious medical condition and needs some improved treatment plans to help treat it. Results: A novel two-step data mining method, which is based on the If-Then association rules as well as a novel discrete optimization algorithm, was introduced and applied to the synthetic repurposing of drugs for HT. The required data were also extracted from DrugBank, KEGG, and DrugR+ databases. The findings indicated that based on the different statistical criteria, the proposed method outperformed the other state-of-the-art approaches. In contrast to the previously proposed methods which had failed to discover a list on some datasets, our method could find a combination list for all of them. Conclusion: Since the proposed synthetic method uses medications in small dosages, it might revive some failed drug development projects and put forward a suitable plan for treating different diseases such as COVID-19 and HT. It is also worth noting that applying efficient computational methods helps to produce better results.

show abstract

DeepCDA: deep cross-domain compound–protein affinity prediction through LSTM and convolutional neural networks

Cited by 145 publications

References 34 publications

DeepIPs: comprehensive assessment and computational identification of phosphorylation sites of SARS-CoV-2 infection using a deep learning-based approach

DeepIPs: comprehensive assessment and computational identification of phosphorylation sites of SARS-CoV-2 infection using a deep learning-based approach

Predicting compound-protein interaction using hierarchical graph convolutional networks

Synthetic repurposing of drugs against hypertension: a datamining method based on association rules and a novel discrete algorithm

Contact Info

Product

Resources

About