2022
DOI: 10.3389/fgene.2022.859188
|View full text |Cite
|
Sign up to set email alerts
|

DTI-BERT: Identifying Drug-Target Interactions in Cellular Networking Based on BERT and Deep Learning Method

Abstract: Drug–target interactions (DTIs) are regarded as an essential part of genomic drug discovery, and computational prediction of DTIs can accelerate to find the lead drug for the target, which can make up for the lack of time-consuming and expensive wet-lab techniques. Currently, many computational methods predict DTIs based on sequential composition or physicochemical properties of drug and target, but further efforts are needed to improve them. In this article, we proposed a new sequence-based method for accurat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 67 publications
(90 reference statements)
0
4
0
Order By: Relevance
“…However, accurately predicting the complete 3D structure of a protein solely from its sequence using BERT-based models remains a complex and challenging problem [ 17 , 38 , 39 , 40 , 41 ]. While BERT-based models are not specifically designed to provide high-resolution 3D structural predictions similar to AlphaFold [ 42 ], they showed promise in capturing local structural features such as secondary structure (alpha-helices and beta-sheets), solvent accessibility, and other properties that can be inferred from sequence patterns [ 4 , 43 ]. In this study, we employ the ProtBert, a large protein language model, to extract contextual features from protein sequences utilizing transfer learning ( Figure 1 B).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…However, accurately predicting the complete 3D structure of a protein solely from its sequence using BERT-based models remains a complex and challenging problem [ 17 , 38 , 39 , 40 , 41 ]. While BERT-based models are not specifically designed to provide high-resolution 3D structural predictions similar to AlphaFold [ 42 ], they showed promise in capturing local structural features such as secondary structure (alpha-helices and beta-sheets), solvent accessibility, and other properties that can be inferred from sequence patterns [ 4 , 43 ]. In this study, we employ the ProtBert, a large protein language model, to extract contextual features from protein sequences utilizing transfer learning ( Figure 1 B).…”
Section: Methodsmentioning
confidence: 99%
“…Structure-based methods and ligand-based methods are the two main existing computational approaches for identifying drug–target interactions [ 4 ]. In structure-based strategies such as molecular docking, the three-dimensional (3D) structures of proteins and chemical compounds are utilized to explore potential binding poses at the atom level and identify binding affinities [ 5 ].…”
Section: Introductionmentioning
confidence: 99%
“…PPI databases can be migrated into PepPI predictions to address the lack of training samples due to inefficiencies and false positives, also known as the sparse distribution of training samples. [39,40] Currently, there are several available databases that provide PPIs and PepPIs and most of them are made up of sequence information, reference literature information, interactions, taxonomy, annotation, etc. This section introduces the most commonly used public databases of PPIs and PepPIs, and presents a brief list in Table 1.…”
Section: Databases For Ppi and Peppi Predictionsmentioning
confidence: 99%
“…Once the benchmark dataset has been prepared for the study, the next important step is formulating the samples and extracting the best feature set for constructing a robust and superior computational model. In recent years, various feature encoding strategies have been used to form biological sequence fragments, such as PseKNC (17), One-hot (21,22), physicochemical features, and word2vec (23)(24)(25). This study selected some of the most common feature encoding approaches, including six physicochemical feature encoding strategies and the frequency of occurrence of k-nearest neighbor nucleic acids, to describe RNA fragments.…”
Section: Sample Formulationmentioning
confidence: 99%