2023
DOI: 10.1093/bib/bbad082
|View full text |Cite
|
Sign up to set email alerts
|

MCANet: shared-weight-based MultiheadCrossAttention network for drug–target interaction prediction

Abstract: Accurate and effective drug–target interaction (DTI) prediction can greatly shorten the drug development lifecycle and reduce the cost of drug development. In the deep-learning-based paradigm for predicting DTI, robust drug and protein feature representations and their interaction features play a key role in improving the accuracy of DTI prediction. Additionally, the class imbalance problem and the overfitting problem in the drug–target dataset can also affect the prediction accuracy, and reducing the consumpt… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
8
0

Year Published

2024
2024
2025
2025

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 16 publications
(9 citation statements)
references
References 32 publications
0
8
0
Order By: Relevance
“…MCANet [ 48 ]: This model proposes a shared weights-based PolyLoss cross-attention algorithm, which extracts the interaction features of drug-target pairs to highlight the features of binding regions for a more robust feature representation. The PolyLoss loss function is applied to alleviate the overfitting problem and category imbalance problem in the drug-target dataset.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…MCANet [ 48 ]: This model proposes a shared weights-based PolyLoss cross-attention algorithm, which extracts the interaction features of drug-target pairs to highlight the features of binding regions for a more robust feature representation. The PolyLoss loss function is applied to alleviate the overfitting problem and category imbalance problem in the drug-target dataset.…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…These challenges compel even the most advanced models to continue using methods such as the CNN, RNN, 23 and LSTM 24 to extract protein features from sequences, as seen in models like MolTrans, 25 DeepPurpose, 26 DrugBAN, 27 and MCANet. 28 Although the CNN can extract local aggregated features akin to graph structures, the emergence of the gradient vanishing problem results in some shallow prior knowledge often getting lost with the deepening of the model. Balancing the incorporation of global semantic and intrinsic biological background feature embeddings while considering richer local aggregated features has become a complex issue.…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, despite the significant progress AlphaFold from DeepMind has made in predicting protein 3D structures, recently generating predictions of 2 billion protein 3D structures from a million species, the reality is that the number of high-accuracy 3D structure proteins only account for a small fraction of known protein sequences. These challenges compel even the most advanced models to continue using methods such as the CNN, RNN, and LSTM to extract protein features from sequences, as seen in models like MolTrans, DeepPurpose, DrugBAN, and MCANet . Although the CNN can extract local aggregated features akin to graph structures, the emergence of the gradient vanishing problem results in some shallow prior knowledge often getting lost with the deepening of the model.…”
Section: Introductionmentioning
confidence: 99%
“…Some of the existing methods do not restrict their use of attention to only generating embeddings. DTITR [17] and MCANet [18] successfully demonstrated how cross-attention can be applied across the descriptors of protein targets and compounds for improved interaction predictions. Nevertheless, these methods rely solely on SMILES and AA sequences to generate descriptors for compounds and protein targets, respectively.…”
Section: Introductionmentioning
confidence: 99%