2023
DOI: 10.1093/bib/bbad261
|View full text |Cite
|
Sign up to set email alerts
|

HNSPPI: a hybrid computational model combing network and sequence information for predicting protein–protein interaction

Abstract: Most life activities in organisms are regulated through protein complexes, which are mainly controlled via Protein–Protein Interactions (PPIs). Discovering new interactions between proteins and revealing their biological functions are of great significance for understanding the molecular mechanisms of biological processes and identifying the potential targets in drug discovery. Current experimental methods only capture stable protein interactions, which lead to limited coverage. In addition, expensive cost and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(15 citation statements)
references
References 79 publications
0
15
0
Order By: Relevance
“…In previous works, computational scientists tried to access various public databases to obtain the interacting data (positive instances), which were mostly validated by experiments. To construct balanced datasets, they manually generated the same number of negative instances through subcellular localization 2,35,37 . However, these negative PPIs are only associated with a small portion of proteins (from 7.58% to 47.22%), indicating serious data distribution bias on most benchmarking datasets ( Table 1 ).…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…In previous works, computational scientists tried to access various public databases to obtain the interacting data (positive instances), which were mostly validated by experiments. To construct balanced datasets, they manually generated the same number of negative instances through subcellular localization 2,35,37 . However, these negative PPIs are only associated with a small portion of proteins (from 7.58% to 47.22%), indicating serious data distribution bias on most benchmarking datasets ( Table 1 ).…”
Section: Resultsmentioning
confidence: 99%
“…As mentioned in the section of “ Materials and Methods ”, most computational scientists tend to generate negative pairs by randomly pairing proteins with different subcellular localization 2,34,35,37 . Therefore, these representative benchmark PPI datasets are manually curated with an equal number of positive and negative samples.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…TAGPPI (8) incorporates both sequence features and predicted structural information and employs graph representation learning methods on contact maps to obtain 3D structure features of proteins. HNSPPI (9) adopts a feature fusion strategy of both network topology and sequence information for comprehensive feature extraction and employs a simple classifier for predictions, making it both lightweight and efficient. Graph-BERT (10) utilizes a language model-based embedding SeqVec to represent protein sequences and a graph convolutional neural network with the training strategy of subgraph batches using a top-k intimacy sampling approach.…”
Section: Introductionmentioning
confidence: 99%