2019
DOI: 10.1093/bioinformatics/btz721
|View full text |Cite
|
Sign up to set email alerts
|

DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites

Abstract: Motivation Proteases are enzymes that cleave target substrate proteins by catalyzing the hydrolysis of peptide bonds between specific amino acids. While the functional proteolysis regulated by proteases plays a central role in the ‘life and death’ cellular processes, many of the corresponding substrates and their cleavage sites were not found yet. Availability of accurate predictors of the substrates and cleavage sites would facilitate understanding of proteases’ functions and physiological r… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
46
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
6
4

Relationship

2
8

Authors

Journals

citations
Cited by 101 publications
(47 citation statements)
references
References 51 publications
1
46
0
Order By: Relevance
“…Although our proposed method showed improved performance over other methods, it still has room for improvement. Recently, several novel computational approaches have been proposed in computational biology [68,78,[80][81][82][83][84][85] to identify function from the sequence. Hence, developing a novel prediction model by utilizing such approaches may improve the prediction performance.…”
Section: Resultsmentioning
confidence: 99%
“…Although our proposed method showed improved performance over other methods, it still has room for improvement. Recently, several novel computational approaches have been proposed in computational biology [68,78,[80][81][82][83][84][85] to identify function from the sequence. Hence, developing a novel prediction model by utilizing such approaches may improve the prediction performance.…”
Section: Resultsmentioning
confidence: 99%
“…To evaluate the prediction performance of XG-m7G we used four metrics, that is, Sn, Sp, Acc, and MCC, which have previously been used to assess the performance of predictors in other studies. 33 , 34 We also used ROC curves, 35 , 36 , 37 , 38 which plot the true-positive rate against the false-positive rate, and AUC to further assess the model performance. Sn, Sp, Acc, and MCC are defined as follows: where represents the total number of m7G site-containing sequences, represents the total number of non-m7G sequences, represents the number of m7G site-containing sequences incorrectly predicted as non-m7G sequences, and represents the number of non-m7G sequences incorrectly predicted as m7G site-containing sequences.…”
Section: Methodsmentioning
confidence: 99%
“…To improve the performance of the NonClasGP model, an ensemble learning model was built in this study, which used majority voting to integrate the prediction results of the above ten individual models, each of which was built on the optimal feature combinations. The performance of the ensemble model NonClasGP-Pred was evaluated by five commonly used metrics [15,40,[46][47][48][49][50][51][52][53][54][55][56][57][58][59]…”
Section: Model Construction and Evaluationmentioning
confidence: 99%