2018
DOI: 10.1093/bioinformatics/bty746
|View full text |Cite
|
Sign up to set email alerts
|

Protease target prediction via matrix factorization

Abstract: Supplementary data are available at bioinformatics online.

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 42 publications
0
7
0
Order By: Relevance
“…Both ensemble tree classifiers outperformed logistic regression and SVC for all caspases by comparing all three metrics. All evaluated metrics (accuracy, AUC PRC, AUC ROC, MCC, sensitivity and specificity) for all investigated classifiers comparing predictive performance of these models for all selected proteases using window P4-P4' are shown in S8 Table. In addition, predictive performance of RFC and GBC models was equal 21 for caspases-2 and 6, meanwhile RFC demonstrated higher accuracy for caspase-7 and GBC overperformed RFC model for caspase-1 that also can be related with the size of the training dataset. Lastly, we evaluated predictive performance of LR, SVC, GBC and RFC models on external dataset to understand if trained models were able to predict known cleavage sites with the top-ranking positions for the protease of interest.…”
Section: Resultsmentioning
confidence: 93%
See 1 more Smart Citation
“…Both ensemble tree classifiers outperformed logistic regression and SVC for all caspases by comparing all three metrics. All evaluated metrics (accuracy, AUC PRC, AUC ROC, MCC, sensitivity and specificity) for all investigated classifiers comparing predictive performance of these models for all selected proteases using window P4-P4' are shown in S8 Table. In addition, predictive performance of RFC and GBC models was equal 21 for caspases-2 and 6, meanwhile RFC demonstrated higher accuracy for caspase-7 and GBC overperformed RFC model for caspase-1 that also can be related with the size of the training dataset. Lastly, we evaluated predictive performance of LR, SVC, GBC and RFC models on external dataset to understand if trained models were able to predict known cleavage sites with the top-ranking positions for the protease of interest.…”
Section: Resultsmentioning
confidence: 93%
“…Proteolytic enzymes play critical role in many processes including cell proliferation, immune response, cell death and others [20]. Protease specificity was studied in different studies [21,36]. Protease cleavage of peptides is directed by short amino acid motifs, from two to eight amino acids around the scissile bond (site of cleavage, SoC) [17].…”
Section: Introductionmentioning
confidence: 99%
“…Even if the source is diligently measured on the cell-type level, more studied cell types might weight more than less studied ones in the gene network definition. Finally, even the knowledge of single gene and protein interactions is unbalanced toward “superstar” genes or proteins, which are more studied than others Lam et al (2016); Marini et al (2018). Users should select the gene label annotation source carefully and according to the problem they want to analyze.…”
Section: Resultsmentioning
confidence: 99%
“…Accordingly, we explore a space transformation—with concomitant dimension reduction—of the k -mer spectrum that identifies a set of (orthogonal) multiple features, i.e., metafeatures, each as an independent combination of the original k -mers contributing to a cumulative portion of the data variance. To do so, we apply a matrix factorization approach, which has been previously shown apt to tackle complex feature extraction problems, e.g., oncology and proteomics [ 36 , 37 ]. The method is based on non-negative matrix tri-factorization [ 38 ].…”
Section: Methodsmentioning
confidence: 99%