2020
DOI: 10.1016/j.gpb.2019.11.010

HybridSucc: A Hybrid-Learning Architecture for General and Species-Specific Succinylation Site Prediction

Abstract: As an important protein acylation modification, lysine succinylation (Ksucc) is involved in diverse biological processes, and participates in human tumorigenesis. Here, we collected 26,243 non-redundant known Ksucc sites from 13 species as the benchmark data set, combined 10 types of informative features, and implemented a hybrid-learning architecture by integrating deep-learning and conventional machine-learning algorithms into a single framework. We constructed a new tool named HybridSucc, which achieved are…

Cited by 31 publications (28 citation statements)
References: 49 publications
“…Recent studies have identified global lysine succinylation sites at the proteomic level in microorganisms, animals, humans and plants [15–18], demonstrating that succinylation is ubiquitous in diverse organisms. Subsequent studies verified histone lysine succinylation in prokaryotes [19] and eukaryotic cells [20], and more comprehensive lysine succinylome studies in humans, yeast, mice and bacteria have confirmed that Ksuc is evolutionarily conserved and ubiquitous [21, 22]. Hundreds of succinylation sites and proteins have been identified in a variety of microorganisms.…”
Section: Introduction (mentioning)
confidence: 99%
“…For each candidate combination, we randomly generated a training data set and a testing data set with a ratio of approximately 4:1. The testing data set was only used to test the performance but not for training, and the final total AUC value was calculated as below: The least absolute shrinkage and selection operator (LASSO, L1 regularization) penalty and the ridge regression (L2 regularization) penalty in PLR [25–27] were iteratively used to optimize the weight values of the 5 proteins or metabolites. To simplify the composition of a combination, one or multiple protein or metabolite was randomly dropped if the total AUC value of the 5-fold cross-validation was increased.…”
Section: Methods (mentioning)
confidence: 99%
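For orientation, the two penalties named in the statement above can be written in their standard textbook form. This is a sketch only: the excerpt does not reproduce the citing paper's own equations (including the omitted total AUC formula, which is not reconstructed here), and the symbols $\ell$, $\lambda$, $w$ and $p$ are notation assumed for illustration.

```latex
% \ell(w): logistic log-likelihood of the training data (assumed notation)
% \lambda \ge 0: regularization strength (assumed notation)
% p: number of weighted features (5 proteins or metabolites in the statement above)
\begin{align}
\hat{w}_{\mathrm{LASSO}} &= \arg\min_{w} \; -\ell(w) + \lambda \sum_{j=1}^{p} \lvert w_j \rvert , \\
\hat{w}_{\mathrm{ridge}} &= \arg\min_{w} \; -\ell(w) + \lambda \sum_{j=1}^{p} w_j^{2} .
\end{align}
```

The L1 term can shrink individual weights exactly to zero, while the L2 term shrinks all weights smoothly toward zero without removing any of them.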
“…The accuracy of a model was evaluated by calculating the total area under curve (AUC) value, and we also computed the total root mean squared error (RMSE) to measure the prediction bias. In the step of FCP, a widely used machine learning algorithm, penalized logistic regression (PLR) [25–27], was used for model training and parameter optimization (Fig. 3A).…”
Section: Machine Learning-based Inference Of Cc-specific Biomarker Co… (mentioning)
confidence: 99%
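The two metrics named in the statement above can be illustrated with a minimal Python sketch. It computes a plain AUC (pairwise rank formulation) and a plain RMSE for one set of predictions. How the citing paper aggregates these into "total" values is not given in the excerpt, so no aggregation is shown, and the labels and scores below are made-up illustration data.

```python
from math import sqrt


def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def rmse(labels, scores):
    """Root mean squared error between 0/1 labels and predicted scores."""
    return sqrt(sum((s - y) ** 2 for y, s in zip(labels, scores)) / len(labels))


# Hypothetical labels and predicted probabilities, for illustration only.
labels = [1, 0, 1, 0, 1]
scores = [0.9, 0.2, 0.6, 0.7, 0.8]
auc_value = roc_auc(labels, scores)
rmse_value = rmse(labels, scores)
```

The pairwise loop is quadratic in the number of samples, which is acceptable for a sketch but would normally be replaced by a sort-based computation on large data sets.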
“…Huang et al developed a computational predictor, named CNN-SuccSite, which has been developed based on deep learning architectures with different encoding schemes [16]. Recently, Ning et al developed HybridSucc using Group-based Prediction System (GPS) via diverse encoding systems including k-space amino acid pair composition (CKSAAP), amino acid index (AAindex) physicochemical properties and pseudo amino acid composition (PseAAC) [17]. More recently, Hasan et al also suggested a predictor termed GPSuc, by combining five sequence encoding schemes i.e.…”
Section: Introduction (mentioning)
confidence: 99%
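As a rough illustration of the CKSAAP encoding mentioned in the statement above, the following Python sketch counts residue pairs separated by k positions and normalizes the counts for each k. It follows the commonly described form of the encoding and is not taken from HybridSucc itself; the function name `cksaap`, the parameter `k_max`, the example window, and the choice to normalize by the number of valid pairs are all assumptions made for illustration.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered pairs of the 20 standard amino acids, in a fixed order.
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def cksaap(peptide, k_max=2):
    """Return pair frequencies for gaps k = 0..k_max, concatenated.

    For each k, residues at positions i and i + k + 1 form a pair.
    Pairs containing non-standard characters (e.g. padding) are skipped,
    and frequencies are normalized by the number of pairs actually counted.
    """
    features = []
    for k in range(k_max + 1):
        counts = dict.fromkeys(PAIRS, 0)
        total = 0
        for i in range(len(peptide) - k - 1):
            pair = peptide[i] + peptide[i + k + 1]
            if pair in counts:
                counts[pair] += 1
                total += 1
        features.extend(counts[p] / total if total else 0.0 for p in PAIRS)
    return features


# Hypothetical 9-residue window, for illustration only.
window = "AKLMKSAKA"
vec = cksaap(window, k_max=1)
```

With `k_max=1` the vector has 800 entries: 400 frequencies for adjacent pairs followed by 400 for pairs with one residue between them.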