2018
DOI: 10.1371/journal.pone.0191900
|View full text |Cite
|
Sign up to set email alerts
|

Improving succinylation prediction accuracy by incorporating the secondary structure via helix, strand and coil, and evolutionary information from profile bigrams

Abstract: Post-translational modification refers to the biological mechanism involved in the enzymatic modification of proteins after being translated in the ribosome. This mechanism comprises a wide range of structural modifications, which bring dramatic variations to the biological function of proteins. One of the recently discovered modifications is succinylation. Although succinylation can be detected through mass spectrometry, its current experimental detection turns out to be a timely process unable to meet the ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
56
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2

Relationship

4
4

Authors

Journals

citations
Cited by 54 publications
(56 citation statements)
references
References 86 publications
(107 reference statements)
0
56
0
Order By: Relevance
“…These are helix (ph), strand (pe) and coil (pc) motifs. Information from the secondary structure can contribute constructively to the general three-dimensional configuration of the polypeptide and the affinity for PTM of lysine residues [54,57]. Given a protein sequence, SPIDER2 produces a L × 3 matrix containing the predicted secondary structure, which we call SSpre.…”
Section: Feature Extractionmentioning
confidence: 99%
See 1 more Smart Citation
“…These are helix (ph), strand (pe) and coil (pc) motifs. Information from the secondary structure can contribute constructively to the general three-dimensional configuration of the polypeptide and the affinity for PTM of lysine residues [54,57]. Given a protein sequence, SPIDER2 produces a L × 3 matrix containing the predicted secondary structure, which we call SSpre.…”
Section: Feature Extractionmentioning
confidence: 99%
“…Protein sequences are of varying lengths and cannot be used directly in classification. Classifiers require dataset of fixed length [61] therefore we employed a widely used method of truncating the protein sequence into fixed length peptide segments [54,57,[62][63][64][65][66] proposed by Chou [67,68].…”
Section: Feature Vector Constructionmentioning
confidence: 99%
“…For this reason, dealing with class imbalance is a very crucial action in classification problems. To carry out the imbalance treatment, we utilized the commonly used scheme called the k-nearest neighbor strategy [26,28,32,55,63] where we removed a negative instance when one of its k neighbors was a positive instance. We started out the process by finding the initial value of k by dividing the number of samples in the negative set with the number of samples in the positive set.…”
Section: Data Imbalance Treatmentmentioning
confidence: 99%
“…The feature vector, therefore, consisted of 3 upstream and 3 downstream and 20 upstream and 20 downstream amino acid residues for the two different characteristics corresponding to phosphoglycerylated and non-phosphoglycerylated sites. In the benchmark dataset, there existed a high class imbalance between non-phosphoglycerylated and phosphoglycerylated lysine residues hence we adopted the k-nearest neighbors strategy to carry out the cleaning action [26,32,33]. EvolStruct-Phogly showed a substantial improvement in the detection of phosphoglycerylated and non-phosphoglycerylated residues when compared with the existing predictors [12,29] with sensitivity, specificity, precision, accuracy, and Mathews correlation coefficient equal to 0.7744, 0.8533, 0.7368, 0.8275 and 0.6242, respectively.…”
Section: Introductionmentioning
confidence: 99%
“…Lysine residue (sumoylated or non-sumoylated) is described by a segment of 31 amino acids (15 upstream and 15 downstream) as done by previous studies [45][46][47][48][49][50][51].…”
Section: Lysine Residue Descriptionmentioning
confidence: 99%