2021
DOI: 10.1186/s12859-021-04101-y
|View full text |Cite
|
Sign up to set email alerts
|

Prediction and analysis of multiple protein lysine modified sites based on conditional wasserstein generative adversarial networks

Abstract: Background Protein post-translational modification (PTM) is a key issue to investigate the mechanism of protein’s function. With the rapid development of proteomics technology, a large amount of protein sequence data has been generated, which highlights the importance of the in-depth study and analysis of PTMs in proteins. Method We proposed a new multi-classification machine learning pipeline MultiLyGAN to identity seven types of lysine modified s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(6 citation statements)
references
References 58 publications
0
6
0
Order By: Relevance
“…Within the field of deep learning, GANs have been shown to be very useful in the development of bioinformatic predictive models [ 44 ]. For example, this type of neural network has been used as a data augmentation technique in the study of protein post-translational modifications [ 45 ], antiviral peptides [ 28 ] and protein solubility [ 46 ], among other cases reported in an excellent review by Wan et al . [ 44 ], showing excellent results.…”
Section: Discussionmentioning
confidence: 99%
“…Within the field of deep learning, GANs have been shown to be very useful in the development of bioinformatic predictive models [ 44 ]. For example, this type of neural network has been used as a data augmentation technique in the study of protein post-translational modifications [ 45 ], antiviral peptides [ 28 ] and protein solubility [ 46 ], among other cases reported in an excellent review by Wan et al . [ 44 ], showing excellent results.…”
Section: Discussionmentioning
confidence: 99%
“…However, wet experiment methods are both cost- and time-consuming. Alternatively, several in silico methods ( Johansen et al, 2006 ; Liu et al, 2015 ; Ju et al, 2017 ; Xu H. et al, 2017 ; Zhao et al, 2017 ; Islam et al, 2018 ; Chen K. et al, 2019 ; Yu et al, 2019 ; Khanum et al, 2020 ; Yang et al, 2021 ; Yao et al, 2021 ) have been developed to predict the Kgly sites efficiently. In a pioneer work, Johansen et al proposed a predictor, GlyNN, built by neural networks based on a dataset with 89 Kgly sites and 126 non-Kgly sites of 20 proteins ( Johansen et al, 2006 ).…”
Section: Introductionmentioning
confidence: 99%
“…Leveraging Xu et al’s dataset as training dataset, Yao et al developed a model, ABC-Gly, by selecting the optimal feature subset with a two-step feature selection method by combining the Fisher score and an improved binary artificial bee colony algorithm ( Yao et al, 2021 ). All the previous methods were built on the dataset with less than 500 Kgly sites; however, four other methods, PredGly ( Yu et al, 2019 ), Gly-LysPred ( Khanum et al, 2020 ), MUscADEL ( Chen Z et al, 2019 ), and MultiLyGAN ( Yang et al, 2021 ), which were built on datasets with more than 1,000 Kgly sites. For building PredGly, Yu et al (2019) collected Kgly sites from PLMD ( Xu H. et al, 2017 ) and used CD-HIT ( Huang et al, 2010 ) to remove the redundancy for protein sequences and peptide segments, with a cutoff of 30%.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations