2019
DOI: 10.1101/730143
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Improving protein function prediction with synthetic feature samples created by generative adversarial networks

Abstract: Protein function prediction is a challenging but important task in bioinformatics. Many prediction methods have been developed, but are still limited by the bottleneck on training sample quantity. Therefore, it is valuable to develop a data augmentation method that can generate high-quality synthetic samples to further improve the accuracy of prediction methods. In this work, we propose a novel generative adversarial networks-based method, namely FFPred-GAN, to accurately learn the high-dimensional distributio… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 28 publications
0
2
0
Order By: Relevance
“…On these tasks, pretraining improved performance across multiple model architectures, though follow-up work suggests that similar results could be achieved without pretraining [50]. An additional line of work [33,61,62] benchmarks various protein sequence embeddings on the CAFA benchmark [64], though BERT and related architectures are not considered.…”
Section: Deep Learning For Proteinsmentioning
confidence: 99%
“…On these tasks, pretraining improved performance across multiple model architectures, though follow-up work suggests that similar results could be achieved without pretraining [50]. An additional line of work [33,61,62] benchmarks various protein sequence embeddings on the CAFA benchmark [64], though BERT and related architectures are not considered.…”
Section: Deep Learning For Proteinsmentioning
confidence: 99%
“…Works in [37,38] use GANs to analyze gene expression profiles and works in [39,40] attempt to synthesize genes and promoters by GANs. Recently authors in [41] have been proposed to perform data augmentation to generate synthetic training samples by a GAN to improve a classifier accuracy for annotating proteins.…”
Section: Introductionmentioning
confidence: 99%