2021
DOI: 10.7717/peerj.12019
|View full text |Cite
|
Sign up to set email alerts
|

Protein function prediction with gene ontology: from traditional to deep learning models

Abstract: Protein function prediction is a crucial part of genome annotation. Prediction methods have recently witnessed rapid development, owing to the emergence of high-throughput sequencing technologies. Among the available databases for identifying protein function terms, Gene Ontology (GO) is an important resource that describes the functional properties of proteins. Researchers are employing various approaches to efficiently predict the GO terms. Meanwhile, deep learning, a fast-evolving discipline in data-driven … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
10
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 13 publications
(10 citation statements)
references
References 79 publications
0
10
0
Order By: Relevance
“…In brief, these methods can be classified into two categories, that is, amino acid residue sequence-based prediction and integrated data-based prediction. 18 The former extracts information only from protein sequences. For instance, DEEPred 19 acquired features from sequences and fed them into a stack of multitask feed-forward deep neural networks (DNNs).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…In brief, these methods can be classified into two categories, that is, amino acid residue sequence-based prediction and integrated data-based prediction. 18 The former extracts information only from protein sequences. For instance, DEEPred 19 acquired features from sequences and fed them into a stack of multitask feed-forward deep neural networks (DNNs).…”
Section: Introductionmentioning
confidence: 99%
“…A considerable number of methods have been proposed to predict protein function. In brief, these methods can be classified into two categories, that is, amino acid residue sequence-based prediction and integrated data-based prediction . The former extracts information only from protein sequences.…”
Section: Introductionmentioning
confidence: 99%
“…In particular, throughput technologies have greatly increased the number of protein sequences in public repositories, but only about 1% of the sequences in the UniProtKB database have been functionally characterized [ 10 ]. This gap motivates computational approaches [ 11 ], and the computational literature on protein function prediction is rich [ 12 ].…”
Section: Introductionmentioning
confidence: 99%
“…The latter was evidently highly advantageous. For example, in cis -regulatory element (CRE) prediction, the CNN, in the absence of a priori knowledge on the target location, outperforms conventional k-mer enrichment, expectation maximization and Gibbs sampling methods with a lower false positive rate [ 13 , 14 , 15 ].…”
Section: Introductionmentioning
confidence: 99%