2020
DOI: 10.7717/peerj-cs.278
|View full text |Cite
|
Sign up to set email alerts
|

Genome annotation across species using deep convolutional neural networks

Abstract: Application of deep neural network is a rapidly expanding field now reaching many disciplines including genomics. In particular, convolutional neural networks have been exploited for identifying the functional role of short genomic sequences. These approaches rely on gathering large sets of sequences with known functional role, extracting those sequences from whole-genome-annotations. These sets are then split into learning, test and validation sets in order to train the networks. While the obtained networks p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
9
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(10 citation statements)
references
References 34 publications
(44 reference statements)
1
9
0
Order By: Relevance
“…For non-model organisms, it is more difficult to find accurate training data and very few CNN methods have been designed specifically to predict SS in non-model organisms. Since SS and other regulatory motifs may be conserved across similar species [ 20 ], some work has been done to try to transfer models trained on model organisms to related organisms, for example between different vertebrate genomes [ 66 ]. Others have built cross-species models for specific clades, such as animals or plants using Helixer [ 67 ], but unfortunately the source code for this program is not yet stable (according to the authors).…”
Section: Discussionmentioning
confidence: 99%
“…For non-model organisms, it is more difficult to find accurate training data and very few CNN methods have been designed specifically to predict SS in non-model organisms. Since SS and other regulatory motifs may be conserved across similar species [ 20 ], some work has been done to try to transfer models trained on model organisms to related organisms, for example between different vertebrate genomes [ 66 ]. Others have built cross-species models for specific clades, such as animals or plants using Helixer [ 67 ], but unfortunately the source code for this program is not yet stable (according to the authors).…”
Section: Discussionmentioning
confidence: 99%
“…Deep convolutional neural networks were used to annotate gene-start sites in different species by training the model using the sites from one species as the positive sample and random sequences from the same species as the negative sample. The model was able to identify gene-start sites in other species [ 97 ].…”
Section: Integrating Artificial Intelligence In Metabolic Engineeringmentioning
confidence: 99%
“…This method aims to discover the biomarkers (OTU features) and assess their functions. A CNN model has also shown the feasibility of DL in functional annotation of genome sequences (Khodabandelou et al, 2020). The authors have recognized the short sequences with certain known functions, such as functions of the promoters in different species.…”
Section: Functional Analysis By Deep Learning Methodsmentioning
confidence: 99%