2022
DOI: 10.7717/peerj.13613
|View full text |Cite
|
Sign up to set email alerts
|

Genomics enters the deep learning era

Abstract: The tremendous amount of biological sequence data available, combined with the recent methodological breakthrough in deep learning in domains such as computer vision or natural language processing, is leading today to the transformation of bioinformatics through the emergence of deep genomics, the application of deep learning to genomic sequences. We review here the new applications that the use of deep learning enables in the field, focusing on three aspects: the functional annotation of genomes, the sequence… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
19
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 22 publications
(19 citation statements)
references
References 136 publications
0
19
0
Order By: Relevance
“…The different GC content values of Mmyco and Mpneumo prompted us to investigate if the GC content could explain the establishment of Y and U chromatin. To do so, we turned to convolution neural networks (CNN), a computational approach increasingly used in genomics to learn genome annotations from the underlying DNA sequence 25,26 ( Supplementary Figure 1d) . We trained CNNs to learn MNase-seq, and Scc1 or PolII ChIP-seq profiles using yeast chromosomes I to XV sequences (Methods) 27 .…”
Section: Resultsmentioning
confidence: 99%
“…The different GC content values of Mmyco and Mpneumo prompted us to investigate if the GC content could explain the establishment of Y and U chromatin. To do so, we turned to convolution neural networks (CNN), a computational approach increasingly used in genomics to learn genome annotations from the underlying DNA sequence 25,26 ( Supplementary Figure 1d) . We trained CNNs to learn MNase-seq, and Scc1 or PolII ChIP-seq profiles using yeast chromosomes I to XV sequences (Methods) 27 .…”
Section: Resultsmentioning
confidence: 99%
“…Number of layers [1,6] Number of units per layer [4,50] Dropout rate [0. the estimation of the optimal cut-off: The final DL architecture estimated for each LASSO penalty was applied to classify the test data set. For each individual, the output from the last layer, resulting from the sigmoid activation function, was expressed as a probability of being mastitis-susceptible.…”
Section: Sampled Hyperparameters Rangementioning
confidence: 99%
“…Due to the development of high-throughput technology, the past few decades have seen a considerable increase in the availability of genomic data [1,2]. Among them, the most common data structure is the whole genome sequence (WGS) that is nowadays available for thousands of individuals representing various species e.g.…”
Section: Introductionmentioning
confidence: 99%
“…In this scenario of rare and complex genetic disorders where a diagnosis is not reached or a prognosis is not accurate enough, more sophisticated methods should be applied to analyze large-scale genomic data. The use of artificial intelligence (AI) and, particularly, machine learning (ML) algorithms has raised great interest in recent years due to its potential to uncover complex patterns in genomic data 8 . These ML algorithms have shown the capacity to learn from and act on large, heterogeneous datasets to extract new biological insights, improving the accuracy of the diagnosis of RDs [9][10][11][12] .…”
Section: Introductionmentioning
confidence: 99%