2021
DOI: 10.1093/bib/bbab351
|View full text |Cite
|
Sign up to set email alerts
|

Leveraging the attention mechanism to improve the identification of DNA N6-methyladenine sites

Abstract: DNA N6-methyladenine is an important type of DNA modification that plays important roles in multiple biological processes. Despite the recent progress in developing DNA 6mA site prediction methods, several challenges remain to be addressed. For example, although the hand-crafted features are interpretable, they contain redundant information that may bias the model training and have a negative impact on the trained model. Furthermore, although deep learning (DL)-based models can perform feature extraction and c… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 39 publications
(28 citation statements)
references
References 45 publications
0
26
0
Order By: Relevance
“…The Rice genome containing both 6mA-rice-Chen and 6mA-rice-Lv data sets, which are widely used to examine the performance of the previous methods ,,, over cross-validation tests, are also employed to evaluate our method by performing cross-validation tests. For more detailed information on the data set construction, please refer to refs , and . Table shows the details of the statistical composition of these data sets.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The Rice genome containing both 6mA-rice-Chen and 6mA-rice-Lv data sets, which are widely used to examine the performance of the previous methods ,,, over cross-validation tests, are also employed to evaluate our method by performing cross-validation tests. For more detailed information on the data set construction, please refer to refs , and . Table shows the details of the statistical composition of these data sets.…”
Section: Methodsmentioning
confidence: 99%
“…To overcome the drawback of ML-based methods, a few DL techniques that utilize multi-layer artificial neural networks to learn tasks have been successfully applied to solve computational biology problems, including 6mA site prediction. A few of these have been named below: SpineNet-6mA, GC6mA-Pred, iRicem6A-CNN, Deep6mA, i6mA-VC, SNNRice6mA, DeepM6A, iDNA6mA (five-step rule), i6mA-DNC, LA6mA, AL6mA, 6mA-RicePred, and Deep6mAPred . These DL-based methods often utilize only one-hot encoding of the DNA sequence to recognize 6mA sites with one or more DL algorithms, such as convolutional neural networks and fully connected hidden layers .…”
Section: Introductionmentioning
confidence: 99%
“…Compared with traditional laboratory experiments, bioinformatics tools have significant advantages in terms of price and time cost (Figure 1D). At present, there are many deep learning models used for predicting 6 mA, such as DNA6mA-MINT (Rehman and Chong, 2020), i6mA-stack (Khanal et al, 2021), SNNRice6mA (Yu and Dai, 2019), SMEP (Wang et al, 2021), Deep6mA (Li et al, 2021b), LA6mA, AL6mA (Zhang et al, 2021), GC6mA-Pred (Cai et al, 2022), Meta-i6mA (Hasan et al, 2021), and BERT6mA (Tsukiyama et al, 2022). Based on neural networks, Yu and Dai.…”
Section: Deep Learning Predictive Modelmentioning
confidence: 99%
“…For example, Deep6mA presents an accuracy of more than 90% in predicting plants such as Arabidopsis (Li et al, 2021b). LA6mA and AL6mA capture location information from DNA sequences through a self-attention mechanism (Zhang et al, 2021). GC6mA-Pred mainly identifies 6 mA sites in the rice genome and outperforms several prediction models, including DNA6MA-MINT, on independent datasets (Cai et al, 2022).…”
Section: Deep Learning Predictive Modelmentioning
confidence: 99%
“…DNA sequence analysis is similar to natural language processing, in which recurrent neural networks (RNNs) can be used to process the sequential data. As a popular and powerful RNN architecture, LSTM has been widely used to solve the problem of biological sequence analysis and has achieved excellent performance [43][44][45]. BiLSTM consists of two reversed unidirectional LSTM networks, which is a special type of RNN.…”
Section: Bidirectional Long Short-term Memory Networkmentioning
confidence: 99%