2019
DOI: 10.3390/cells8121635
|View full text |Cite
|
Sign up to set email alerts
|

Improving the Quantification of DNA Sequences Using Evolutionary Information Based on Deep Learning

Abstract: It is known that over 98% of the human genome is non-coding, and 93% of disease associated variants are located in these regions. Therefore, understanding the function of these regions is important. However, this task is challenging as most of these regions are not well understood in terms of their functions. In this paper, we introduce a novel computational model based on deep neural networks, called DQDNN, for quantifying the function of non-coding DNA regions. This model combines convolution layers for capt… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
8
1

Relationship

4
5

Authors

Journals

citations
Cited by 23 publications
(18 citation statements)
references
References 21 publications
0
18
0
Order By: Relevance
“…The proposed CNN model was trained and tested k intervals. There are four metrics to evaluate the performance of the model such as Accuracy (ACC), Mathew’s correlation coefficient (MCC), Specificity (Sp), and Sensitivity (Sn) with the given mathematical formulation 49 51 . Where TN and TP represent as true negative and true positive having the correct number of identified sequences related to 4mC and non-4mC, respectively.…”
Section: Performance Evaluation Metricsmentioning
confidence: 99%
“…The proposed CNN model was trained and tested k intervals. There are four metrics to evaluate the performance of the model such as Accuracy (ACC), Mathew’s correlation coefficient (MCC), Specificity (Sp), and Sensitivity (Sn) with the given mathematical formulation 49 51 . Where TN and TP represent as true negative and true positive having the correct number of identified sequences related to 4mC and non-4mC, respectively.…”
Section: Performance Evaluation Metricsmentioning
confidence: 99%
“…Because the benchmark datasets are imbalanced, PRC is the best choice for studying the performance of the proposed model 12 . Moreover, the accuracy (ACC), specificity (Sp), sensitivity (Sn), and Matthews correlation coefficient (MCC) were utilized in various recent published studies to evaluate classifier quality in the field of bioinformatics [30][31][32][33][34][35][36][37] . Thus, we also use them to evaluate the performance of the proposed model.…”
Section: Eiip+pseeiipmentioning
confidence: 99%
“…The model was trained and tested k times, recorded performance each time, and concised by taking the mean score for the performance evaluation. The most common criteria which is used to evaluate the performance of the predicted models are four metrics; Mathew’s correlation coefficient (MCC), accuracy (ACC), sensitivity (Sn) and specificity (Sp) with the following mathematical formulations [ 60 , 61 , 62 ]. where TP and TN as true positive and true negative indicate the correct numbers of predicted samples for 4mCs and non-4mCs, respectively.…”
Section: Performance Evaluation Metricsmentioning
confidence: 99%