2019
DOI: 10.1038/s41598-019-49430-4
|View full text |Cite
|
Sign up to set email alerts
|

csDMA: an improved bioinformatics tool for identifying DNA 6 mA modifications via Chou’s 5-step rule

Abstract: DNA N6-methyldeoxyadenosine (6 mA) modifications were first found more than 60 years ago but were thought to be only widespread in prokaryotes and unicellular eukaryotes. With the development of high-throughput sequencing technology, 6 mA modifications were found in different multicellular eukaryotes by using experimental methods. However, the experimental methods were time-consuming and costly, which makes it is very necessary to develop computational methods instead. In this study, a machine learning-based p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
23
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 30 publications
(23 citation statements)
references
References 56 publications
0
23
0
Order By: Relevance
“…We performed 5-fold cross-validation. In 5-fold cross validation, each fold was iteratively selected to test the model, and the remaining four folds were used to train the model [ 59 , 60 ]. For group normalization, we fix the number of groups to 8.…”
Section: Resultsmentioning
confidence: 99%
“…We performed 5-fold cross-validation. In 5-fold cross validation, each fold was iteratively selected to test the model, and the remaining four folds were used to train the model [ 59 , 60 ]. For group normalization, we fix the number of groups to 8.…”
Section: Resultsmentioning
confidence: 99%
“…To develop a useful bioinformatics predictor, a series of recent publications [43], [44]- [50] and review papers [51,52] demonstrate that one needs to follow Chou's 5-steps rule [51] that includes the following five steps: (1) construction of a gold standard dataset to train and test the model; (2) encoding samples with effective formulations; (3) conducting the prediction model with a powerful classifier; (4) evaluating model performance by using cross-validation tests and standard measures; (5) establishing a user-friendly tool for the predictor that can be accessible to the public. A new bioinformatics predictor presented according to the five-step rules [51] would have the following advantages: (i) clear logic deduction; (ii) better demonstration in stimulating other relevant tools; (iii) useful in practical application [56]. We followed this, five-step procedures to develop our new prediction method.…”
Section: Methodsmentioning
confidence: 99%
“…The prediction is based on encoding the genomic DNA samples using dinucleotide composition and optimized dinucleotide-based DNA properties [85]. (4) csDMA: Identification and prediction of DNA 6mA modification in different species via Chou's 5-step rule using three encoding features and different algorithms to generate the feature matrix [86]. (5) iDNA6mA-Rice: Evaluation ofm6A sites in the rice genome using the machine learning random forest algorithm to formulate the sample as an input to discriminate from the methylated and non-methylated sites [87].…”
Section: Bioinformatic Analysis Tools For 6mamentioning
confidence: 99%