2019
DOI: 10.3389/fgene.2019.00793
|View full text |Cite
|
Sign up to set email alerts
|

iDNA6mA-Rice: A Computational Tool for Detecting N6-Methyladenine Sites in Rice

Abstract: DNA N6-methyladenine (6mA) is a dominant DNA modification form and involved in many biological functions. The accurate genome-wide identification of 6mA sites may increase understanding of its biological functions. Experimental methods for 6mA detection in eukaryotes genome are laborious and expensive. Therefore, it is necessary to develop computational methods to identify 6mA sites on a genomic scale, especially for plant genomes. Based on this consideration, the study aims to develop a machine learning-based… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
62
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 61 publications
(62 citation statements)
references
References 80 publications
(101 reference statements)
0
62
0
Order By: Relevance
“…This modification plays important roles in DNA mismatch repair, chromosome replication, cell defense, cell cycle regulation, and transcription . 6mA shows similar properties in eukaryotes and prokaryotes (Hao et al, 2019).…”
Section: Introductionmentioning
confidence: 84%
See 2 more Smart Citations
“…This modification plays important roles in DNA mismatch repair, chromosome replication, cell defense, cell cycle regulation, and transcription . 6mA shows similar properties in eukaryotes and prokaryotes (Hao et al, 2019).…”
Section: Introductionmentioning
confidence: 84%
“…This algorithm can be understood as a unique representation of nucleotides and can be considered a one hot encoding algorithm. A random DNA sequence with m nucleotides can then be converted into a vector of 4 × m features (Hao et al, 2019;Chen et al, 2019c). The representation of nucleotides is not unique, and the representations of A, T, G, and C are interchangeable.…”
Section: Binary Encodingmentioning
confidence: 99%
See 1 more Smart Citation
“…A benchmark dataset is important to build a reliable prediction model. In this study, for convenience, we use the 6mA-rice-Lv dataset (17,18), including 154,000 positive samples and 154,000 negative samples, to evaluate the proposed method and to compare it with other methods. For each positive sample obtained from GEO, the sequence is 41nt long with the 6mA site locating at the center.…”
Section: Benchmark Datasetmentioning
confidence: 99%
“…Zhou et al (2018) found 265,290 of rice 6mA data through a variety of experimental methods, such as HPLC-MS/MS, 6mA immunoprecipitation sequencing and Single Molecule Real-Time (SMRT), which enables us to train complex models for 6mA identification. For example, Lv et al (2019) provided a new random forest model named iDNA6mA-rice based on the reconstructed 154,000 6mA data and 154,000 non-6ma data. iDNA6mA-rice is mainly realized by the random forest algorithm module (RF) based on three feature extraction techniques: K-tuple nucleotide frequency component, mono-nucleotide binary encoding and natural vector.…”
Section: Introductionmentioning
confidence: 99%