2016
DOI: 10.1039/c6mb00374e

Combining pseudo dinucleotide composition with the Z curve method to improve the accuracy of predicting DNA elements: a case study in recombination spots

Abstract: Pseudo dinucleotide composition (PseDNC) and Z curve showed excellent performance in the classification issues of nucleotide sequences in bioinformatics. Inspired by the principle of Z curve theory, we improved PseDNC to give the phase-specific PseDNC (psPseDNC). In this study, we used the prediction of recombination spots as a case to illustrate the capability of psPseDNC and also PseDNC fused with Z curve theory based on a novel machine learning method named large margin distribution machine (LDM). We verifi…

Citations: cited by 20 publications (9 citation statements)
References: 37 publications
“…The Z-curve has been widely used in the field of bioinformatics for tasks such as protein coding gene identification (Zhang and Wang 2000; Chen et al 2003; Guo et al 2003; Guo and Zhang 2006; Hua et al 2015), promoter recognition (Yang et al 2008), translation start recognition (Ou et al 2004), recombination spots recognition (Dong et al 2016), and nucleosome position mapping. However, correlation and λ-interval nucleotide composition have not been incorporated into the Z-curve method.…”
Section: Discussion (mentioning)
confidence: 99%
“…The meiotic recombination does not take place randomly in a chromosome but occurs in some regions of a chromosome. In general, the region that exhibits a high frequency of recombination is considered as hotspots, whereas the region that exhibits low frequency of recombination is considered as coldspots (Liu et al, 2012; Dong et al, 2016). The study of recombination spots provides useful information about the basic functionality of inheritance and genome diversity.…”
Section: Introduction (mentioning)
confidence: 99%
“…A predictor called iRSpot-DACC was also presented to predict recombination hotspots and coldspots [20]. Recently, the same problem was further investigated by including the Z curve approach [21], and the ensemble learning approach [22].…”
Section: Introduction (mentioning)
confidence: 99%
“…Therefore, most of those models are quite limited for practical applications. (ii) Some works [13, 14, 21, 24] used codon composition or coding region information to formulate DNA samples. However, recombination spots are not always located in coding regions.…”
Section: Introduction (mentioning)
confidence: 99%