2019
DOI: 10.1016/j.compbiolchem.2018.12.014
|View full text |Cite
|
Sign up to set email alerts
|

Physicochemical property based computational scheme for classifying DNA sequence elements of Saccharomyces cerevisiae

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 17 publications
0
5
0
Order By: Relevance
“…For binary classification, we used the same coding schemes as implemented in Orozco-Arias et al (2020) , such as DAX ( Yu et al, 2015 ), EIIP ( Nair & Sreenadhan, 2006 ), Complementary ( Akhtar, Epps & Ambikairajah, 2008 ), Enthalpy ( Kauer & Blöcker, 2003 ), and Galois (4) ( Rosen, 2006 ). Additionally, two techniques were applied to automatically extract features from the sequences: (i) k -mer frequencies were obtained for each element and (ii) three physical-chemical (PC) properties were extracted, such as average hydrogen bonding energy per base pair (bp), stacking energy (per bp), and solvation energy (per bp), which were calculated by taking the first di-nucleotide and moving in a sliding window of one base at a time ( Jaiswal & Krishnamachari, 2019 ). Moreover, we pre-processed the data by scaling, following the strategy implemented in Tabares-soto et al (2020) , and performed a dimensional reduction through a principal component analysis (PCA) ( Wold, Esbensen & Geladi, 1987 ) with a cumulative variance of 96% and tolerance of 1e−4.…”
Section: Methodsmentioning
confidence: 99%
“…For binary classification, we used the same coding schemes as implemented in Orozco-Arias et al (2020) , such as DAX ( Yu et al, 2015 ), EIIP ( Nair & Sreenadhan, 2006 ), Complementary ( Akhtar, Epps & Ambikairajah, 2008 ), Enthalpy ( Kauer & Blöcker, 2003 ), and Galois (4) ( Rosen, 2006 ). Additionally, two techniques were applied to automatically extract features from the sequences: (i) k -mer frequencies were obtained for each element and (ii) three physical-chemical (PC) properties were extracted, such as average hydrogen bonding energy per base pair (bp), stacking energy (per bp), and solvation energy (per bp), which were calculated by taking the first di-nucleotide and moving in a sliding window of one base at a time ( Jaiswal & Krishnamachari, 2019 ). Moreover, we pre-processed the data by scaling, following the strategy implemented in Tabares-soto et al (2020) , and performed a dimensional reduction through a principal component analysis (PCA) ( Wold, Esbensen & Geladi, 1987 ) with a cumulative variance of 96% and tolerance of 1e−4.…”
Section: Methodsmentioning
confidence: 99%
“…Codebook References Recently, an innovative way to convert sequences into numerical representations was proposed by Jaiswal & Krishnamachari (2019). The authors considered three physicochemical properties, namely, average hydrogen bonding energy per base pair (bp), stacking energy (per bp), and solvation energy (per bp), which are calculated by taking the first di-nucleotide and then moving a sliding window, one base at a time.…”
Section: Encoding Schemesmentioning
confidence: 99%
“…Recently, an innovative way to convert sequences into numerical representations was proposed by (Jaiswal & Krishnamachari, 2019). The authors considered three physicochemical properties, namely, average hydrogen bonding energy per base pair (bp), stacking energy (per bp), and solvation energy (per bp), which are calculated by taking the first di-nucleotide and then moving a sliding window, one base at a time.…”
Section: Most Used Dna Coding Schemes (Q4)mentioning
confidence: 99%
“…PRISMA flow diagram Publication identifier Year Q1 Q2 Q3 Q4 Reference Publication identifier Year Q1 Q2 Q3 Q4 Reference P1 X X X X (Yu, Yu & Pan, 2017) P19 2013 X X X (Loureiro et al, 2013b) P2 X X X (Schietgat et al, 2018) P20 2014 X X (Ma, Zhang & Wang, 2014) P3 X X X (Arango-López et al, 2017) P21 2010 X X (Dashti & Masoudi-Nejad, 2010) P4 X X X (Loureiro et al, 2013a) P22 2010 X X (Ding, Zhou & Guan, 2010) P5 X X X (Tsafnat et al, 2011) P23 2019 X (Jaiswal & Krishnamachari, 2019) P6 X X X (Zhang et al, 2018) P24 2015 X X X X (Girgis, 2015) P7 X X X (Eraslan et al, 2019) P25 2018 X X X (Nakano et al, 2018a) P8 X X X (Douville et al, 2018) P26 2018 X X X (Zamith Santos et al, 2018) P9 X X (Chen et al, 2018) P27 2009 X (Abrusan et al, 2009) P10 X X X X (Ashlock & Datta, 2012) P28 2019 X X (Su, Gu & Peterson, 2019) P11 X X X (Smith et al, 2017) P29 2017 X X X X (Nakano et al, 2017) P12 X X X X (Kamath, De Jong & Shehu, 2014) P30 2014 X X X (Brayet et al, 2014) P13 X X X (Kim et al, 2016) P31 2013 X (Zamani et al, 2013) P14 X X X (Segal et al, 2018) P32 2019 X (Hubbard et al, 2019) P15 X X X (Rawal & Ramaswamy, 2011) P33 2014 X X (Ryvkin et al, 2014) P16 X X X (Tang et al, 2017) P34 2013 X X X X (Zhang et al, 2013) P17 X X X (Ventola et al, 2017) P35 2019 X X…”
Section: Figurementioning
confidence: 99%