2017
DOI: 10.1016/j.ygeno.2016.12.002

A new method to analyze protein sequence similarity using Dynamic Time Warping

Abstract: Sequence similarity analysis is one of the major topics in bioinformatics. It helps researchers reveal the evolutionary relationships of different species. In this paper, we outline a new method to analyze the similarity of proteins using the Discrete Fourier Transform (DFT) and Dynamic Time Warping (DTW). The original symbol sequences are converted to numerical sequences according to their physico-chemical properties. We obtain the power spectra of sequences from DFT and extend the spectra to the same length to calcula…
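The pipeline named in the abstract (numerical encoding, DFT power spectrum, DTW) can be illustrated with a minimal sketch. This is not the authors' implementation. The Kyte-Doolittle hydropathy scale, the absolute-difference local cost, and the function names `to_numeric`, `power_spectrum`, `dtw_distance`, and `protein_distance` are all assumptions chosen for illustration. The sketch also lets textbook DTW align spectra of unequal length directly, since the abstract is truncated before it explains how the spectra are brought to the same length.

```python
import cmath
import math

# Kyte-Doolittle hydropathy index. Assumption: used here only as one example
# of a physico-chemical property; the paper may use a different one.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def to_numeric(seq):
    """Map each amino-acid letter to its numeric property value."""
    return [HYDROPATHY[aa] for aa in seq.upper()]


def power_spectrum(signal):
    """Return |X[k]|^2 for every DFT coefficient (naive O(n^2) DFT)."""
    n = len(signal)
    spectrum = []
    for k in range(n):
        coeff = sum(
            x * cmath.exp(-2j * cmath.pi * k * t / n)
            for t, x in enumerate(signal)
        )
        spectrum.append(abs(coeff) ** 2)
    return spectrum


def dtw_distance(a, b):
    """Textbook DTW cost between two numeric sequences of any lengths."""
    n, m = len(a), len(b)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])  # assumed local cost
            cost[i][j] = local + min(
                cost[i - 1][j],      # a[i-1] matched again with a later b
                cost[i][j - 1],      # b[j-1] matched again with a later a
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]


def protein_distance(seq_a, seq_b):
    """DTW distance between the power spectra of two protein sequences."""
    spec_a = power_spectrum(to_numeric(seq_a))
    spec_b = power_spectrum(to_numeric(seq_b))
    return dtw_distance(spec_a, spec_b)


if __name__ == "__main__":
    # Two short made-up peptides of different lengths.
    print(protein_distance("MKTAYIAK", "MKTAYIAKQR"))
```

A pairwise matrix of such distances could then be passed to a tree-building method to compare species. For long sequences, a library FFT would be a more practical choice than the naive DFT used here.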

Cited by 30 publications (6 citation statements)
References: 38 publications
“…Phylogenetic tree generated by our method (Fig. 5) is consistent with phylogenetic trees generated in the previous studies [42,50,51] and alignment based method ClustalW using MEGA package [25] (Fig. S5).…”
Section: Results (supporting)
confidence: 84%
“…The digital-signal-based representation encodes a single amino acid (AA) into a number so that a protein sequence is converted into a digital signal sequence, which is processed by digital signal analysis tools to extract the features of the protein sequence. For example, in a study performed by Hou et al. (2017), protein sequence was converted into numerical sequences with their physicochemical properties to achieve the power spectra by Discrete Fourier Transform (DFT). Furthermore, Dynamic Time Warping (DTW) was used to extend the spectra to the same length in order to calculate the distance between different sequences.…”
Section: Introduction (mentioning)
confidence: 99%
“…Their range of lengths is from 602 to 610. This sample set is applied before in [12–25]. Table 3 shows the 3rd sample set which consists of 29 spike protein sequences.…”
Section: Dataset Technology and Tools (mentioning)
confidence: 99%