2010
DOI: 10.1007/s00726-010-0547-x
|View full text |Cite
|
Sign up to set email alerts
|

Use of the Burrows–Wheeler similarity distribution to the comparison of the proteins

Abstract: In this paper, we present an approach based on Burrows-Wheeler transform to compare the protein sequences. The strings representing amino acid sequences do not reflect the chemical physical properties better, and it is very hard to extract any key features by reading these long character strings directly. The use of the Burrows-Wheeler similarity distribution needs a suitable representation which can reflect some interesting properties of the proteins. For the comparison of the primary protein sequences we con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 46 publications
0
12
0
Order By: Relevance
“…Yang et al [34,35] defined the following similarity measures based on the BWSD to compare S 1 and S 2 .…”
Section: Burrows-wheeler Similarity Distributionmentioning
confidence: 99%
See 2 more Smart Citations
“…Yang et al [34,35] defined the following similarity measures based on the BWSD to compare S 1 and S 2 .…”
Section: Burrows-wheeler Similarity Distributionmentioning
confidence: 99%
“…A class of similarity measures was defined by Mantaci et al [17] over an extension of the Burrows-Wheeler transform for string collections, called eBWT [16]. Later, Yang et al [34,35] recrafted the method by Mantaci et al and introduced the Burrows-Wheeler similarity distribution (BWSD) of two strings S 1 and S 2 based on the BWT of their concatenation. The authors evaluated similarity measures based on the expectation and Shannon entropy of the BWSD to efficiently construct phylogenetic trees for DNA and protein sequences, thus contributing to an alternative to alignment-based similarity measure among biological sequences.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Sequence comparison helps us to find both similarities and differences among biological sequences, and assign the function of those sequences (Mitrophanov and Borodovsky 2006;Altschul et al 1997;Dai et al 2008;Pham and Zuegg 2004;Pham 2007;Felsenstein 1996;Huelsenbeck and Ronquist 2001;Komatsu et al 2001;Kumar et al 2004;Li et al 2001;Otu and Sayood 2003;Ronquist and Huelsenbeck 2003;Waddell et al 2001;Mohseni-Zadeh et al 2004;Pipenbacher et al 2002;Yao et al 2008;Yang et al 2010). For example, to identify genes and functionally related regulatory sequences in newly sequenced genomes, one must discern functional similarity among candidate subsequences based on their similarity/dissimilarity.…”
Section: Introductionmentioning
confidence: 98%
“…The ebwt of a multiset S is a word obtained by a letters permutation of the words in S induced by the sorting of the conjugates of words in S according with an order relation (denoted by ≺ ω ) defined by using lexicographic order among infinite words. The ebwt is a reversible transformation, that has been used for circular words comparison [18,20] and for circular pattern matching [13]. It is also used as preprocessing of compressors on a single text, where the ebwt is applied to the words obtained by a factorization of the text.…”
Section: Introductionmentioning
confidence: 99%