1998
DOI: 10.1093/comjnl/41.6.402
|View full text |Cite
|
Sign up to set email alerts
|

An Efficient Hash-Based Algorithm for Sequence Data Searching

Abstract: In real life, data collected day by day often appear in sequences and this type of data is called sequence data. The technique of searching for similar patterns among sequence data is very important in many applications. We first point out that there are some deficiencies in the existing definitions of sequence similarity. We then introduce a definition of sequence similarity based on the shape of sequences. The definition is also extended to handle sequence matching with linear scaling in both amplitude and t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
9
0

Year Published

2002
2002
2005
2005

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 26 publications
1
9
0
Order By: Relevance
“…For example, in music retrieval has been reported [9]: "To achieve tempo invariance, the targets are stretched by 19 different scaling factors from 0.5 to 2.0." Similar remarks can be found in the literature of gait analysis [14], handwritten archive indexing [28], bioinformatics [1] and data mining [8].…”
Section: Motivating the Need For Uniform Scalingsupporting
confidence: 78%
“…For example, in music retrieval has been reported [9]: "To achieve tempo invariance, the targets are stretched by 19 different scaling factors from 0.5 to 2.0." Similar remarks can be found in the literature of gait analysis [14], handwritten archive indexing [28], bioinformatics [1] and data mining [8].…”
Section: Motivating the Need For Uniform Scalingsupporting
confidence: 78%
“…Although we are particularly interested in motion capture data, as we noted above, our algorithm may have utility in domains as diverse as music retrieval [7] and space telemetry [8]. We will therefore perform all experiments on the following two datasets.…”
Section: Resultsmentioning
confidence: 99%
“…However, all previous work has focused on speeding up similarity search, when the scaling factor is known [8][17] [26]. The feature that differentiates our work from all the rest is that we allow a user to issue a single query, and find the best match at any scaling.…”
Section: Related Workmentioning
confidence: 99%
“…There exists a handful of techniques that can support similarity search under uniform scaling if the scaling factor is known in advance [3,9]; however, in most domains it is unlikely that we know the scaling factor. In such instances we must resort to multiple queries, one for each possible scaling factor.…”
Section: Introductionmentioning
confidence: 99%