2013
DOI: 10.14778/2732219.2732220
|View full text |Cite
|
Sign up to set email alerts
|

Efficient and effective KNN sequence search with approximate n-grams

Abstract: In this paper, we address the problem of finding k-nearest neighbors (KNN) in sequence databases using the edit distance. Unlike most existing works using short and exact ngram matchings together with a filter-and-refine framework for KNN sequence search, our new approach allows us to use longer but approximate n-gram matchings as a basis of KN-N candidates pruning. Based on this new idea, we devise a pipeline framework over a two-level index for searching KNN in the sequence database. By coupling this framewo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 23 publications
(23 citation statements)
references
References 29 publications
0
23
0
Order By: Relevance
“…As we can see from Figure 1, query Q 1 to retrieve the tuples with conditions [1,2]), (B, [1,1]), (C, [2,3]…”
Section: A Match-count Modelmentioning
confidence: 99%
See 4 more Smart Citations
“…As we can see from Figure 1, query Q 1 to retrieve the tuples with conditions [1,2]), (B, [1,1]), (C, [2,3]…”
Section: A Match-count Modelmentioning
confidence: 99%
“…Finally the output of the match-count model is the sum of the integers M C(Q, O) = ri∈Q C(r i , O). For example, in Figure 1, for Q 1 and O 1 we have C((A, [1,2]), O 1 ) = 1, C((B, [1,1]), O 1 ) = 0 and C((C, [2,3]), O 1 ) = 0, then we have M C(Q 1 , O 1 ) = 1 + 0 + 0 = 1.…”
Section: A Match-count Modelmentioning
confidence: 99%
See 3 more Smart Citations