2001
DOI: 10.1007/3-540-45650-3_37

A Practical Algorithm to Find the Best Episode Patterns

Abstract: Episode pattern is a generalized concept of subsequence pattern where the length of substring containing the subsequence is bounded. Given two sets of strings, consider an optimization problem to find a best episode pattern that is common to one set but not common in the other set. The problem is known to be NP-hard. We give a practical algorithm to solve it exactly.

Cited by 21 publications (22 citation statements)
References 6 publications
“…We declare that this work generalizes and outperforms the ones accomplished in [8,9], since it is capable of discovering more advanced and useful patterns. In fact, we show some experimental results that convince us of the accuracy of our algorithms as well as their fast performances.…”
Section: Introduction (mentioning)
confidence: 73%
“…Against the fact that finding the best subsequence patterns to separate given two sets of strings is NP-hard, they proposed an algorithm to solve the problem with practically reasonable performance. More recently, an efficient algorithm to discover the best episode patterns was proposed in [9]. An episode pattern ⟨p, k⟩, where p is a string and k is an integer, is said to match a string w if p is a subsequence of a substring u of w with |u| ≤ k [14,6,20].…”
Section: Introduction (mentioning)
confidence: 99%
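
The matching rule quoted above can be illustrated with a minimal sketch. This is a naive check written only to make the definition concrete, not the algorithm from the cited paper; the function names `is_subsequence` and `episode_matches` are hypothetical.

```python
def is_subsequence(p: str, u: str) -> bool:
    """Return True if p can be obtained from u by deleting characters."""
    it = iter(u)
    # `ch in it` consumes the iterator up to the first occurrence of ch,
    # so the characters of p are matched left to right.
    return all(ch in it for ch in p)


def episode_matches(p: str, k: int, w: str) -> bool:
    """Return True if p is a subsequence of some substring u of w with |u| <= k.

    Naive check: any valid substring can be trimmed so that it starts at an
    occurrence of p[0], so it is enough to test the window of length at most k
    that begins at each such position.
    """
    if not p:
        return True  # the empty pattern matches the empty substring
    return any(
        w[i] == p[0] and is_subsequence(p, w[i:i + k])
        for i in range(len(w))
    )


if __name__ == "__main__":
    print(episode_matches("abc", 4, "axbcxx"))  # window "axbc" contains a, b, c in order
    print(episode_matches("abc", 3, "axbc"))    # no window of length 3 contains a, b, c in order
```

With k at least |w| the check reduces to an ordinary subsequence test, which is the sense in which episode patterns generalize subsequence patterns.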
“…The basic idea of the enumeration is similar to previous works [2][3][4][5][6]8]. The main contrivance of this paper is in the method for calculating the lower bound of the RSS score for specific subspaces of the pattern space.…”
Section: Branch and Bound Strategy (mentioning)
confidence: 91%
“…Another situation is when we are given two sets of strings, where one set (positive set) consists of sequences known to possess some biological characteristic, while the other (negative set) consists of sequences known not to posses these characteristics. The problem is to find a discriminating pattern, that is, a pattern which occurs in most strings of the positive set, but does not occur in most of the strings of the negative set [2][3][4][5][6].…”
Section: Introduction (mentioning)
confidence: 99%
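
As a rough illustration of what "discriminating" means in the statement above, the sketch below scores a candidate pattern by the difference between the fraction of positive strings it matches and the fraction of negative strings it matches. The scoring function is an assumption chosen for simplicity; the quoted papers do not specify it here, and the names `discrimination_score` and `is_subsequence` are hypothetical.

```python
from typing import Callable, Sequence


def is_subsequence(p: str, w: str) -> bool:
    """Return True if p is a subsequence of w."""
    it = iter(w)
    return all(ch in it for ch in p)


def discrimination_score(
    pattern: str,
    positives: Sequence[str],
    negatives: Sequence[str],
    matches: Callable[[str, str], bool] = is_subsequence,
) -> float:
    """Assumed score: match rate on the positive set minus match rate on the negative set.

    A value near 1.0 means the pattern occurs in most positive strings and in
    few negative strings; a value near 0.0 means it does not separate the sets.
    """
    pos_rate = sum(matches(pattern, s) for s in positives) / len(positives)
    neg_rate = sum(matches(pattern, s) for s in negatives) / len(negatives)
    return pos_rate - neg_rate


if __name__ == "__main__":
    positives = ["acgt", "aggt", "act"]
    negatives = ["tgca", "ttt"]
    for candidate in ("at", "t"):
        print(candidate, discrimination_score(candidate, positives, negatives))
```

The `matches` parameter is left pluggable so that a bounded-window (episode) matcher could be substituted for the plain subsequence test.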
“…In particular, these unbounded episodes can be very useful in contexts such as the classification of documents or the intrusion detection systems. As argued in [5], a drawback of subsequence patterns is that they are not suitable for classifying long strings over small alphabet, since a short subsequence pattern matches with almost all long strings. So, the larger the episodes found in a text the better for the future predictions.…”
Section: Advantages Of Our Approach (mentioning)
confidence: 99%
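
The drawback described above can be checked empirically with a small experiment. The sketch below generates random strings over a two-letter alphabet and compares how often a short pattern matches as an unbounded subsequence versus as an episode with a bounded window. The string length, sample size, alphabet, seed, and function names are all assumptions made for illustration.

```python
import random


def is_subsequence(p: str, w: str) -> bool:
    """Return True if p is a subsequence of w."""
    it = iter(w)
    return all(ch in it for ch in p)


def episode_matches(p: str, k: int, w: str) -> bool:
    """Return True if p is a subsequence of some substring of w of length <= k."""
    if not p:
        return True
    return any(
        w[i] == p[0] and is_subsequence(p, w[i:i + k])
        for i in range(len(w))
    )


def match_rates(pattern: str, k: int, n_strings: int = 200,
                length: int = 200, alphabet: str = "ab", seed: int = 0):
    """Return (subsequence match rate, episode match rate) over random strings."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same sample
    strings = [
        "".join(rng.choice(alphabet) for _ in range(length))
        for _ in range(n_strings)
    ]
    sub_rate = sum(is_subsequence(pattern, s) for s in strings) / n_strings
    epi_rate = sum(episode_matches(pattern, k, s) for s in strings) / n_strings
    return sub_rate, epi_rate


if __name__ == "__main__":
    sub_rate, epi_rate = match_rates("abababab", k=8)
    print(f"subsequence match rate: {sub_rate:.2f}")
    print(f"episode (k=8) match rate: {epi_rate:.2f}")
```

Under these assumptions the unbounded subsequence pattern matches essentially every string and so carries little information for classification, while the window bound makes the same pattern selective.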