2013 IEEE 13th International Conference on Data Mining 2013
DOI: 10.1109/icdm.2013.124
|View full text |Cite
|
Sign up to set email alerts
|

Mining Statistically Significant Sequential Patterns

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
34
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
3
3
3

Relationship

0
9

Authors

Journals

citations
Cited by 35 publications
(34 citation statements)
references
References 28 publications
0
34
0
Order By: Relevance
“…There are a number of directions for further research. Among these, we find particularly interesting and challenging the extension of our method to other definitions of statistical significance for patterns and to other definitions of patterns such as sequential patterns [46]. Also interesting is the derivation of better lower bounds to the VC-dimension of the range set of a collection of itemsets.…”
Section: Discussionmentioning
confidence: 99%
“…There are a number of directions for further research. Among these, we find particularly interesting and challenging the extension of our method to other definitions of statistical significance for patterns and to other definitions of patterns such as sequential patterns [46]. Also interesting is the derivation of better lower bounds to the VC-dimension of the range set of a collection of itemsets.…”
Section: Discussionmentioning
confidence: 99%
“…[15] solves the problem of mining statistically significant substrings in a string generated from a memoryless Bernoulli distribution and uses the chi-square statistic as a quantitative measure of statistical significance. The statistical significance is considered for the sequential pattern mining problem as well in [16]. The approach developed by the authors is able to efficiently mine unexpected patterns in sequence of itemsets without considering overlapping occurrences or conditioning the length of the sequence.…”
Section: Related Workmentioning
confidence: 99%
“…Instead of using windows of fixed length, ranking based on minimal window lengths with respect to the independence model was suggested by Tatti [11]. Ranking serial episodes allowing multiple labels using the independence model was suggested by Low-Kam et al [7]. Achar et al [1] also considered a measure that downranks the episode if there is a non-edge (x, y) that occurs rarely, which suggests that we should augment the episode with the edge (y, x).…”
Section: Related Workmentioning
confidence: 99%