Seventh IEEE International Conference on Data Mining (ICDM 2007) 2007
DOI: 10.1109/icdm.2007.66
|View full text |Cite
|
Sign up to set email alerts
|

Mining Frequent Itemsets in a Stream

Abstract: Mining frequent itemsets in a datastream proves to be a difficult problem, as itemsets arrive in rapid succession and storing parts of the stream is typically impossible. Nonetheless, it has many useful applications; e.g., opinion and sentiment analysis from social networks. Current stream mining algorithms are based on approximations. In earlier work, mining frequent items in a stream under the max-frequency measure proved to be effective for items. In this paper, we extended our work from items to itemsets. … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
78
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 73 publications
(80 citation statements)
references
References 8 publications
0
78
0
Order By: Relevance
“…Further interestingness measures for episodes, either statistically motivated or aimed at removing bias towards smaller episodes, were made by Garriga [5], Gwadera et al [10,11], Calders et al [4], and Tatti [17]. All these methods, however, were limited to finding interesting episodes, and stopped short of discovering association rules between them.…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…Further interestingness measures for episodes, either statistically motivated or aimed at removing bias towards smaller episodes, were made by Garriga [5], Gwadera et al [10,11], Calders et al [4], and Tatti [17]. All these methods, however, were limited to finding interesting episodes, and stopped short of discovering association rules between them.…”
Section: Related Workmentioning
confidence: 99%
“…First of all, the confidence of rule G ⇒ H would depend on our choice of disjoint minimal windows -if we chose the first minimal window of H, s [1,5], we would find two occurrences of G outside it and the confidence of the rule would be 1 3 , whereas if we chose the second minimal window of H, s [4,10], we would find just one occurrence of G outside it, and the confidence would be 2 3 . More importantly, whichever choice we made, we would not be able to get the correct result, showing that every occurrence of G is contained within an occurrence of H. Now that we have seen that we cannot define the confidence of an association rule using either the disjoint-window frequencies, or the containment of the disjoint occurrences, of the two episodes, we are ready to present a definition that corresponds exactly to our intuition.…”
Section: Using Minimal Windowsmentioning
confidence: 99%
See 3 more Smart Citations