1987
DOI: 10.1016/0306-4379(87)90014-7

Approximating the number of unique values of an attribute without sorting

citations
Cited by 33 publications
(31 citation statements)
references
References 1 publication
“…Haas et al [65] base their estimation on data samples and describe and empirically compare various estimators from the literature. Other works do scan the entire data but use only a small amount of memory to hash the values and estimate the number of distinct values, an early example being [11].…”
Section: Cardinalities (mentioning)
confidence: 99%
“…Each leaf node (as well as each internal node) of the complex structure is uniquely identified by an XPath expression; the set of XPaths (i.e., nodes) for a given data source can be computed simply by submitting every XML document from the data source to a generic SAX parser. There is a data set associated with each leaf node, and for the ith leaf node, ds_i = {v_1, v_2, …, v_n}, where v_j is the data value at the ith leaf node in the jth XML document from the specified data source. Observe that ds_i is a multiset, because the same data value can appear in multiple documents.…”
Section: Repository Dataflow (mentioning)
confidence: 99%
“…The second key component of the synopsis is a hash-counting data structure. The idea is that each incoming data element is fed into a "probabilistic counting" algorithm; see, e.g., [2]. Such algorithms estimate the number of distinct values in a data set in a single pass using a fixed amount of memory.…”
Section: Synopsis Creation (mentioning)
confidence: 99%
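
The citation statements above describe estimating the number of distinct values in a single pass by hashing values into a fixed amount of memory. A minimal sketch of that idea follows, using a bitmap estimator of the kind usually called linear counting. It is an illustration only and is not claimed to be the algorithm of the cited paper or of reference [2]; the function name `estimate_distinct`, the parameter `num_bits`, and the choice of BLAKE2 as the hash are all assumptions made for this example.

```python
import hashlib
import math


def estimate_distinct(values, num_bits=4096):
    """Estimate the number of distinct items in one pass over `values`.

    Hypothetical helper for illustration: memory use is fixed by `num_bits`
    and does not grow with the size of the input.
    """
    # One byte per bitmap slot keeps the sketch simple; a real
    # implementation would pack the slots into actual bits.
    bitmap = bytearray(num_bits)

    for value in values:
        # Hash the value's repr to a 64-bit integer, then pick a slot.
        digest = hashlib.blake2b(repr(value).encode("utf-8"), digest_size=8).digest()
        slot = int.from_bytes(digest, "big") % num_bits
        bitmap[slot] = 1  # duplicates hit the same slot, so they add nothing

    empty = num_bits - sum(bitmap)
    if empty == 0:
        # Bitmap is saturated: the estimate is unbounded, so a larger
        # num_bits is needed for this input.
        return float("inf")

    # If n distinct values are spread uniformly over m slots, the expected
    # fraction of empty slots is about exp(-n / m); solve for n.
    return -num_bits * math.log(empty / num_bits)


if __name__ == "__main__":
    data = [i % 500 for i in range(10_000)]  # 10,000 rows, 500 distinct values
    print(round(estimate_distinct(data)))
```

The estimate is only reliable while the bitmap is far from full, so `num_bits` should be chosen on the order of the largest distinct count expected. No sorting and no per-value storage are needed, which is the property the quoted passages emphasize.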