Proceedings of the 17th ACM Conference on Information and Knowledge Management 2008
DOI: 10.1145/1458082.1458162
|View full text |Cite
|
Sign up to set email alerts
|

Clustered subset selection and its applications on it service metrics

Abstract: Motivated by the enormous amounts of data collected in a large IT service provider organization, this paper presents a method for quickly and automatically summarizing and extracting meaningful insights from the data. Termed Clustered Subset Selection (CSS), our method enables programguided data explorations of high-dimensional data matrices. CSS combines clustering and subset selection into a coherent and intuitive method for data analysis. In addition to a general framework, we introduce a family of CSS algo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
28
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(29 citation statements)
references
References 37 publications
1
28
0
Order By: Relevance
“…Boutsidis et al [6] use clustering in the setting of column subset selection problem. Column subset selection problems are important in large scale computations when the data matrix A is streaming in a way that it is impossible or impractical to store it entirely.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Boutsidis et al [6] use clustering in the setting of column subset selection problem. Column subset selection problems are important in large scale computations when the data matrix A is streaming in a way that it is impossible or impractical to store it entirely.…”
Section: Related Workmentioning
confidence: 99%
“…2 Ω, where p is a small oversampling parameter (typically set to [5][6][7][8][9][10]. Multiplying A with the random matrix Ω we obtain Y = AΩ.…”
Section: Introductionmentioning
confidence: 99%
“…5 Ω, where p is a small oversampling parameter (typically set to [5][6][7][8][9][10]. Multiplying A with the random matrix Ω, we obtain Y = AΩ.…”
Section: Preliminariesmentioning
confidence: 99%
“…Necessary and sufficient conditions are given in order to reconstruct the truncated SVD of the original data matrix A from the truncated SVDs of its block-column-wise partitioning. Boutsidis, Sun, and Anerousis [8] use clustering for the column subset selection problem when dealing with streaming matrix data. In a similar approach with data sampling, Zhang and Kwok [44] present a clustered Nyström method for large scale manifold learning applications, where the authors approximate the eigenfunctions of associated integral equation kernels.…”
Section: Principal Angles Assume We Have a Truncated Svd Approximatimentioning
confidence: 99%
See 1 more Smart Citation