2018
DOI: 10.1016/j.patrec.2018.07.003
|View full text |Cite
|
Sign up to set email alerts
|

Subspace clustering with the multivariate-t distribution

Abstract: Clustering procedures suitable for the analysis of very high-dimensional data are needed for many modern data sets. In model-based clustering, a method called high-dimensional data clustering (HDDC) uses a family of Gaussian mixture models for clustering. HDDC is based on the idea that high-dimensional data usually exists in lower-dimensional subspaces; as such, an intrinsic dimension for each sub-population of the observed data can be estimated and cluster analysis can be performed in this lower-dimensional s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 39 publications
0
3
0
Order By: Relevance
“…Clusters formed are mostly of spherical shape and include all samples hence maximal subspaces along with overlapping clusters and noisy points could not be found. An extended version of high dimensional data clustering using multivariate t-distribution is given by (Pesevski et al, 2018). The algorithm is evaluated on low dimension datasets i.e.…”
Section: Introductionmentioning
confidence: 99%
“…Clusters formed are mostly of spherical shape and include all samples hence maximal subspaces along with overlapping clusters and noisy points could not be found. An extended version of high dimensional data clustering using multivariate t-distribution is given by (Pesevski et al, 2018). The algorithm is evaluated on low dimension datasets i.e.…”
Section: Introductionmentioning
confidence: 99%
“…At present, certain research results have been obtained for data clustering methods in different fields. Literature [19] has analyzed a method named high-dimensional data clustering (HDDC), which uses a series of Gaussian mixture models for clustering. We estimate the intrinsic dimension of each subgroup of data observed and conducted clustering analysis in the low-dimensional subspace.…”
Section: Introductionmentioning
confidence: 99%
“…Early work on non-Gaussian mixtures was on mixtures of multivariate t-distributions (e.g., Peel and McLachlan, 2000). A little beyond the turn of the century, work on t-mixtures burgeoned into a substantial subfield of mixture model-based classification (e.g., McLachlan et al, 2007;McNicholas, 2011a,b, 2012;Baek and McLachlan, 2011;Steane et al, 2012;Lin et al, 2014;Pesevski et al, 2018). Around the same time, work on mixtures of skewed distributions took off, including work on skew-normal mixtures (e.g., Lin, 2009), skewt mixtures (e.g., Lin, 2010;McNicholas, 2012, 2014;Lee and McLachlan, 2013a,b;Murray et al, 2014), Laplace mixtures (e.g., Franczak et al, 2014), variance-gamma mixtures (McNicholas et al, 2017), generalized hyperbolic mixtures (Browne and McNicholas, 2015), and other non-elliptically contoured distributions (e.g., Karlis and Santourian, 2009;Murray et al, 2017;Tang et al, 2018).…”
Section: Introductionmentioning
confidence: 99%