“…In the multivariate case, the most popular extensions have been to inflate the tails by using multivariate t ‐kernels (McLachlan and Peel, 1998; Lee and McLachlan, 2016) and/or to generalize the Gaussian distribution to allow skewness (Azzalini and Dalla Valle, 1996; Arellano‐Valle and Azzalini, 2009). Motivated by the problem of more robust clustering, methods are also available for non‐parametrically estimating the kernels subject to unimodality (Rodríguez and Walker, 2014) and log‐concavity (Hu et al ., 2016) restrictions. Also, to improve robustness of clustering, one can use a mixture of mixtures model employing multiple kernels having similar location parameters to characterize the data within a cluster (Malsiner‐Walli et al ., 2017).…”