“…First of all, the quantitative parameters of the generated data and cluster structure are specified: the number of entities N, the number of generated clusters K*, and the number of variables M. In most publications, these are kept relatively small: N ranges from about 50 to 200, M is in many cases 2 and, anyway, not greater than 10, and K* is of the order of 3, 4 or 5 (see, for example, Casillas et al 2003, Chae et al 2006, Hand and Krzhanowski 2005, Hardy 1996, Kuncheva and Petrov 2005, McLachlan and Khan 2004, Milligan and Cooper 1985. Larger sizes appear in Feng and Hamerly (2006) (N= 4000, M is up to 16 and K*=20) and Steinley and Brusco (2007) (N is up to 5000, M=25, 50 and 125, and K* =5, 10, 20). Our choice of these parameters is based on the idea that the data should imitate the conditions of real-world data analysis, under the timing constraints of the computational capacity.…”