“…Obviously, releasing a perturbed (or even unperturbed) sample has lower disclosure risk than releasing the complete data set, because less information is released. However, as far as data utility is concerned, data-mining results based on a sample, even unperturbed, could be substantially different from those based on the complete set (Li and Jacob 2005). In terms of methodology, the approach proposed by Gouweleeuw et al (1998) works essentially on individual or blocks of attributes independently, therefore, "the precise effect on more complicated analyses, such as regression models, can be difficult to assess" (Fienberg and McIntyre 2004, p. 24).…”