2014
DOI: 10.29012/jpc.v6i1.637
|View full text |Cite
|
Sign up to set email alerts
|

Noise Multiplication for Statistical Disclosure Control of Extreme Values in Log-normal Regression Samples

Abstract: In this article multiplication of original data values by random noise is suggested as a disclosure control strategy when only the top part of the data is sensitive, as is often the case with income data. The proposed method can serve as an alternative to top coding which is a standard method in this context. Because the log-normal distribution usually fits income data well, the present investigation focuses exclusively on the log-normal. It is assumed that the log-scale mean of the sensitive variable is descr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(13 citation statements)
references
References 23 publications
(42 reference statements)
0
13
0
Order By: Relevance
“…The above criterion was also used by Klein et al (2014) to measure the level of privacy protection in the context of noise multiplication to protect extreme values, and it is similar to a criterion used by Lin and Wise (2012). Notice that if the probability (23) is small, then we would conclude that there is a high level of protection against disclosure; and if this probability is large, then we would conclude that there is a low level of protection against disclosure.…”
Section: Disclosure Risk Evaluation Of Singly Versus Multiply Imputedmentioning
confidence: 99%
See 2 more Smart Citations
“…The above criterion was also used by Klein et al (2014) to measure the level of privacy protection in the context of noise multiplication to protect extreme values, and it is similar to a criterion used by Lin and Wise (2012). Notice that if the probability (23) is small, then we would conclude that there is a high level of protection against disclosure; and if this probability is large, then we would conclude that there is a low level of protection against disclosure.…”
Section: Disclosure Risk Evaluation Of Singly Versus Multiply Imputedmentioning
confidence: 99%
“…In Subsection 6.2 we compare the disclosure risk of singly imputed partially synthetic data with that of multiply imputed partially synthetic data in the context of this CPS data example. These CPS data were previously used by Drechsler and Reiter (2010) and Reiter (2005a;c) for illustrating aspects of synthetic data methodology, and by Klein et al (2014) for illustrating methodology of noise multiplication for statistical disclosure control. While the entire data file contains household, family, and individual records, we focus only on the household records, as did Drechsler and Reiter (2010), Reiter (2005a;c), and Klein et al (2014).…”
Section: Empirical Evaluations Using Current Population Survey Datamentioning
confidence: 99%
See 1 more Smart Citation
“…It is not necessary for the realization ofŶ 1 to be exactly equal to y 1 -the degree of accuracy required for disclosure must be determined by the statistical agency. The following definition of disclosure risk we adopt is similar to that used by Lin and Wise (2012) and Klein et al(2014).…”
Section: Defining Disclosurementioning
confidence: 99%
“…Typically, some industries will be dominated by large businesses whose information is difficult to conceal by existing data masking methods. Non-perturbative data masking, such as top coding (Klein et al 2014), suppression (Salazar-González 2005) and micro-aggregation (Defays and Nanopoulos 1993), significantly reduce information of continuous data items such as turnover or profit, which are of key interest to data users. Perturbation methods, such as data swapping (Moore 1996), synthetic data (Rubin 1993) and noise addition (Kim and Winkler 1995), cannot efficiently protect businesses with distinct continuous-valued characteristics.…”
Section: Introductionmentioning
confidence: 99%