2012
DOI: 10.1016/j.ejor.2011.03.050
|View full text |Cite
|
Sign up to set email alerts
|

Recent advances in optimization techniques for statistical tabular data protection

Abstract: One of the main services of National Statistical Agencies (NSAs) for the current Information Society is the dissemination of large amounts of tabular data, which is obtained from microdata by crossing one or more categorical variables. NSAs must guarantee that no confidential individual information can be obtained from the released tabular data. Several statistical disclosure control methods are available for this purpose. These methods result in large linear, mixed integer linear, or quadratic mixed integer l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
37
0

Year Published

2012
2012
2020
2020

Publication Types

Select...
4
2
1

Relationship

3
4

Authors

Journals

citations
Cited by 25 publications
(38 citation statements)
references
References 33 publications
1
37
0
Order By: Relevance
“…since one seeks for the released values x that are closest (in the given norm) to the true values a, compatible with the relationships that a is known to have to satisfy, and protected according to (6). Of course, the disjunctive constraints (6) are the difficult part of the problem, their feasible region being nonconvex.…”
Section: Formulations Of the Cta Problemmentioning
confidence: 99%
See 1 more Smart Citation
“…since one seeks for the released values x that are closest (in the given norm) to the true values a, compatible with the relationships that a is known to have to satisfy, and protected according to (6). Of course, the disjunctive constraints (6) are the difficult part of the problem, their feasible region being nonconvex.…”
Section: Formulations Of the Cta Problemmentioning
confidence: 99%
“…This justifies the interest in statistical disclosure control, i.e., the set of techniques that can be deployed to protect sensitive information. In particular, the focus of this work is on tabular data protection; seminal work on this field can be found in [2], and the current state-of-the-art is described in the recent surveys of [25] and [6], as well as in the monographs [27,22]. Although tabular data provide aggregated information, the publication of some cells may jeopardize individual information.…”
Section: Introductionmentioning
confidence: 99%
“…For each cell, the table may report either the number of individuals (frequency tables) or information about another variable (magnitude tables). More details can be found in the recent survey [5] and the monographs [26,27]. Although cell tables report aggregated information for several respondents-so they could be considered anonymized-there is a risk of disclosing individual data.…”
Section: Introductionmentioning
confidence: 99%
“…These tables are obtained by crossing a particular categorical variable with a set of, say, h categorical variables that have a hierarchical relation; this results in a set of h two-dimensional tables with some common cells. For instance, Figure 2 (from [5]) illustrates a particular 1H2D table. The left subtable shows number of respondents for "region"×"profession"; the middle subtable is a "zoom in" of region R 2 , providing the number of respondents in municipalities of this region; finally the right subtable details the ZIP codes of municipality R 21 .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation