2010
DOI: 10.1007/978-3-642-16292-3_26
|View full text |Cite
|
Sign up to set email alerts
|

Towards Semantic Microaggregation of Categorical Data for Confidential Documents

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2011
2011
2019
2019

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 16 publications
(16 citation statements)
references
References 8 publications
0
16
0
Order By: Relevance
“…However, the accurate centroid calculus for non-numerical data is challenging due to the lack of semantic aggregation operators and the necessity of considering a discrete set of centroid values. Related works propose methods to compute centroids for non-numerical data either relying on the distributional features of data, where the centroid is the modal value [23], or on background semantic, where the centroid is the term that generalises all aggregated values in a taxonomy [24]. Since only one dimension of data (distribution or semantics) is considered, both approaches result in suboptimal results [25].…”
Section: The Centroid Of Categorical Valuesmentioning
confidence: 98%
See 2 more Smart Citations
“…However, the accurate centroid calculus for non-numerical data is challenging due to the lack of semantic aggregation operators and the necessity of considering a discrete set of centroid values. Related works propose methods to compute centroids for non-numerical data either relying on the distributional features of data, where the centroid is the modal value [23], or on background semantic, where the centroid is the term that generalises all aggregated values in a taxonomy [24]. Since only one dimension of data (distribution or semantics) is considered, both approaches result in suboptimal results [25].…”
Section: The Centroid Of Categorical Valuesmentioning
confidence: 98%
“…Since arithmetic functions cannot be applied to this kind of data, a straightforwardway to apply MDAV to categorical data consists on using Boolean equality/inequality operators [18,23] or to use the common abstraction of a set of values in an ontology as the centroid [24].…”
Section: Semantic Microaggregationmentioning
confidence: 99%
See 1 more Smart Citation
“…In [1] authors use the WordNet structured thesaurus [37] as ontology to assist the classification and masking of confidential textual documents. WordNet models and semantically interlinks more than 100,000 concepts referred by means of English textual labels.…”
Section: Related Workmentioning
confidence: 99%
“…Hence, when constructing the centroid of a dataset with textual attributes, the similarities between their meanings (evaluated at a conceptual level) should be taken into consideration (e.g., for hobbies attribute, ''trekking'' value is more similar to ''jogging'' than to ''classical dance''). Related works constructing centroids for textual attribute values typically omit [16,46] or slightly consider [1,22] data semantics during their analysis, hampering the quality of the results [6,33].…”
Section: Introductionmentioning
confidence: 99%