2019
DOI: 10.1007/978-3-030-17138-4_6

Generalised Differential Privacy for Text Document Processing

Abstract: We address the problem of how to "obfuscate" texts by removing stylistic clues which can identify authorship, whilst preserving (as much as possible) the content of the text. In this paper we combine ideas from "generalised differential privacy" and machine learning techniques for text processing to model privacy for text documents. We define a privacy mechanism that operates at the level of text documents represented as "bags-of-words" - these representations are typical in machine learning and contain suffici…

Cited by 74 publications (86 citation statements)
References 40 publications
“…Our d_χ-privacy algorithm is similar to the model introduced by [29] for privacy preserving text analysis, and [30] for author obfuscation. The algorithms are all analogous to that originally proposed by [13] and we describe it here using the Euclidean distance for word embedding vectors.…”
Section: The Privacy Mechanism In Euclidean Space (mentioning)
confidence: 99%
“…Furthermore, for Euclidean models such as [29], [30], the utility degrades badly as the privacy guarantees increase. This is because the noise injected (line 4 of Alg.…”
Section: The Case For Hyperbolic Space (mentioning)
confidence: 99%
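
The citation statements above describe a privacy mechanism that perturbs word embedding vectors under the Euclidean distance, and note that utility degrades as the privacy guarantee is strengthened. The following is a minimal sketch of one way such a mechanism could look, not the cited authors' algorithm. It assumes noise with density proportional to exp(-ε‖z‖), drawn as a uniform direction scaled by a Gamma-distributed radius, followed by a nearest-neighbour lookup. The function names, the toy vocabulary, and the 2-dimensional embeddings are all hypothetical.

```python
import numpy as np


def sample_noise(dim, epsilon, rng):
    """Draw noise with density proportional to exp(-epsilon * ||z||).

    Assumption: direction uniform on the unit sphere, radius drawn from
    Gamma(shape=dim, scale=1/epsilon).
    """
    direction = rng.standard_normal(dim)
    direction /= np.linalg.norm(direction)
    radius = rng.gamma(shape=dim, scale=1.0 / epsilon)
    return radius * direction


def privatize_word(word, vocab, embeddings, epsilon, rng):
    """Perturb a word's embedding and return the nearest vocabulary word."""
    idx = vocab.index(word)
    noisy = embeddings[idx] + sample_noise(embeddings.shape[1], epsilon, rng)
    # Euclidean distance from the noisy point to every vocabulary vector.
    distances = np.linalg.norm(embeddings - noisy, axis=1)
    return vocab[int(np.argmin(distances))]


# Hypothetical toy vocabulary and embeddings, for illustration only.
rng = np.random.default_rng(0)
vocab = ["cat", "dog", "car", "bus"]
embeddings = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])

for eps in (0.5, 5.0, 50.0):
    print(eps, privatize_word("cat", vocab, embeddings, eps, rng))
```

Under this sampling scheme the expected noise norm is dim/ε, so a smaller ε (a stronger guarantee) pushes the perturbed vector further from the original word. This is consistent with the utility degradation the third statement attributes to the injected noise.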