2009
DOI: 10.1007/978-3-642-04174-7_47
|View full text |Cite
|
Sign up to set email alerts
|

Protecting Sensitive Topics in Text Documents with PROTEXTOR

Abstract: Abstract. This is a demonstration of a system for protecting sensitive topics present in text documents. Our system works in a privacy framework where the topic is characterized as a multiclass classification problem in a generative setting. We show how our system helps a user redact a document in a business setting to obscure what company the text pertains to, and show some experimental results on redacting the topic for a standard text classification data set.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2014
2014
2014
2014

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 3 publications
0
1
0
Order By: Relevance
“…In this paper, we will call these words sensitive words. Even though there are solutions that partially automate the redaction process [18,15,52], sensitive information is still leaked as the recent case with Transportation Security Administration shows, where sensitive airport screening procedures were leaked as a result of the failure to properly redact documents [61]. As difficult as it is to redact plain text documents, there are no solutions for redacting sensitive information in software artifacts.…”
Section: Sensitive Information In Software Artifactsmentioning
confidence: 99%
“…In this paper, we will call these words sensitive words. Even though there are solutions that partially automate the redaction process [18,15,52], sensitive information is still leaked as the recent case with Transportation Security Administration shows, where sensitive airport screening procedures were leaked as a result of the failure to properly redact documents [61]. As difficult as it is to redact plain text documents, there are no solutions for redacting sensitive information in software artifacts.…”
Section: Sensitive Information In Software Artifactsmentioning
confidence: 99%