RelDiff: Enriching Knowledge Graph Relation Representations for Sensitivity Classification

Narvala, Hitarth; McDonald, Graham; Ounis, Iadh

doi:10.18653/v1/2021.findings-emnlp.311

Cited by 2 publications

(3 citation statements)

References 26 publications

“…In particular, we first describe: (1) the document collection used in the studies and for training clustering approaches, (2) the specific clustering approaches that we evaluate, (3) selection of the appropriate number of clusters in the collection. Sensitivity Collection: To train the clustering approaches we use a collection (GovSensitivity [16]) of 3801 government documents (502 sensitive) that are annotated at document-level and sentence-level by government sensitivity reviewers for two FOI sensitivities, i.e, "Personal Information" and "International Relations". In the user studies we use passages of the documents instead of the documents itself to reduce the complexity in reviewing large documents.…”

Section: Preliminary Setup (mentioning)

confidence: 99%

“…We deployed an SVM text classification approach as described in [13] to classify the documents as either sensitive or non-sensitive. To train the classifier, we used a 5-fold cross validation with stratified samples of the GovSensitivity collection as described in [16]. The effectiveness of the learned classifier was 0.733 BAC.…”

Section: User Study #2: Review Openness (mentioning)

confidence: 99%

“…On a collection of government documents with real sensitivities (GovSensitivity [16]), our user studies show that reviewing documents in semantic clusters can significantly improve the reviewing speed by 15.65% (T-Test, 𝑝 < 0.05). Furthermore, we show that our proposed review prioritisation strategy that leverages document metadata attributes for ranking clusters with finer grained sensitivity proportions can significantly improve the hourly openness by 37.99% and openness as a function of time (area under the curve) by 23.78% (T-Test, 𝑝 < 0.05).…”

Section: Introduction (mentioning)

confidence: 96%
