Proceedings of the 35th Annual ACM Symposium on Applied Computing 2020
DOI: 10.1145/3341105.3375774
|View full text |Cite
|
Sign up to set email alerts
|

Using natural language processing to detect privacy violations in online contracts

Abstract: As information systems deal with contracts and documents in essential services, there is a lack of mechanisms to help organizations in protecting the involved data subjects. In this paper, we evaluate the use of named entity recognition as a way to identify, monitor and validate personally identifiable information. In our experiments, we use three of the most well-known Natural Language Processing tools (NLTK, Stanford CoreNLP, and spaCy). First, the effectiveness of the tools is evaluated in a generic dataset… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 7 publications
0
6
0
Order By: Relevance
“…Through the application of named entity recognition (NER) algorithms, machine learning models can precisely categorize and label con dential details within both generic and context-speci c datasets [70] [71]. This strategy functions as a Privacy Enhancing Technology (PET), automatically obscuring sensitive data and upholding the right to privacy [72]. The utilization of NLP tools, such as NLTK, Stanford CoreNLP, and spaCy, in conjunction with machine learning models, has yielded favorable outcomes in the accurate classi cation of PII within contracts and documents [73][74].…”
Section: Arti Cial Intelligence In Smart Contracts For Safeguardingmentioning
confidence: 99%
“…Through the application of named entity recognition (NER) algorithms, machine learning models can precisely categorize and label con dential details within both generic and context-speci c datasets [70] [71]. This strategy functions as a Privacy Enhancing Technology (PET), automatically obscuring sensitive data and upholding the right to privacy [72]. The utilization of NLP tools, such as NLTK, Stanford CoreNLP, and spaCy, in conjunction with machine learning models, has yielded favorable outcomes in the accurate classi cation of PII within contracts and documents [73][74].…”
Section: Arti Cial Intelligence In Smart Contracts For Safeguardingmentioning
confidence: 99%
“…The paper titled "Using Natural Language Processing to Detect Privacy Violations in Online Contracts" by P. Silva et.al. [35] and another paper titled "Extractive Automatic Text Summarization using SpaCy in Python & NLP" [36] made comparison between spaCy, coreNLP and NLTK and came to conclusion that coreNLP has the best performance in terms of precision, recall and F1 score followed by spaCy and lastly NLTK. SpaCy is supported in number of programming languages like "R", "ruby", "cpp", "java script", ".net", "python", etc.…”
Section: B Nltk Vs Spacymentioning
confidence: 99%
“…NLP can be used to extract and process clinical data or information for both structured and unstructured forms. NLP also could be used for patient's classification and supporting critical clinical tasks like clinical decisionmaking and producing quality reports [10] . A clinical decision-making system can be developed using patient's behavior toward a product, medicine, or treatment using NLP techniques.…”
Section: Nlp Applicationsmentioning
confidence: 99%