2019
DOI: 10.48550/arxiv.1902.08939
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Text Analysis in Adversarial Settings: Does Deception Leave a Stylistic Trace?

Abstract: Textual deception constitutes a major problem for online security. Many studies have argued that deceptiveness leaves traces in writing style, which could be detected using text classification techniques. By conducting an extensive literature review of existing empirical work, we demonstrate that while certain linguistic features have been indicative of deception in certain corpora, they fail to generalize across divergent semantic domains. We suggest that deceptiveness as such leaves no content-invariant styl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2019
2019

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 87 publications
(194 reference statements)
0
2
0
Order By: Relevance
“…In recent years, the new research field author obfuscation (AO) evolved, which concerns itself with the task to fool AA or AV methods in a way that the true author cannot be correctly recognized anymore. To achieve this, AO approaches which, according to Gröndahl and Asokan [7] can be divided into manual, computer-assisted and automatic types, perform a variety of modifications on the texts. These include simple synonym replacements, rule-based substitutions or word order permutations.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In recent years, the new research field author obfuscation (AO) evolved, which concerns itself with the task to fool AA or AV methods in a way that the true author cannot be correctly recognized anymore. To achieve this, AO approaches which, according to Gröndahl and Asokan [7] can be divided into manual, computer-assisted and automatic types, perform a variety of modifications on the texts. These include simple synonym replacements, rule-based substitutions or word order permutations.…”
Section: Related Workmentioning
confidence: 99%
“…As a first corpus, we compiled C DBLP that represents a collection of 80 excerpts from scientific works including papers, dissertations, book chapters and technical reports, which we have chosen from the well-known Digital Bibliography & Library Project (DBLP) platform 7 . Overall, the documents 8 were written by 40 researchers, where for each author A, there are exactly two documents.…”
Section: Dblp Corpusmentioning
confidence: 99%