Authorship Attribution for Neural Text Generation

Uchendu, Adaku; Le, Thai; Shu, Kai; Lee, Dongwon

doi:10.18653/v1/2020.emnlp-main.673

Cited by 56 publications (70 citation statements)

References 30 publications (24 reference statements)


“…They show that word order does not matter much as a bag-of-words detector performs very similar to detectors based on complex encoder (e.g., transformer). This result is consistent with the recent work done by Uchendu et al, (2020), which shows that simple models (traditional ML models trained on psychological features and simple neural network architectures) perform well in three settings: (i) classify if two given articles are generated by the same TGM; (ii) classify if a given article is written by a human or a TGM (the original detection problem); (iii) identify the TGM that generated a given article (similar to Tay et al, (2020)). For the original detection problem, the authors find that the text generated by the GPT-2 model to be hard to detect among several TGMs (see Appendix for the list of studied TGMs).…”

Section: Classifiers Trained From Scratch (supporting)

confidence: 94%
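To make the bag-of-words point in the statement above concrete, here is a minimal sketch of a detector that ignores word order entirely. It is an illustration only: the multinomial naive Bayes scoring, the class name `BagOfWordsDetector`, and the four toy training sentences are assumptions made for this example, not the models or data used by Uchendu et al. or by the citing survey.

```python
import math
from collections import Counter


def tokenize(text):
    # Lowercased whitespace tokens; a deliberately crude, hypothetical tokenizer.
    return text.lower().split()


class BagOfWordsDetector:
    """Multinomial naive Bayes over unordered word counts (illustrative only)."""

    def fit(self, texts, labels):
        self.labels = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.word_counts = {label: Counter() for label in self.labels}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = set()
        for counts in self.word_counts.values():
            self.vocab.update(counts)
        return self

    def _log_score(self, tokens, label):
        total_words = sum(self.word_counts[label].values())
        total_docs = sum(self.doc_counts.values())
        terms = [math.log(self.doc_counts[label] / total_docs)]  # class prior
        for token in tokens:
            # Add-one smoothing so unseen words do not zero out the score.
            count = self.word_counts[label][token] + 1
            terms.append(math.log(count / (total_words + len(self.vocab))))
        # fsum is exactly rounded, so the score does not depend on token order.
        return math.fsum(terms)

    def predict(self, text):
        tokens = tokenize(text)
        return max(self.labels, key=lambda label: self._log_score(tokens, label))


# Toy, made-up training data: two "human" and two "machine" snippets.
train_texts = [
    "the senator said the bill would pass on tuesday",
    "officials confirmed the storm damaged several homes",
    "the the model model generates generates text text",
    "text generates repeated repeated phrases phrases phrases",
]
train_labels = ["human", "human", "machine", "machine"]

detector = BagOfWordsDetector().fit(train_texts, train_labels)
print(detector.predict("repeated phrases text"))
print(detector.predict("text phrases repeated"))  # same words, different order
```

Because the score is a sum over individual tokens, shuffling the words of an input cannot change the prediction, which is the sense in which such a detector disregards word order. The same class handles the three settings listed in the statement only in the simplest way: with labels set to generator names instead of "human" and "machine", it becomes a toy generator-identification classifier.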

“…TGMs can also be used to generate text that approximately matches the style of human language, which benefits applications such as story generation (Fan et al, 2018), conversational response generation (Zhang et al, 2020), code auto-completion (TabNine, 2020), and radiology report generation (Liu et al, 2019a). Malicious usage: TGMs can have unfortunate uses by (even low-skilled) adversaries for malicious purposes, such as fake news generation (Zellers et al, 2019;Brown et al, 2020;Uchendu et al, 2020), fake product reviews generation (Adelani et al, 2020), and spamming/phishing (Weiss, 2019). Humans can spot fake news articles (Brown et al, 2020), fake product reviews (Adelani et al, 2020), and fake comments (Weiss, 2019) generated by TGM only at chance level.…”

Section: Social Impacts of TGMs (mentioning)

confidence: 99%

“…The RoBERTa detector also outperforms existing detectors in spotting news articles generated by several TGMs (Uchendu et al, 2020) and product reviews generated by the GPT-2 model fine-tuned on Amazon product reviews (Adelani et al, 2020).…”

Section: Fine-tuning NLM (mentioning)

confidence: 99%

“…TGMs are useful in a wide variety of applications, including story generation (Fan et al, 2018), conversational response generation (Zhang et al, 2020), code auto-completion (Solaiman et al, 2019), and radiology report generation (Liu et al, 2019a). However, TGMs can also be misused for fake news generation (Zellers et al, 2019;Brown et al, 2020;Uchendu et al, 2020), fake product reviews generation (Adelani et al, 2020), and spamming/phishing (Weiss, 2019).…”

Section: Introduction (mentioning)

confidence: 99%

“…The classifier, henceforth called detector, can be used to automatically remove machine generated text from online platforms such as social media, e-commerce, email clients, and government forums, when the intention of the TGM generated text is abuse. An ideal detector should be: (i) accurate, that is, good accuracy with a good trade-off for false positives and false negatives depending on the online platform (email client, social media) on which TGM is applied (Solaiman et al, 2019); (ii) data-efficient, that is, needs as few examples as possible from the TGM used by the attacker (Zellers et al, 2019); (iii) generalizable, that is, detects text generated by different modeling choices of the TGM used by the attacker such as model architecture, TGM training data, TGM conditioning prompt length, model size, and text decoding method (Solaiman et al, 2019;Bakhtin et al, 2020;Uchendu et al, 2020); and (iv) interpretable, that is, detector decisions need to be understandable to humans (Gehrmann et al, 2019); and (v) robust, that is, detector can handle adversarial examples (Wolff, 2020). Given the importance of this problem, there has been a flurry of research recently from both NLP and ML communities on building useful detectors.…”

Section: Introduction (mentioning)

confidence: 99%


Automatic Detection of Machine Generated Text: A Critical Survey

Jawahar, Abdul-Mageed, Lakshmanan

2020

Proceedings of the 28th International Conference on Computational Linguistics


Text generative models (TGMs) excel in producing text that matches the style of human language reasonably well. Such TGMs can be misused by adversaries, e.g., by automatically generating fake news and fake product reviews that can look authentic and fool humans. Detectors that can distinguish text generated by TGM from human written text play a vital role in mitigating such misuse of TGMs. Recently, there has been a flurry of works from both natural language processing (NLP) and machine learning (ML) communities to build accurate detectors for English. Despite the importance of this problem, there is currently no work that surveys this fast-growing literature and introduces newcomers to important research challenges. In this work, we fill this void by providing a critical survey and review of this literature to facilitate a comprehensive understanding of this problem. We conduct an in-depth error analysis of the state-of-the-art detector and discuss research directions to guide future work in this exciting area.

