Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2021
DOI: 10.18653/v1/2021.emnlp-main.70
|View full text |Cite
|
Sign up to set email alerts
|

Learning Universal Authorship Representations

Abstract: Determining whether two documents were composed by the same author, also known as authorship verification, has traditionally been tackled using statistical methods. Recently, authorship representations learned using neural networks have been found to outperform alternatives, particularly in large-scale settings involving hundreds of thousands of authors. But do such representations learned in a particular domain transfer to other domains? Or are these representations inherently entangled with domain-specific f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
2
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 13 publications
0
2
0
Order By: Relevance
“…To evaluate authorship style transfer, we adopt the Confusion metric from the evaluation framework defined by Patel, Andrews, and Callison-Burch (2022), where the authors utilize pretrained style embedders (Wegmann, Schraagen, and Nguyen 2022;Rivera-Soto et al 2021) to measure style transfer success. Confusion, which is similar to style transfer accuracy, is the percentage of the time that the style transfer output is closer to the target author than the source author in representational embedding space.…”
Section: Authorship Style Transfermentioning
confidence: 99%
See 1 more Smart Citation
“…To evaluate authorship style transfer, we adopt the Confusion metric from the evaluation framework defined by Patel, Andrews, and Callison-Burch (2022), where the authors utilize pretrained style embedders (Wegmann, Schraagen, and Nguyen 2022;Rivera-Soto et al 2021) to measure style transfer success. Confusion, which is similar to style transfer accuracy, is the percentage of the time that the style transfer output is closer to the target author than the source author in representational embedding space.…”
Section: Authorship Style Transfermentioning
confidence: 99%
“…We compute the above metrics for both Style Embeddings (Wegmann, Schraagen, and Nguyen 2022) and Universal Authorship Representations (UAR) (Rivera-Soto et al 2021). Similar to our external style classifier for attribute transfer, UAR provides a holdout embedding space that PARAGUIDE does not directly optimize at inference time.…”
Section: Authorship Style Transfermentioning
confidence: 99%
“…The AAVC is a widely recognized corpus specifically designed for authorship verification studies, as evidenced by its utilization in various studies (Boenninghoff et al 2019;Halvani et al 2020;Ishihara 2023;Rivera-Soto et al 2021). Certain aspects of the data, such as genre and document length, are well-controlled.…”
Section: Databasementioning
confidence: 99%
“…These approaches encompass TF-IDF-based clustering and classification techniques (Agarwal et al, 2019;İzzet Bozkurt et al, 2007), conventional convolutional neural networks (CNNs) (Rhodes, 2015;Shrestha et al, 2017), recurrent neural networks (RNNs) (Zhao et al, 2018;Jafariakinabad et al, 2019;Gupta et al, 2019), and contextualized transformers (Fabien et al, 2020a;Ordoñez et al, 2020;Uchendu et al, 2020;Barlas and Stamatatos, 2021). Moreover, researchers have recently demonstrated the effectiveness of contrastive learning approaches (Gao et al, 2022) for authorship tasks (Rivera-Soto et al, 2021;Ai et al, 2022). These advancements have led to applications in style representational approaches (Hay et al, 2020;Zhu and Jurgens, 2021;Wegmann et al, 2022), which currently represent the state-of-the-art (SOTA) for authorship tasks.…”
Section: Authorship Attribution In Nlpmentioning
confidence: 99%