Unary and Binary Classification Approaches and their Implications for Authorship Verification

Halvani, Oren; Winter, Christian; Graner, Lukas

doi:10.48550/arxiv.1901.00399

Cited by 2 publications

(3 citation statements)

References 20 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In the past two decades, researchers from different disciplines, including linguistics, psychology, computer science, and mathematics, proposed a range of techniques and concepts for this task [181][182][183][184]. Stylometric learning is a computational approach for many linguistic tasks, such as authorship verification [185].…”

Section: Review On Authorship Verification Methodsmentioning

confidence: 99%

Authorship Attribution Methods, Challenges, and Future Research Directions: A Comprehensive Survey

He,

Lashkari,

Vombatkere

et al. 2024

Information

View full text Add to dashboard Cite

Over the past few decades, researchers have put their effort and paid significant attention to the authorship attribution field, as it plays an important role in software forensics analysis, plagiarism detection, security attack detection, and protection of trade secrets, patent claims, copyright infringement, or cases of software theft. It helps new researchers understand the state-of-the-art works on authorship attribution methods, identify and examine the emerging methods for authorship attribution, and discuss their key concepts, associated challenges, and potential future work that could help newcomers in this field. This paper comprehensively surveys authorship attribution methods and their key classifications, used feature types, available datasets, model evaluation criteria and metrics, and challenges and limitations. In addition, we discuss the potential future research directions of the authorship attribution field based on the insights and lessons learned from this survey work.

show abstract

Section: Review On Authorship Verification Methodsmentioning

confidence: 99%

Authorship Attribution Methods, Challenges, and Future Research Directions: A Comprehensive Survey

He,

Lashkari,

Vombatkere

et al. 2024

Information

View full text Add to dashboard Cite

show abstract

“…As a general observation, even in later challenges, SVMs have proven to be the most effective for AA tasks (Kestemont et al, 2019). More specifically, in a survey of freely available AA systems, GLAD showed best performance and especially high adaptability to new datasets (Halvani et al, 2018). Lastly, de Vries (2020) has explored fine-tuning a pre-trained model for AV in Dutch, a less-resourced language compared to English.…”

Section: Modelmentioning

confidence: 99%

“…The GLAD system of Hürlimann et al (2015) was specifically developed to solve AV problems, and has been shown to be highly adaptable to new datasets (Halvani et al, 2018). GLAD uses an SVM with a variety of features including character level ones, which have proved to be most effective for AA tasks (Stamatatos, 2009;Moreau et al, 2015;Hürlimann et al, 2015), and is freely available.…”

Section: Introductionmentioning

confidence: 99%

Datasets and Models for Authorship Attribution on Italian Personal Writings

Ruggiero¹,

Gatt²,

Nissim³

2020

Preprint

View full text Add to dashboard Cite

Existing research on Authorship Attribution (AA) focuses on texts for which a lot of data is available (e.g novels), mainly in English. We approach AA via Authorship Verification on short Italian texts in two novel datasets, and analyze the interaction between genre, topic, gender and length. Results show that AV is feasible even with little data, but more evidence helps. Gender and topic can be indicative clues, and if not controlled for, they might overtake more specific aspects of personal style.

show abstract

Unary and Binary Classification Approaches and their Implications for Authorship Verification

Cited by 2 publications

References 20 publications

Authorship Attribution Methods, Challenges, and Future Research Directions: A Comprehensive Survey

Authorship Attribution Methods, Challenges, and Future Research Directions: A Comprehensive Survey

Datasets and Models for Authorship Attribution on Italian Personal Writings

Contact Info

Product

Resources

About