2010
DOI: 10.1007/s10579-009-9111-2
|View full text |Cite
|
Sign up to set email alerts
|

Authorship attribution in the wild

Abstract: Most previous work on authorship attribution has focused on the case in which we need to attribute an anonymous document to one of a small set of candidate authors. In this paper, we consider authorship attribution as found in the wild: the set of known candidates is extremely large (possibly many thousands) and might not even include the actual author. Moreover, the known texts and the anonymous texts might be of limited length. We show that even in these difficult cases, we can use similarity-based methods a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
121
1
3

Year Published

2012
2012
2017
2017

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 190 publications
(131 citation statements)
references
References 15 publications
1
121
1
3
Order By: Relevance
“…Our ensemble-based algorithm is a variation of the method presented by Koppel et al [8] for the task of author identification. In the original approach, there is only one training example for each author and a number of simple classifiers is learned based on random feature subspacing.…”
Section: Ensemble-based Algorithmmentioning
confidence: 99%
See 3 more Smart Citations
“…Our ensemble-based algorithm is a variation of the method presented by Koppel et al [8] for the task of author identification. In the original approach, there is only one training example for each author and a number of simple classifiers is learned based on random feature subspacing.…”
Section: Ensemble-based Algorithmmentioning
confidence: 99%
“…It is possible to use this algorithm in combination with any text similarity measure. The cosine distance has provided good results in the experiments of [8] and is also used in this study.…”
Section: Ensemble-based Algorithmmentioning
confidence: 99%
See 2 more Smart Citations
“…We discussed the state of the art of the authorship attribution problem in our previous work on application of syntactic n-grams for authorship attribution [1]; also see various related works on authorship attribution [7,8,9,10], among many others. Here we will just briefly state the problem.…”
Section: Authorship Attribution Problemmentioning
confidence: 99%