2020
DOI: 10.1109/access.2020.2967449

$CAG$: Stylometric Authorship Attribution of Multi-Author Documents Using a Co-Authorship Graph

Abstract: Stylometry has been successfully applied to perform authorship identification of single-author documents (AISD). The AISD task is concerned with identifying the original author of an anonymous document from a group of candidate authors. However, AISD techniques are not applicable to the authorship identification of multi-author documents (AIMD). Unlike AISD, where each document is written by one single author, AIMD focuses on handling multi-author documents. Due to the combinatoric nature of documents, AIMD la…

Cited by 18 publications (7 citation statements)
References 52 publications (113 reference statements)
“…Sarwar et al. [23] developed a multi-authorship classification system that achieved 76.92% accuracy for 1,360 text documents. Their classification system depends on co-author information.…”
Section: A Non-bengali Language-based Authorship Classification (mentioning)
confidence: 99%
“…• Type-token ratio: the ratio of the total number of unique tokens to the total number of tokens: $\mathrm{uniq}(N_{i,\mathrm{tokens}}) / N_{i,\mathrm{tokens}}$ (11), where $N_{i,\mathrm{tokens}}$ and $\mathrm{uniq}(N_{i,\mathrm{tokens}})$ are the total number of tokens and the total number of unique tokens in text $x_{i,d}$, respectively. A token is a general term that could refer, for example, to a word, a number, or a punctuation mark.…”
Section: A Vocabulary Richness (mentioning)
confidence: 99%
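The type-token ratio quoted above can be sketched in a few lines of Python. This is a minimal illustration, not the cited paper's implementation: the function name `type_token_ratio`, the regex tokenizer (words, numbers, and single punctuation marks as tokens), and the lowercasing step are all assumptions made for the example.

```python
import re


def type_token_ratio(text: str) -> float:
    """Return (number of unique tokens) / (total number of tokens) for `text`.

    Returns 0.0 for text with no tokens, to avoid dividing by zero.
    """
    # Assumed tokenizer: a token is a run of word characters (words, numbers)
    # or a single punctuation mark. Lowercasing is also an assumption.
    tokens = re.findall(r"\w+|[^\w\s]", text.lower())
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


if __name__ == "__main__":
    # 7 tokens ("the" appears twice, plus the final "."), 6 of them unique.
    print(type_token_ratio("The cat sat on the mat."))
```

A different tokenizer or case policy changes the ratio, so the value is only comparable across texts processed the same way.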
“…In this case, $x_{i,d}[g]$ denotes the shape of the $g$-th word in text $x_{i,d}$. Example grams: [11] = "sss".…”
Section: B Classical N-grams (mentioning)
confidence: 99%
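As a rough illustration of n-grams over word shapes, the sketch below maps each word to a shape string and counts consecutive shape tuples. The quoted statement does not define the shape alphabet, so the mapping used here (uppercase to `X`, lowercase to `x`, digit to `d`, anything else kept) is a common convention chosen as an assumption, and `word_shape` and `shape_ngrams` are hypothetical names, not functions from the cited work.

```python
from collections import Counter


def word_shape(word: str) -> str:
    """Map a word to a shape string (assumed scheme, not the paper's)."""
    shape = []
    for ch in word:
        if ch.isupper():
            shape.append("X")
        elif ch.islower():
            shape.append("x")
        elif ch.isdigit():
            shape.append("d")
        else:
            shape.append(ch)  # keep punctuation and other symbols as-is
    return "".join(shape)


def shape_ngrams(text: str, n: int = 2) -> Counter:
    """Count n-grams over the sequence of word shapes in `text`.

    Words are split on whitespace (an assumption made for this sketch).
    """
    shapes = [word_shape(w) for w in text.split()]
    return Counter(tuple(shapes[i:i + n]) for i in range(len(shapes) - n + 1))


if __name__ == "__main__":
    print(word_shape("Hello"))
    print(shape_ngrams("Hi there Bob", n=2))
```

The resulting counts could serve as stylometric features, although the feature set of the cited paper may be built differently.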
“…Consequently, a huge amount of user-generated content (UGC) such as blog posts, product reviews, articles and novels is continuously being generated by non-native writers [9,30]. Therefore, performing NLI with UGC can be useful in several areas such as forensic linguistics, author profiling and authorship identification [9,18,29,30,34,37,38]. For example, in the context of forensic linguistics, a juncture where linguistic stylistics and the legal system intersect [23], NLI can be considered a useful tool to provide evidence regarding the linguistic background of an author.…”
Section: Introduction (mentioning)
confidence: 99%