2014
DOI: 10.1016/j.procs.2014.08.201

Influence of Data Discretization on Efficiency of Bayesian Classifier for Authorship Attribution

Cited by 20 publications (8 citation statements)
References 19 publications
“…They have been shown to be effective for capturing the differences in writing styles [Koppel et al 2002]. Usually the frequency value of the function words are used to represent the features [Baron 2014;Halvani and Steinebach 2014;HaCohen-Kerner and Margaliot 2014]. They are effective for identifying the first language of the authors [Torney et al 2012;Argamon et al 2009], identifying the actual author of French literature [Boukhaled and Ganascia 2014], and characterizing the gender of e-mails [Corney et al 2002].…”
Section: Joint Learning Model For Topical Modality and Lexical Modality
Citation type: mentioning
confidence: 99%
“…Syntactic features are usually considered as deep linguistic features that are comparatively more difficult to consciously manipulate [Gamon 2004]. Typically, the Part-of-Speech (POS) tags n-grams are used as the features for this modality [Boukhaled and Ganascia 2014;Baron 2014;Qian et al 2014]. Given a token t_b, its Part-of-Speech (POS) tag represents its grammatical role in the sentence.…”
Section: The Syntactic Modality
Citation type: mentioning
confidence: 99%
“…Two main circumstances can be mentioned, where discretization may or even must be applied. The first situation is when there are some suspicions about possible improvement of a decision system quality when discretized data is applied [2]. The second one is when method or algorithm employed in decision system can operate only on nominal, discrete data.…”
Section: Theoretical Background
Citation type: mentioning
confidence: 99%
“…The main idea behind AISD is to identify the true author of a disputed document from a set of candidate authors. Existing studies of AISD have reported good results [17,3]. However, as already explained in the introduction section, existing authorship identification techniques designed to handle single-author documents are inapplicable to multi-author documents [5].…”
Section: Authorship Identification
Citation type: mentioning
confidence: 99%