2007
DOI: 10.1145/1187415.1187416
|View full text |Cite
|
Sign up to set email alerts
|

Author verification by linguistic profiling

Abstract: This article explores the effects of parameter settings in linguistic profiling, a technique in which large numbers of counts of linguistic features are used as a text profile which can then be compared to average profiles for groups of texts. Although the technique proves to be quite effective for authorship verification, with the best overall parameter settings yielding an equal error rate of 3% on a test corpus of student essays, the optimal parameters vary greatly depending on author and evaluation criteri… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0
1

Year Published

2007
2007
2022
2022

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 64 publications
(10 citation statements)
references
References 4 publications
0
9
0
1
Order By: Relevance
“…In one implementation of the hybrid sampling approach, all training text samples for an author are handled separately, as in the instance-based approach. All samples from each author are then combined as an average of the feature vectors to produce a single profile vector [39,40]. In the implementation of another hybrid sampling approach, the reverse process of the previous order was applied; the profile sample is first produced by combining all of the training samples for each author and is then divided to obtain segments of the same size [4,41].…”
Section: The Proposed Bbm-assisted 2d-av Systemmentioning
confidence: 99%
“…In one implementation of the hybrid sampling approach, all training text samples for an author are handled separately, as in the instance-based approach. All samples from each author are then combined as an average of the feature vectors to produce a single profile vector [39,40]. In the implementation of another hybrid sampling approach, the reverse process of the previous order was applied; the profile sample is first produced by combining all of the training samples for each author and is then divided to obtain segments of the same size [4,41].…”
Section: The Proposed Bbm-assisted 2d-av Systemmentioning
confidence: 99%
“…In a verification problem (see above) one is given writing examples of an author A, and one is asked to verify whether or not a document d of unknown authorship in fact is written by A. Recent contributions to the authorship attribution problem include (Rudman 1997;Stamatatos 2001Stamatatos , 2007Stamatatos , 2009Chaski 2005;Juola 2006;Malyutov 2006;Sanderson and Guenter 2006b); the authorship verification problem is addressed in Koppel and Schler (2004b), van Halteren (2004van Halteren ( , 2007, Meyer zu Eissen and Stein (2006Stein ( , 2007, Koppel et al (2007), , Stein et al 2008 andPavelec et al (2008).…”
Section: Existing Researchmentioning
confidence: 99%
“…As a method capable of capturing language learners' individual differences in their performance, linguistic profiling has been more frequently applied in studies related to language learning. According to Halteren (2007), the concept of profiling focuses on linguistic features, the statistical calculation of which could assist researchers in looking for information underlying the text.…”
Section: Introductionmentioning
confidence: 99%