Research and Development in Intelligent Systems XXV 2009
DOI: 10.1007/978-1-84882-171-2_29

Universum Inference and Corpus Homogeneity

Abstract: Universum Inference is re-interpreted for the assessment of corpus homogeneity in computational stylometry. Recent stylometric research quantifies the strength of characterization within dramatic works by assessing the homogeneity of the corpora associated with dramatic personas. A methodological advance is suggested to reduce the risk that an assessment of homogeneity is achieved by chance. A baseline comparison analysis is constructed for contributions to debates by nonfictional participants: the corpu…

Cited by 3 publications (5 citation statements). References 5 publications.
“…Various methods have been proposed in the literature to address author verification. In particular, ensemble methods have proved quite successful at capturing the features relevant to the author's style and discarding those that are not [7][8][9][10][21]. Author verification has also been the focus of several iterations of the PAN shared tasks, e.g.…”
Section: Related Work
confidence: 99%
“…A common option is to use as impostors the results of Google queries formed by randomly picking words from the set of input documents. In the experiments presented below (see section 5), we opt for using all the training documents as impostors. While this option is not ideal, since the impostors then include precisely the documents to be compared, it is a reasonable simplification if the training set is diverse enough in terms of authors and if the number of iterations is large enough to prevent the occasional wrong comparison from having a significant effect on the output features.…”
Section: General Impostor Strategy
confidence: 99%
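
The impostor strategy described in the statement above can be illustrated with a minimal sketch. It is not the citing authors' implementation: the function names (`profile`, `cosine`, `impostor_score`), the whitespace tokenisation, the cosine similarity, and the default parameters are all assumptions chosen for illustration. Each iteration samples a random subset of features and of impostors, and counts a win when the two compared documents are more similar to each other than the first document is to every sampled impostor.

```python
import math
import random
from collections import Counter


def profile(text):
    """Relative word frequencies for a whitespace-tokenised text (assumed feature set)."""
    counts = Counter(text.lower().split())
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def cosine(p, q, features):
    """Cosine similarity of two profiles, restricted to the given features."""
    dot = sum(p.get(f, 0.0) * q.get(f, 0.0) for f in features)
    norm_p = math.sqrt(sum(p.get(f, 0.0) ** 2 for f in features))
    norm_q = math.sqrt(sum(q.get(f, 0.0) ** 2 for f in features))
    return dot / (norm_p * norm_q) if norm_p and norm_q else 0.0


def impostor_score(doc_a, doc_b, impostors, iterations=100,
                   feature_rate=0.5, n_impostors=5, seed=0):
    """Fraction of iterations in which doc_b beats every sampled impostor.

    All parameter names and defaults are hypothetical, for illustration only.
    """
    rng = random.Random(seed)
    pa, pb = profile(doc_a), profile(doc_b)
    pool = [profile(text) for text in impostors]
    vocab = sorted(set(pa) | set(pb))
    k = max(1, int(len(vocab) * feature_rate))
    wins = 0
    for _ in range(iterations):
        features = rng.sample(vocab, k)                       # random feature subset
        sampled = rng.sample(pool, min(n_impostors, len(pool)))  # random impostors
        target = cosine(pa, pb, features)
        # Strict comparison: a tie with an impostor does not count as a win.
        if all(target > cosine(pa, imp, features) for imp in sampled):
            wins += 1
    return wins / iterations
```

Because the comparison is strict, an impostor pool that happens to contain `doc_b` itself produces a lost iteration whenever that copy is sampled. This is the "occasional wrong comparison" the statement mentions, and it is why a large number of iterations and a diverse pool matter under that simplification.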
“…It is convenient to conflate the notions of "methods" and "tools". Various aspects of the tools and analysis conducted using the tools have been published (Appel & Vogel, 2001; Van Gijsel & Vogel, 2003; O'Brien & Vogel, 2003; Vogel, 2007b; Healey, Vogel, & Eshghi, 2007; Vogel, 2007a; Vogel & Brisset, 2007; Frontini, Lynch, & Vogel, 2008; Vogel, Lynch, & Janssen, 2008). Student projects (e.g.…”
Section: Introduction
confidence: 99%