2012
DOI: 10.1073/pnas.1115407109
|View full text |Cite
|
Sign up to set email alerts
|

Quantitative patterns of stylistic influence in the evolution of literature

Abstract: Literature is a form of expression whose temporal structure, both in content and style, provides a historical record of the evolution of culture. In this work we take on a quantitative analysis of literary style and conduct the first large-scale temporal stylometric study of literature by using the vast holdings in the Project Gutenberg Digital Library corpus. We find temporal stylistic localization among authors through the analysis of the similarity structure in feature vectors derived from content-free word… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
86
0
3

Year Published

2014
2014
2017
2017

Publication Types

Select...
8

Relationship

1
7

Authors

Journals

citations
Cited by 132 publications
(89 citation statements)
references
References 21 publications
0
86
0
3
Order By: Relevance
“…The K -L divergence is commonly used in studies related to ours [24,25]. Being a measure of relative entropy, it measures the divergence of two frequency distributions with regard to their informational content.…”
Section: Methodsmentioning
confidence: 99%
“…The K -L divergence is commonly used in studies related to ours [24,25]. Being a measure of relative entropy, it measures the divergence of two frequency distributions with regard to their informational content.…”
Section: Methodsmentioning
confidence: 99%
“…These results suggest that later prose authors were influenced by the style of Caesar and the writers in Caesar's wake, including Livy, to a greater extent than has been previously acknowledged, even when writing about very different subject matter. Analogous phenomena have also been observed for the evolution of genres and literary style in English and other Latin corpora (7,10,25,40). Throughout our work, we show the usefulness of incorporating syntactic and metrical features in addition to diction, noncontent words, and punctuation marks, which have been considered previously by Jockers (10) and others (25), into such comparative analyses.…”
Section: Anomaly Detection Differentiates Suspected Citations From Othermentioning
confidence: 76%
“…Analogous phenomena have also been observed for the evolution of genres and literary style in English and other Latin corpora (7,10,25,40). Throughout our work, we show the usefulness of incorporating syntactic and metrical features in addition to diction, noncontent words, and punctuation marks, which have been considered previously by Jockers (10) and others (25), into such comparative analyses.…”
Section: Anomaly Detection Differentiates Suspected Citations From Othermentioning
confidence: 76%
See 1 more Smart Citation
“…In addition to tracking human performance, use of the explicit semantic representation provided by Roget's provides categories fixed independently of corpus frequency. [Unsupervised methods, such as topic modeling via latent Dirichlet allocation (29), can provide a different and complementary window onto this process, including the study of both nonsemantic pattern (30) and unmarked semantic distinctions within indictment classes. ]…”
Section: Methodsmentioning
confidence: 99%