RANLP 2017 - Recent Advances in Natural Language Processing Meet Deep Learning 2017
DOI: 10.26615/978-954-452-049-6_003
|View full text |Cite
|
Sign up to set email alerts
|

A Comparison of Feature-Based and Neural Scansion of Poetry

Abstract: Automatic analysis of poetic rhythm is a challenging task that involves linguistics, literature, and computer science. When the language to be analyzed is known, rule-based systems or data-driven methods can be used. In this paper, we analyze poetic rhythm in English and Spanish. We show that the representations of data learned from character-based neural models are more informative than the ones from hand-crafted features, and that a Bi-LSTM+CRF-model produces state-of-the art accuracy on scansion of poetry i… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 10 publications
(18 citation statements)
references
References 23 publications
(20 reference statements)
0
18
0
Order By: Relevance
“…We only chose <real> when <met> doesn't match the syllable count (ca. 200 cases), likely deviating from the setup in (Agirrezabal et al, 2016(Agirrezabal et al, , 2019.…”
Section: Learning Metermentioning
confidence: 93%
See 1 more Smart Citation
“…We only chose <real> when <met> doesn't match the syllable count (ca. 200 cases), likely deviating from the setup in (Agirrezabal et al, 2016(Agirrezabal et al, , 2019.…”
Section: Learning Metermentioning
confidence: 93%
“…The annotated corpora for English include: (1) The for-better-for-verse (FORB) collection 18 with around 1200 lines which was used by Agirrezabal et al (2016Agirrezabal et al ( , 2019, and (2) the 1700 lines of poetry against which prosodic 19 ( Anttila and Heuser, 2016;Algee-Hewitt et al, 2014) was evaluated (PROS). We merge these with our own (3) 1200 lines in 64 English poems (EPG64).…”
Section: Additional Data and Formatmentioning
confidence: 99%
“…Accuracy (Gervas, 2000) 88.73 (Navarro-Colorado, 2017) 94.44 (Agirrezabal et al, 2017) 90.84 Rantanplan (ours) 96.23 Lastly, the PoetryLab API provides a pluggable architecture that allows for the integration of external packages developed in languages other than Python. This is the case for our named entity recognition system, HisMeTag (Platas et al, 2021), developed in Java and connected to the PoetryLab API through an internal REST API exposed via Docker.…”
Section: Methodsmentioning
confidence: 99%
“…With ever increasing corpora sizes and the popularization of distant reading techniques, the possibility to automate part of the analysis became very attractive. Although solutions exist, they are either incomplete, e.g., scansion of fixed-metre poetry (Agirrezabal et al, 2016;Navarro-Colorado, 2017;Gervas, 2000;Agirrezabal et al, 2017), not applicable to Spanish (Agirrezabal et al, 2017;Hartman, 2005), or not open or reproducible (Gervas, 2000). Moreover, disparate input and output formats, operating system requirements and dependencies, and the lack of interoperability between software packages, further complicated the limited ecosystem of tools to analyze Spanish poetry.…”
Section: Poetrylabmentioning
confidence: 99%
“…However, these results are restricted to hendecasyllabic verses. Shortly after that, Agirrezabal used neural networks to predict the metrical pattern of lines of verses [27]. The model proposed was a character-based bidirectional long short term (BiLSTM) neural network with conditional random fields.…”
Section: Related Workmentioning
confidence: 99%