Automatic analysis of poetic rhythm is a challenging task that involves linguistics, literature, and computer science. When the language to be analyzed is known, rule-based systems or data-driven methods can be used. In this paper, we analyze poetic rhythm in English and Spanish. We show that data representations learned by character-based neural models are more informative than hand-crafted features, and that a Bi-LSTM+CRF model achieves state-of-the-art accuracy on scansion of poetry in both languages. Results also show that information about whole-word structure, and not just independent syllables, is highly informative for performing scansion.
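The character-based input that such a model consumes can be sketched as follows. This is an illustrative encoding only: the function name, label scheme ('+' for stressed, '-' for unstressed syllables), and the example stress assignment are assumptions, not the paper's actual preprocessing.

```python
# Hypothetical sketch of a character-level encoding for a Bi-LSTM+CRF
# scanner: each character gets an integer id and inherits the stress
# label of the syllable it belongs to. Names and labels are illustrative.

def encode_line(syllables, stresses, char_vocab=None):
    """Turn a syllabified verse line into per-character ids and labels.

    syllables: list of syllable strings
    stresses:  one '+' (stressed) or '-' (unstressed) per syllable
    """
    if char_vocab is None:
        char_vocab = {}
    char_ids, labels = [], []
    for syl, stress in zip(syllables, stresses):
        for ch in syl:
            char_ids.append(char_vocab.setdefault(ch, len(char_vocab)))
            labels.append(stress)
    return char_ids, labels, char_vocab

# "Shall I compare thee" with an illustrative (not authoritative) scansion:
ids, labels, vocab = encode_line(
    ["shall", "i", "com", "pare", "thee"], ["+", "-", "-", "+", "-"])
```

A sequence model over these per-character ids, with a CRF layer decoding the per-character labels, sees the whole word's spelling rather than isolated syllables, which is one way the whole-word information mentioned above can enter the model.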
Keywords: scansion, English, poetry, out-of-vocabulary words

We present a finite-state technology (FST) based system capable of performing metrical scansion of verse written in English. Scansion is the traditional task of analyzing the lines of a poem, marking the stressed and unstressed elements and dividing the line into metrical feet. The system's workflow is composed of several subtasks designed around finite-state machines that analyze verse by performing tokenization, part-of-speech tagging, stress placement, and stress-pattern prediction for unknown words. The scanner also classifies poems according to the predominant type of metrical foot found. We present a brief evaluation of the system using a gold-standard corpus of human-scanned verse, on which a per-syllable accuracy of 86.78% is achieved. The program uses open-source components and is released under the GNU GPL license.
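The lexicon-plus-fallback idea behind stress placement and stress-pattern prediction for unknown words can be sketched as below. The lexicon entries and the OOV heuristic are toy assumptions for illustration; the actual system implements these steps as finite-state machines.

```python
import re

# Toy stress lexicon: '+' = stressed syllable, '-' = unstressed.
# Entries are invented for illustration, not taken from the system.
LEXICON = {"shall": "+", "i": "-", "compare": "-+", "thee": "-",
           "to": "-", "a": "-", "summer's": "+-", "day": "+"}

def stress_pattern(word):
    """Look the word up; for out-of-vocabulary words fall back to a crude
    heuristic that stresses the first syllable (vowel groups approximate
    syllable count)."""
    word = word.lower()
    if word in LEXICON:
        return LEXICON[word]
    n = max(1, len(re.findall(r"[aeiouy]+", word)))
    return "+" + "-" * (n - 1)

def scan(line):
    """Concatenate per-word stress patterns into a line-level scansion."""
    return "".join(stress_pattern(w) for w in line.split())
```

For example, `scan("shall i compare thee to a summer's day")` yields a ten-syllable pattern from which the predominant foot type could then be estimated.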
We describe a robot capable of composing and playing traditional Basque impromptu verses, bertsoak. The system, called Bertsobot, is able to construct improvised verses according to given constraints on rhyme and meter, and to perform them in public. Towards this end, several tools and applications have been developed and integrated into Bertsobot, including a speech-based communication system, text applications for verse generation, and robot behaviours for interacting with the environment during a public performance. We describe the tools and processes behind our approach, present some early experimental results and illustrative verses, and finally summarize our conclusions and future steps.
In this work we propose a data-driven methodology for identifying temporal trends in a corpus of medieval charters. We use perplexities derived from RNNs as a distance measure between documents and then perform clustering on those distances. We argue that perplexities calculated by such language models are representative of temporal trends. The clusters produced using the K-Means algorithm give insight into differences in language across time periods, at least partly due to language change. We suggest that the temporal distribution of the individual clusters might provide a more nuanced picture of temporal trends than discrete bins, thus providing better results when used in a classification task.
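The perplexity-then-cluster pipeline can be sketched in miniature as follows. A character-bigram language model stands in for the paper's RNNs, and a tiny one-dimensional k-means stands in for the clustering step; all function names and the smoothing scheme are assumptions.

```python
import math
from collections import Counter

def train_bigram(text):
    """Collect character bigram and unigram counts as a stand-in LM."""
    return Counter(zip(text, text[1:])), Counter(text)

def perplexity(text, model, alpha=1.0, vocab=128):
    """Per-character perplexity under the add-alpha smoothed bigram model."""
    pairs, unigrams = model
    logp = 0.0
    for a, b in zip(text, text[1:]):
        p = (pairs[(a, b)] + alpha) / (unigrams[a] + alpha * vocab)
        logp += math.log(p)
    return math.exp(-logp / max(1, len(text) - 1))

def kmeans_1d(xs, k=2, iters=20):
    """Minimal 1-D k-means over the per-document perplexities."""
    centers = sorted(xs)[:1] + sorted(xs)[-1:]  # init at min and max (k=2)
    for _ in range(iters):
        groups = [[] for _ in centers]
        for x in xs:
            groups[min(range(k), key=lambda i: abs(x - centers[i]))].append(x)
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers
```

Training the stand-in model on text from one period and scoring documents from other periods gives each document a perplexity; clustering those values groups documents whose language deviates similarly from the reference period.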
In this paper we present our approach for detecting signs of depression from social media text. Our model relies on word unigrams, part-of-speech tags, readability measures, the use of first-, second-, or third-person pronouns, and the number of words. Our best model obtained a macro F1-score of 0.439 and ranked 25th out of 31 teams. We further take advantage of the interpretability of the Logistic Regression model and attempt to interpret the model coefficients, with the hope that these will be useful for further research on the topic.
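The kind of feature vector such a model consumes can be sketched as below. The pronoun lists, feature names, and normalization are illustrative assumptions, not the paper's exact feature set, and the part-of-speech and readability features are omitted for brevity.

```python
# Illustrative feature extraction for a logistic-regression depression
# classifier: word unigrams, person-pronoun ratios, and word count.
FIRST = {"i", "me", "my", "mine", "we", "us", "our"}
SECOND = {"you", "your", "yours"}
THIRD = {"he", "she", "it", "they", "him", "her", "them"}

def featurize(text):
    tokens = text.lower().split()
    n = len(tokens)
    feats = {
        "n_words": n,
        "first_person": sum(t in FIRST for t in tokens) / max(n, 1),
        "second_person": sum(t in SECOND for t in tokens) / max(n, 1),
        "third_person": sum(t in THIRD for t in tokens) / max(n, 1),
    }
    for t in set(tokens):          # binary word-unigram presence features
        feats["w=" + t] = 1.0
    return feats

feats = featurize("i feel like nobody understands me")
```

Because each feature is a named dimension, the learned coefficient on, say, `first_person` can be read off directly, which is what makes the coefficient-level interpretation mentioned above possible.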
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations: citations that display the context in which an article is cited and indicate whether the citing article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.