Proceedings of the 55th Annual Meeting of the Association For Computational Linguistics (Volume 1: Long Papers) 2017
DOI: 10.18653/v1/p17-1136
Context Sensitive Lemmatization Using Two Successive Bidirectional Gated Recurrent Networks

Abstract: We introduce a composite deep neural network architecture for supervised, language-independent, context-sensitive lemmatization. The proposed method treats the task as identifying the correct edit tree representing the transformation between a word-lemma pair. To find the lemma of a surface word, we exploit two successive bidirectional gated recurrent structures: the first extracts character-level dependencies, and the second captures the contextual information of the given word. The ke…
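The edit trees the abstract refers to (in the style of Chrupała et al., 2008) can be sketched as follows. This is an illustrative reconstruction, not the paper's code: a tree is built from a word-lemma pair by recursing around the longest common substring, stores only prefix/suffix lengths and substring replacements, and can then be applied to unseen words; the neural classifier over such trees is not shown. All function names here are hypothetical.

```python
def _lcs(a, b):
    """Longest common substring of a and b: returns (start_a, start_b, length)."""
    best = (0, 0, 0)
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                cur[j] = prev[j - 1] + 1
                if cur[j] > best[2]:
                    best = (i - cur[j], j - cur[j], cur[j])
        prev = cur
    return best

def build(word, lemma):
    """Build an edit tree: leaves substitute substrings outright; interior
    nodes keep the longest common substring and recurse on both sides."""
    if not word and not lemma:
        return None
    i, j, n = _lcs(word, lemma)
    if n == 0:
        return ("sub", word, lemma)
    # Store prefix/suffix *lengths* of the word, not absolute positions,
    # so the same tree generalizes to unseen words of different lengths.
    return ("match", i, len(word) - (i + n),
            build(word[:i], lemma[:j]),
            build(word[i + n:], lemma[j + n:]))

def apply_tree(tree, word):
    """Apply an edit tree to a (possibly unseen) word; None if inapplicable."""
    if tree is None:
        return "" if word == "" else None
    if tree[0] == "sub":
        return tree[2] if word == tree[1] else None
    _, pre, suf, left, right = tree
    if len(word) < pre + suf:
        return None
    l = apply_tree(left, word[:pre])
    r = apply_tree(right, word[len(word) - suf:])
    if l is None or r is None:
        return None
    # Unchanged middle segment flanked by recursively transformed ends.
    return l + word[pre:len(word) - suf] + r
```

For example, the tree induced from the pair ("sang", "sing") also lemmatizes "rang" to "ring", which is why classifying over a shared inventory of edit trees can generalize across inflection patterns.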

Cited by 34 publications (48 citation statements). References 12 publications.
“…More recently, Cotterell et al (2015) have used an extended set of features and a second-order CRF to jointly predict POS-tags and edit-trees with state-of-the-art performance. Finally, Chakrabarty et al (2017) employed a softmax classifier to predict edit-trees based on sentence-level features implicitly learned with a neural encoder over the input sentence.…”
Section: Related Work
Confidence: 99%
“…Lemmatization is the task of predicting the base form (lemma) of an inflected word. A lemmatizer may make use of the context to get (implicit) information about the source form of the word (Koskenniemi 1984; Kanis and Müller 2005; Chrupała et al 2008; Jongejan and Dalianis 2009; Chakrabarty et al 2017). In comparison, our task does not offer contextual information, but instead provides the (similarly implicit) cues for the forms from the demo relation.…”
Section: Other Morphological Transformations
Confidence: 99%
“…The ISI system started with a heuristically induced candidate set, using the edit tree approach described by Chrupała et al (2008), and then chose the best edit tree. This approach is effectively a neuralized version of the lemmatizer proposed in Müller et al (2015) and, indeed, was originally intended for that task (Chakrabarty et al, 2017). The UA team, following their 2016 submission, proposed a linear reranking on top of the k-best output of their transduction system.…”
Section: System Descriptions
Confidence: 99%
“…Our model is related to the encoder-decoder based approaches such as (Aharoni et al, 2016;Kann and Schütze, 2016a,b), but the main difference is that the proposed network is not designed to generate sequence of characters as output. Rather, we formulate the problem as to classify the transformation process required to convert a source form to its target form (Chakrabarty et al, 2017). Our goal is to model such a system which receives an input word and the morphological tags and returns the proper transformation that induces the target word.…”
Section: Introduction
Confidence: 99%