Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2017
DOI: 10.18653/v1/p17-1029

Multi-space Variational Encoder-Decoders for Semi-supervised Labeled Sequence Transduction

Abstract: Labeled sequence transduction is the task of transforming one sequence into another sequence that satisfies desiderata specified by a set of labels. In this paper we propose multi-space variational encoder-decoders, a new model for labeled sequence transduction with semi-supervised learning. The generative model can use neural networks to handle both discrete and continuous latent variables to exploit various features of data. Experiments show that our model provides not only a powerful supervised framework but a…
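The abstract's central idea is a latent representation with both a continuous part and a discrete (label) part feeding the decoder. The snippet below is an illustrative sketch only, not the paper's actual model: the function names, the Gaussian reparameterization, and the sum-pooling of label embeddings are assumptions made purely for exposition.

```python
import numpy as np

def reparameterize(mu, log_var):
    """Sample z = mu + sigma * eps with eps ~ N(0, I) (the reparameterization trick)."""
    rng = np.random.default_rng()
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def build_decoder_input(mu, log_var, label_ids, label_embeddings):
    """Concatenate a continuous latent sample with pooled embeddings of discrete labels."""
    z_cont = reparameterize(mu, log_var)              # continuous latent space
    z_disc = label_embeddings[label_ids].sum(axis=0)  # pooled discrete-label features
    return np.concatenate([z_cont, z_disc])

# Toy usage: a 16-dim continuous latent plus two tags from a hypothetical 10-tag vocabulary.
mu, log_var = np.zeros(16), np.zeros(16)
label_embeddings = np.random.default_rng(0).standard_normal((10, 8))
dec_in = build_decoder_input(mu, log_var, [2, 7], label_embeddings)
print(dec_in.shape)  # (24,)
```

In a full model the decoder would condition on this concatenated vector, and a classifier would infer the discrete labels for unlabeled data; none of that machinery is shown here.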

Cited by 50 publications (63 citation statements)
References 20 publications

“…In this work, we further examine the method proposed in (Zhou and Neubig, 2017) for the shared task of SIGMORPHON 2017 on 52 languages and demonstrate the effectiveness of this approach. We will further improve our model's sophistication by investigating strategies for choosing appropriate semi-supervised data, and examining the model's performance on languages with a high inflection level.…”
Section: Discussion (mentioning)
confidence: 99%
“…, y_{|Σ_y|}}, respectively. In tasks where the tag is provided, i.e., labeled transduction (Zhou and Neubig, 2017), we denote the tag as an ordered set t ∈ Σ_t^* with a finite tag vocabulary Σ_t = {t_1, …”
Section: Preliminary (mentioning)
confidence: 99%
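To make the quoted notation concrete, here is a hypothetical instance (the strings and tags are illustrative, not drawn from the cited paper): a source sequence x over Σ_x, a target sequence y over Σ_y, and an ordered tag set t over Σ_t.

```python
# Hypothetical labeled-transduction instance (morphological-inflection style):
x = list("run")    # source sequence over the source alphabet Σ_x
t = ["V", "PST"]   # ordered tag set drawn from the tag vocabulary Σ_t
y = list("ran")    # target sequence over Σ_y that satisfies the tags
```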
“…com/sigmorphon/conll2017/tree/master/evaluation …Zhou and Neubig (2017), where even after 150k unlabeled examples, performance still appears to be increasing.) After controlling for the amount of additional data, we see only a small benefit from autoencoding corpus words (AE-CW) rather than random strings (AE-…”
[Figure 2: The accuracy of our best systems on all languages.]
Section: Multilingual Training (mentioning)
confidence: 99%
“…We train our system using Stochastic Gradient Descent. Our system is implemented using the DyNet toolkit (Neubig et al., 2017) and our code is freely available. There are three hyper-parameters in our system: the character embedding dimension, the size of the hidden layer of the LSTM models and the size of the hidden layer of the attention network.…”
Section: RNN Encoder-Decoder with Attention (mentioning)
confidence: 99%
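The quoted passage names the toolkit (DyNet), the optimizer (SGD), and three hyper-parameters, but not how they fit together. Below is a minimal sketch of such an attentional encoder-decoder in DyNet, assuming a single-layer character-level LSTM encoder and decoder with MLP attention; the dimensions, vocabulary size, and helper names are illustrative assumptions, not the cited system's code.

```python
import dynet as dy

# The three hyper-parameters named in the quoted passage (values are placeholders):
EMB_DIM = 100   # character embedding dimension
HID_DIM = 200   # hidden-layer size of the LSTM models
ATT_DIM = 100   # hidden-layer size of the attention network
VOCAB   = 60    # assumed character-vocabulary size (not stated in the snippet)

pc = dy.ParameterCollection()
char_embeds = pc.add_lookup_parameters((VOCAB, EMB_DIM))
encoder = dy.LSTMBuilder(1, EMB_DIM, HID_DIM, pc)            # encoder LSTM
decoder = dy.LSTMBuilder(1, EMB_DIM + HID_DIM, HID_DIM, pc)  # decoder LSTM fed with context
W_att = pc.add_parameters((ATT_DIM, 2 * HID_DIM))            # attention MLP weights
v_att = pc.add_parameters((1, ATT_DIM))                      # attention MLP output vector
trainer = dy.SimpleSGDTrainer(pc)                            # plain stochastic gradient descent

def encode(char_ids):
    """Run the encoder LSTM over a character-id sequence and return its hidden states."""
    dy.renew_cg()
    return encoder.initial_state().transduce([char_embeds[c] for c in char_ids])

def attend(enc_states, dec_output):
    """MLP attention: weight each encoder state by v^T tanh(W [h_i; s]) and sum."""
    W, v = dy.parameter(W_att), dy.parameter(v_att)
    scores = [v * dy.tanh(W * dy.concatenate([h, dec_output])) for h in enc_states]
    weights = dy.softmax(dy.concatenate(scores))
    return dy.concatenate_cols(enc_states) * weights
```

The three hyper-parameters from the quote map directly onto EMB_DIM, HID_DIM, and ATT_DIM; the decoding loop, loss computation, and data handling are omitted.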