Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1376

Are we there yet? Encoder-decoder neural networks as cognitive models of English past tense inflection

Abstract: The cognitive mechanisms needed to account for the English past tense have long been a subject of debate in linguistics and cognitive science. Neural network models were proposed early on, but were shown to have clear flaws. Recently, however, Kirov and Cotterell (2018) showed that modern encoder-decoder (ED) models overcome many of these flaws. They also presented evidence that ED models demonstrate humanlike performance in a nonce-word task. Here, we look more closely at the behaviour of their model in thi…

Cited by 26 publications (26 citation statements); references 23 publications (33 reference statements). Excerpts from citing publications, with section and confidence metadata, are listed below.
“…When given the task of predicting the judgement and production data from Albright and Hayes’ (2003) novel verb study, each of these exemplar models – depending on the particular instantiation – equals or betters both a state-of-the-art connectionist model (cf. Chandler, 2010, Table 1; Kirov & Cotterell, 2018, Table 5; though see Corkery, Matusevych, & Goldwater, 2019, for concerns regarding the stability of these simulations) and Albright and Hayes’ (2003) own model, which constructs an explicit micro-rule for each and every subregularity; an approach which shows no regard for psychological plausibility, in contrast to many exemplar models which have their origins in models and findings from the non-linguistic categorization literature.…”
Section: Morphologically Inflected Words (mentioning; confidence: 99%)
“…This neural network architecture was originally designed for machine translation, but has been proposed as a baseline for morphophonological learning, and correlates well with human behaviour in a number of such tasks (Kirov 2017). For example, when tested on the experimental results from Albright & Hayes (2003), a Seq2Seq model's predictions correlated with human behaviour better than any previously proposed model (Kirov & Cotterell 2018; although see Corkery et al. 2019 for a critique of these results). The Seq2Seq network learns string-to-string mappings (UR to SR mappings in this case) by updating weights for connections between nodes that are organised into multiple layers.…”
Section: Learning Simulations (mentioning; confidence: 96%)
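To make the mechanism described in the excerpt above concrete, here is a minimal, self-contained sketch of a character-level encoder-decoder trained on string-to-string pairs (present stem to past-tense form). It is a toy illustration only: the tiny training set, hyperparameters, and the attention-free GRU architecture are assumptions made here for brevity, not the actual setup of Kirov and Cotterell (2018), whose model uses attention and is trained on full CELEX data.

```python
# Minimal character-level encoder-decoder (Seq2Seq) sketch for a
# string-to-string inflection mapping, e.g. present stem -> past tense.
# Toy data, sizes, and the attention-free GRU design are assumptions
# made for illustration; they are not Kirov & Cotterell's (2018) setup.
import torch
import torch.nn as nn

pairs = [("walk", "walked"), ("jump", "jumped"), ("sing", "sang")]

# Shared character vocabulary with start/end-of-sequence symbols.
symbols = ["<s>", "</s>"] + sorted({c for s, t in pairs for c in s + t})
vocab = {c: i for i, c in enumerate(symbols)}
V = len(vocab)

def encode(word):
    """Map a word to a tensor of character ids, ending with </s>."""
    return torch.tensor([vocab[c] for c in word] + [vocab["</s>"]])

class Seq2Seq(nn.Module):
    def __init__(self, hidden=64):
        super().__init__()
        self.emb = nn.Embedding(V, hidden)
        self.encoder = nn.GRU(hidden, hidden, batch_first=True)
        self.decoder = nn.GRU(hidden, hidden, batch_first=True)
        self.out = nn.Linear(hidden, V)

    def forward(self, src, tgt_in):
        # Encode the source string into a single hidden state, then decode
        # the target string conditioned on it (teacher forcing).
        _, h = self.encoder(self.emb(src).unsqueeze(0))
        dec, _ = self.decoder(self.emb(tgt_in).unsqueeze(0), h)
        return self.out(dec).squeeze(0)  # (target_len, V) logits

model = Seq2Seq()
opt = torch.optim.Adam(model.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(200):                  # connection weights are updated
    for src, tgt in pairs:                # by backpropagating the loss
        tgt_ids = encode(tgt)
        tgt_in = torch.cat([torch.tensor([vocab["<s>"]]), tgt_ids[:-1]])
        logits = model(encode(src), tgt_in)
        loss = loss_fn(logits, tgt_ids)
        opt.zero_grad()
        loss.backward()
        opt.step()
```

Producing a past tense for a nonce stem would then amount to feeding the encoder the new string and generating output characters one at a time from `<s>` until `</s>` is emitted.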
“…Sequence-to-sequence models are also capable of learning stem-affix relationships, both morphological and phonological, as discussed in Section 3. Faruqui et al. (2016) illustrates this for the case of Finnish vowel harmony (see also Corkery et al. 2019); many earlier models with explicit morpheme segmentation were forced to represent this process as suppletive allomorphy (for example, positing the two phonological variants of the inessive suffix, -ssa and -ssä, as suppletive allomorphs), which could lead to overassessment of the system's complexity (Stump and Finkel 2015). But the sequence-to-sequence model learns a generalizable harmony rule.…”
Section: 1 (mentioning; confidence: 99%)
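As a concrete illustration of the point in the excerpt above, the sketch below shows made-up, simplified character-level training pairs for the Finnish inessive, together with a rule-based reference function; the stems and the simplified harmony condition are assumptions for illustration, not data or code from Faruqui et al. (2016). A single sequence-to-sequence model trained on such pairs sees -ssa and -ssä as outputs of one mapping and can induce the backness-harmony generalisation, rather than storing two listed allomorphs.

```python
# Illustrative (made-up) training pairs for the Finnish inessive case.
# A character-level Seq2Seq model trained on stem -> inflected form
# can generalise the -ssa / -ssä alternation from examples like these,
# instead of listing two suppletive allomorphs.
inessive_pairs = [
    ("talo",  "talossa"),    # back-vowel stem  -> -ssa
    ("auto",  "autossa"),    # back-vowel stem  -> -ssa
    ("metsä", "metsässä"),   # front-vowel stem -> -ssä
    ("kylä",  "kylässä"),    # front-vowel stem -> -ssä
]

FRONT_VOWELS = set("äöy")

def inessive(stem: str) -> str:
    """Rule-based reference (simplified harmony: any front vowel => -ssä)."""
    suffix = "ssä" if any(c in FRONT_VOWELS for c in stem) else "ssa"
    return stem + suffix

assert all(inessive(stem) == form for stem, form in inessive_pairs)
```

Unlike the reference rule above, the neural model is never given the category "front vowel"; the harmony generalisation has to emerge from the character distributions in the training pairs.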
“…We address these questions below. Cotterell et al. (2018b) suggests that sequence-to-sequence models can function as cognitive models of infant language learners (though see Corkery et al. (2019) for some differences in behavior for nonce words). But to use a sequence-to-sequence model as a credible stand-in for the human infant, we must determine what the input for acquisition of morphology looks like: the right representation and learning algorithm cannot tell us anything if it is supplied with the wrong data.…”
Section: Modeling Morphological Learning (mentioning; confidence: 99%)