INTRODUCTIONAuthors, in both the area of experimental research and the area of computer simulation in developmental psychology, have considered neural networks (including connectionist PDP networks) as important process models ofWe would like to thank Han van der Maas, Anny Bosman, and the reviewers for their helpful comments on this article.Correspondence and requests for reprints should be sent to Maartje Raijmakers, Department of Psychology, University of Amsterdam, Roetersstraat 15, 1018 WB Amsterdam, The Netherlands, E-mail: op_raijmakers@macmaiI.psy.uva.nl.
101
102
RAIJMAKERS, VAN KOTEN, AND MOLENAARcognitive development (Bates & Elman, 1993; Grossberg, 1980; McClelland, 1989; Plunkett & Sinha, 1992;Siegler, 1989). Contrary to the relatively fixed architectures of the symbol manipulating production systems commonly employed in cognitive science, neural networks develop their knowledge base in adaptive interaction with the environment. In this respect, neural networks are compatible with epigenetical theory, according to which such interactions are the prime source of the emergence of more powerful cognitive structures (cf. Molenaar, 1986b). PDP networks, introduced by McClelland, Rumelhart, and the PDP research group (1986), constitute a distinct subset of neural network models. They are thought to be capable of both acquiring symbolic systems up to the level of natural language and modeling specific developmental phenomena like the accommodation process, which lies at the heart of Piaget's theory of developmental change (McClelland & Jenkins, 1991, p. 69). The applicability of those models to developmental processes is mainly studied by comparing their performance with empirical data, pertaining to, for example, English verb morphology, concept formation and vocabulary growth, and stagewise cognitive development (Plunkett & Sinha, 1992).In this article, we will take a closer look at the latter application: the simulation of stagewise cognitive development. McClelland and others (e.g., McClelland & Jenkins, 1991) have drawn two main conclusions from their study of a PDP network that learns the balance scale task. First, the learning behaviour can be described as the acquisition of increasingly complex rules. This conclusion is based on an application of Siegler's (1981) rule-assessment methodology, in which observed response patterns are classified as being generated by one of four distinct rules. Second, the acquisition of more complex rules by the network appears to proceed in a stagewise manner. That is, the performance of the network is for a while consistent with a particular arule and then suddenly shifts to another rule.However, questions have been raised about the evidence on which these conclusions are based. The four increasingly complex rules in Siegler's (1981) scheme constitute a measurement scale with discrete values. Hence, an application of this scheme to the learning behaviour of the network implies that the observed response patterns, which can vary continuously along multiple dimens...