“…Different trial-by-trial characteristics like the amount of referential ambiguity (number of objects on the screen), number of to-be-learned words, how often each word is repeated, and the complexity of the word-object-mappings impact CSWL: Observed learning rate will be highest if there are only two visual referents on each screen and a small number of to-be learned one-to-one-mappings (i.e., each word maps onto one referent only) that are repeated often (e.g., Yu and Smith, 2007;Poepsel and Weiss, 2016;Roembke and McMurray, 2016). Some studies have combined the learning of cross-situational statistics with other cues, such as morphological (Finley, 2022) or social ones (Frank et al, 2013;MacDonald et al, 2017); these can facilitate how easy it is to learn or what type of information is encoded.…”