Proceedings of the 55th Annual Meeting of the Association For Computational Linguistics (Volume 2: Short Papers) 2017
DOI: 10.18653/v1/p17-2028
|View full text |Cite
|
Sign up to set email alerts
|

The Role of Prosody and Speech Register in Word Segmentation: A Computational Modelling Perspective

Abstract: This study explores the role of speech register and prosody for the task of word segmentation. Since these two factors are thought to play an important role in early language acquisition, we aim to quantify their contribution for this task. We study a Japanese corpus containing both infant-and adult-directed speech and we apply four different word segmentation models, with and without knowledge of prosodic boundaries. The results showed that the difference between registers is smaller than previously reported … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
1

Relationship

7
0

Authors

Journals

citations
Cited by 9 publications
(13 citation statements)
references
References 17 publications
0
13
0
Order By: Relevance
“…prosody larger prosodic breaks (Ludusan et al, 2016) more difficult word form clustering (Exp. 3) easier word segmentation (Ludusan et al, 2017) phonetic categories discriminative learning clustering protolexicon segmentation Figure 6. Summary of IDS characteristics relative to ADS in a top-down model of phonetic category learning for the RIKEN corpus.…”
Section: Caregiver Infantmentioning
confidence: 99%
“…prosody larger prosodic breaks (Ludusan et al, 2016) more difficult word form clustering (Exp. 3) easier word segmentation (Ludusan et al, 2017) phonetic categories discriminative learning clustering protolexicon segmentation Figure 6. Summary of IDS characteristics relative to ADS in a top-down model of phonetic category learning for the RIKEN corpus.…”
Section: Caregiver Infantmentioning
confidence: 99%
“…These studies are informative for a host of learnability questions, such as to test the sheer feasibility of a proposed word segmentation solution (Gambell & Yang, 2005), to compare alternative algorithms (Goldwater, Griffiths, & Johnson, 2009;Pearl, Goldwater, & Steyvers, 2010), to see whether languages differ in their intrinsic segmentability (Fourtassi, Börschinger, Johnson, & Dupoux, 2013), or whether child-directed speech is intrinsically easier to segment than adult-directed speech (Ludusan, Mazuka, Bernard, Cristia, & Dupoux, 2017). Additionally, there is emergent evidence suggesting that computational word segmentation results may also be relevant for infant psycholinguistics, by predicting the contents of infants' long-term vocabulary better than lexical status (Ngon et al, 2013) or pure frequency (Larsen, Cristia, & Dupoux, 2017).…”
Section: Introductionmentioning
confidence: 99%
“…Batchelder (2002) applied a lexical algorithm onto the American English Bernstein Ratner corpus (Bernstein Ratner, 1984), and found a 15% advantage for CDS over ADS. Ludusan, Mazuka, Bernard, Cristia, and Dupoux (2017) applied two lexical and two sublexical algorithms to the Japanese-spoken Riken corpus (Mazuka, Igarashi, & Nishikawa, 2006), where the CDS advantage was between 2% and 10%. Still, it is unclear whether either corpus is representative of the CDS and ADS children hear every day.…”
Section: Introductionmentioning
confidence: 99%
“…Therefore, we studied an ecological child-centered corpus containing both ADS and CDS. We followed Ludusan and colleagues (2017) by using both lexical and sublexical algorithms; in addition, we varied important parameters within these classes and further added two baselines. In all, we aimed to provide a more accurate and generalizable estimate of the size of segmentability differences in CDS versus ADS.…”
Section: Introductionmentioning
confidence: 99%