2018
DOI: 10.1353/jcl.2018.0002
|View full text |Cite
|
Sign up to set email alerts
|

A Multimedia Corpus of Child Mandarin: The Tong Corpus

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 29 publications
(8 citation statements)
references
References 43 publications
0
7
0
1
Order By: Relevance
“…The Adam and Eve sections from the Brown Corpus are then used to evaluate the depthbounded model defined in Section 4. Transcribed child-directed speech data in Chinese Mandarin (Tong; Deng et al 2018) and German (Leo;Behrens 2006) are also collected from the CHILDES corpus with reference trees automatically generated using the stateof-the-art Kitaev and Klein (2018) supervised parser trained with the Chinese (Xia et al 2000; The Chinese Treebank) and German (Skut et al 1998;NEGRA) treebanks. They are used as held-out data sets for the bounded grammar induction experiments, using cross-linguistic hyperparameters tuned on English.…”
Section: Experiments 3: Evaluation Of Bounded Pcfg Induction On Child-mentioning
confidence: 99%
“…The Adam and Eve sections from the Brown Corpus are then used to evaluate the depthbounded model defined in Section 4. Transcribed child-directed speech data in Chinese Mandarin (Tong; Deng et al 2018) and German (Leo;Behrens 2006) are also collected from the CHILDES corpus with reference trees automatically generated using the stateof-the-art Kitaev and Klein (2018) supervised parser trained with the Chinese (Xia et al 2000; The Chinese Treebank) and German (Skut et al 1998;NEGRA) treebanks. They are used as held-out data sets for the bounded grammar induction experiments, using cross-linguistic hyperparameters tuned on English.…”
Section: Experiments 3: Evaluation Of Bounded Pcfg Induction On Child-mentioning
confidence: 99%
“…The data came from two boys and one girl whose language was regularly recorded, transcribed, and accessed for the present research from the CHILDES repository; naturalistic language data gathered by primary investigators are deposited in CHILDES to enable analysis by other researchers. We included Tong, a boy described by Xiangjun and Yip (2018) in their language acquisition research. Tong was raised in Shenzhen where both Mandarin and Cantonese are spoken.…”
Section: Participantsmentioning
confidence: 99%
“…Finally, to examine whether a similar relationship between morphological typology and induction per- formance is observed in languages other than English and Korean, the NeuralChar and NeuralWord models were also evaluated on Mandarin Chinese and German child-directed speech corpora from CHILDES. The Chinese corpus consists of 19,541 caregiver utterances from the Tong section (Deng et al, 2018) with a mean sentence length of 5.7 words, which were recorded at ages from 1 year 0 months and 4 years 5 months. The German corpus contains 20,000 child-directed utterances randomly sampled from the Leo section (Behrens, 2006), as the original corpus contained many duplicate utterances in interactions between Leo and his caregivers between ages 1 year 11 months and 4 years 11 months.…”
Section: Replication Using Silver Datamentioning
confidence: 99%