Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology 2019
DOI: 10.18653/v1/w19-4217
|View full text |Cite
|
Sign up to set email alerts
|

Data mining Mandarin tone contour shapes

Abstract: In spontaneous speech, Mandarin tones that belong to the same tone category may exhibit many different contour shapes. We explore the use of data mining and NLP techniques for understanding the variability of tones in a large corpus of Mandarin newscast speech. First, we adapt a graph-based approach to characterize the clusters (fuzzy types) of tone contour shapes observed in each tone n-gram category. Second, we show correlations between these realized contour shape types and a bag of automatically extracted … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 17 publications
(27 reference statements)
0
2
0
Order By: Relevance
“…Levow (2006) performed unsupervised and semi-supervised tone clustering in Mandarin, using average pitch and slope as features, and k-means and asymmetric k-lines for clustering. Graph-based community detection techniques have been applied to group n-grams of contiguous contours into clusters in Mandarin (Zhang, 2019). In recent work concurrent to ours, Fry (2020) uses adversarial autoencoders and hierarchical clustering to identify tone inventories, and evaluate their method on Mandarin, Cantonese, Fungwa, and English data.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Levow (2006) performed unsupervised and semi-supervised tone clustering in Mandarin, using average pitch and slope as features, and k-means and asymmetric k-lines for clustering. Graph-based community detection techniques have been applied to group n-grams of contiguous contours into clusters in Mandarin (Zhang, 2019). In recent work concurrent to ours, Fry (2020) uses adversarial autoencoders and hierarchical clustering to identify tone inventories, and evaluate their method on Mandarin, Cantonese, Fungwa, and English data.…”
Section: Related Workmentioning
confidence: 99%
“…Levow [12] performed unsupervised and semisupervised tone clustering in Mandarin, using average pitch and slope as features, and k-means and asymmetric k-lines for clustering. Graph-based community detection techniques have been applied to group n-grams of contiguous contours into clusters in Mandarin [13]. Our work appears to be the first model to use unsupervised deep neural networks for phonemic tone clustering.…”
Section: Related Workmentioning
confidence: 99%