Proceedings of the 3rd Workshop on Computational Approaches to Historical Language Change 2022
DOI: 10.18653/v1/2022.lchange-1.11
|View full text |Cite
|
Sign up to set email alerts
|

Lexicon of Changes: Towards the Evaluation of Diachronic Semantic Shift in Chinese

Abstract: Recent research has brought a wind of using computational approaches to the classic topic of semantic change, aiming to tackle one of the most challenging issues in the evolution of human language. While several methods for detecting semantic change have been proposed, such studies are limited to a few languages, where evaluation datasets are available.This paper presents the first dataset for evaluating Chinese semantic change in contexts preceding and following the Reform and Openingup, covering a 50-year pe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 15 publications
(30 reference statements)
0
7
0
Order By: Relevance
“…The weighted mean pairwise Spearman score for inter-rater agreements is 0.691, and the Krippendorff's alpha is 0.602, which are quite high if compared to other DURel datasets (Schlechtweg et al, 2021(Schlechtweg et al, , 2020(Schlechtweg et al, , 2018Erk et al, 2013;Chen et al, 2022). For more statistics, see Table 3.…”
Section: Human Annotationmentioning
confidence: 93%
See 2 more Smart Citations
“…The weighted mean pairwise Spearman score for inter-rater agreements is 0.691, and the Krippendorff's alpha is 0.602, which are quite high if compared to other DURel datasets (Schlechtweg et al, 2021(Schlechtweg et al, , 2020(Schlechtweg et al, , 2018Erk et al, 2013;Chen et al, 2022). For more statistics, see Table 3.…”
Section: Human Annotationmentioning
confidence: 93%
“…The DURel framework and its extension DWUGs have been applied to constructing evaluation datasets for a variety of languages, such as English, Swedish, German, and Latin released in the SemEval 2020 (Schlechtweg et al, 2020), and later for Russian, Norwegian, Spanish, and Chinese (Rodina and Kutuzov, 2020;Kutuzov and Pivovarova, 2021;Kutuzov et al, 2022;Zamora-Reina et al, 2022;Chen et al, 2022). Since the nature of this paradigm is to measure usage differences between sentence pairs, it has also been extended to the construction of synchronic disambiguation datasets (Aksenova et al, 2022;Hätty et al, 2019) and to diatopic variation (i.e., usage differences across regional variations) (Baldissin et al, 2022).…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The DURel framework and its extension DWUGs have been applied to constructing evaluation datasets for a variety of languages, such as English, Swedish, German, and Latin released in the SemEval 2020 , and later for Russian, Norwegian, Spanish, and Chinese (Rodina and Kutuzov, 2020;Kutuzov and Pivovarova, 2021;Kutuzov et al, 2022;Chen et al, 2022). Since the nature of this paradigm is to measure usage differences between sentence pairs, it has also been extended to the construction of synchronic disambiguation datasets (Aksenova et al, 2022; and to diatopic variation (i.e., usage differences across regional variations) (Baldissin et al, 2022).…”
Section: Related Workmentioning
confidence: 99%
“…The increasing number of published evaluation datasets further fostered the domain, enabling different models and hyperparameters to be quantitatively tested on the same benchmarks (Kutuzov et al, 2022;Aksenova et al, 2022;Chen et al, 2022;Basile et al, 2019). These datasets are predominantly constructed within the framework of Diachronic Usage Relatedness (DURel), wherein changing scores are generated by calculating human ratings on semantic relatedness across a variety of usage pairs for targets Rodina and Kutuzov, 2020;Chen et al, 2022). In the extended DURel framework, namely Diachronic Word Usage Graphs (DWUGs) , the usages could be further populated through Word Usage Graphs (WUGs) for visualization (McCarthy et al, 2016;Kutuzov et al, 2022).…”
Section: Introductionmentioning
confidence: 99%