Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2018
DOI: 10.18653/v1/p18-1227
|View full text |Cite
|
Sign up to set email alerts
|

Incorporating Chinese Characters of Words for Lexical Sememe Prediction

Abstract: Sememes are minimum semantic units of concepts in human languages, such that each word sense is composed of one or multiple sememes. Words are usually manually annotated with their sememes by linguists, and form linguistic commonsense knowledge bases widely used in various NLP tasks. Recently, the lexical sememe prediction task has been introduced. It consists of automatically recommending sememes for words, which is expected to improve annotation efficiency and consistency. However, existing methods of lexica… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
34
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
3
1

Relationship

4
5

Authors

Journals

citations
Cited by 26 publications
(34 citation statements)
references
References 34 publications
0
34
0
Order By: Relevance
“…Sememe prediction is a well-defined task Jin et al, 2018;Qi et al, 2018), aimed at selecting appropriate sememes for unannotated words or phrases from the set of all the sememes. Existing works model sememe prediction as a multi-label classification problem, where sememes are regarded as the labels of words and phrases.…”
Section: Training For Mwe Sememe Predictionmentioning
confidence: 99%
See 1 more Smart Citation
“…Sememe prediction is a well-defined task Jin et al, 2018;Qi et al, 2018), aimed at selecting appropriate sememes for unannotated words or phrases from the set of all the sememes. Existing works model sememe prediction as a multi-label classification problem, where sememes are regarded as the labels of words and phrases.…”
Section: Training For Mwe Sememe Predictionmentioning
confidence: 99%
“…In HowNet, there are 118,346 Chinese words annotated with 2,138 sememes in total. Following previous work Jin et al, 2018), we filter out the low-frequency sememes, which are considered unimportant. The final number of sememes we use is 1,335.…”
Section: Datasetmentioning
confidence: 99%
“…Previous methods of sememe prediction for words usually compute an association score for each sememe and select the sememes with scores higher than a threshold to form the predicted sememe set (Xie et al 2017;Jin et al 2018). Following this formulation, we havê…”
Section: Spbs Task Formalizationmentioning
confidence: 99%
“…apply matrix factorization to predict sememes for words. Jin et al (2018) improve their work by incorporating character-level information. Our work extends the previous works and tries to combine word-sense-sememe hierar-chy with the sequential model.…”
Section: Sememementioning
confidence: 99%