Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2018
DOI: 10.18653/v1/d18-1493
Language Modeling with Sparse Product of Sememe Experts

Abstract: Most language modeling methods rely on large-scale data to statistically learn the sequential patterns of words. In this paper, we argue that words are atomic language units but not necessarily atomic semantic units. Inspired by HowNet, we use sememes, the minimum semantic units in human languages, to represent the implicit semantics behind words for language modeling, named Sememe-Driven Language Model (SDLM). More specifically, to predict the next word, SDLM first estimates the sememe distribution given textu…
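The abstract describes a two-step prediction scheme: estimate a sememe distribution from the context, then let each sememe act as an expert over the words it annotates, combining the experts multiplicatively. The following is a minimal toy sketch of that idea in NumPy; all dimensions, weight matrices (`W_sem`, `W_word`), and the annotation matrix `A` are hypothetical illustrations, not the paper's actual architecture or data.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy sizes, not taken from the paper's experiments.
vocab_size, num_sememes, hidden = 10, 6, 8

# Binary sememe annotation matrix: A[s, w] = 1 if word w is annotated
# with sememe s (in HowNet such annotations come from human experts).
A = (rng.random((num_sememes, vocab_size)) < 0.4).astype(float)

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def sdlm_next_word_probs(h, W_sem, W_word):
    # Step 1: estimate a sememe distribution from the context vector h.
    p_sememe = softmax(W_sem @ h)          # shape: (num_sememes,)
    # Step 2: each sememe expert scores only the words it annotates;
    # experts are combined multiplicatively (a product of experts),
    # i.e. additively in log space, weighted by the sememe probabilities.
    word_scores = W_word @ h               # shape: (vocab_size,)
    log_expert = A * word_scores           # per-expert masked log scores
    combined = p_sememe @ log_expert       # weighted sum in log space
    return softmax(combined)               # next-word distribution

h = rng.standard_normal(hidden)
W_sem = rng.standard_normal((num_sememes, hidden))
W_word = rng.standard_normal((vocab_size, hidden))
p = sdlm_next_word_probs(h, W_sem, W_word)
```

The sparsity in "sparse product of experts" comes from the annotation matrix: each sememe expert only assigns mass to the small set of words carrying that sememe, so most entries of `log_expert` are zero.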

Cited by 30 publications (30 citation statements)
References 34 publications
“…HowNet, as the most well-known sememe KB, has attracted wide research attention. Previous work applies the sememe knowledge of HowNet to various NLP applications, such as word similarity computation (Liu and Li, 2002), word sense disambiguation (Gan and Wong, 2000; Zhang et al., 2005; Duan et al., 2007), sentiment analysis (Zhu et al., 2006; Dang and Zhang, 2010; Fu et al., 2013), word representation learning (Niu et al., 2017), language modeling (Gu et al., 2018), lexicon expansion (Zeng et al., 2018) and semantic rationality evaluation.…”
Section: Sememes and HowNet
Mentioning; confidence: 99%
“…HowNet (Dong and Dong, 2003) is a widely acknowledged sememe knowledge base (KB), which defines about 2,000 sememes and uses them to annotate over 100,000 Chinese words together with their English translations. Sememes and HowNet have been successfully utilized in a variety of NLP tasks including sentiment analysis (Dang and Zhang, 2010), word representation learning (Niu et al., 2017), language modeling (Gu et al., 2018), etc.…”
Section: Introduction
Mentioning; confidence: 99%
“…Xin et al. (2018) use a similarly purposed entity typing module and a LM-enhancement module. Instead of entity type generation, Gu et al. (2018) propose to explicitly decompose word generation into sememe (a minimal semantic unit of meaning) generation and sense generation, but this requires sememe labels. Yang et al. (2016) propose a pointer-network LM that can point to a 1-D or 2-D database record during inference.…”
Section: Related Work
Mentioning; confidence: 99%
“…Since HowNet was published (Dong and Dong, 2003), it has attracted wide attention of researchers. Most related works focus on applying HowNet to specific NLP tasks (Liu and Li, 2002; Zhang et al., 2005; Sun et al., 2007; Dang and Zhang, 2010; Fu et al., 2013; Niu et al., 2017; Zeng et al., 2018; Gu et al., 2018). To the best of our knowledge, only and Jin et al. (2018) conduct studies of augmenting HowNet by recommending sememes for new words.…”
Section: Related Work
Mentioning; confidence: 99%