Hongxu Hou scite author profile

Keywords are considered to be important words in the text and can provide a concise representation of the text. With the surge of unlabeled short text on the Internet, automatic keyword extraction task has proven useful in other information processing applications. Graph-based approaches are prevalent unsupervised models for this task. However, most of these methods emphasize the importance of the relation between words without considering other importance factors. Furthermore, when measuring the importance of a word in a text, the damping factor is set to 0.85 following PageRank. To the best of our knowledge, there is no existing work investigating the impact of the damping factor on the keyword extraction task. In addition, there are few publicly available labeled Chinese short text datasets for this task. In this article, we investigate the importance parts of words in a given document and propose an improved graph-based method for keyword extraction from short documents. Moreover, we analyze the impact of importance factors on performance. We also provide annotated long and short Chinese datasets for this task. The model is performed on Chinese and English datasets, and results show that our model obtains improvements in performance over the previous unsupervised models on short documents. Comparative experiments show that the damping factor is related to the text length, which is neglected in traditional methods.

show abstract

Dynamic Mask Curriculum Learning for Non-Autoregressive Neural Machine Translation

Wang¹,

Hou²,

Sun³

et al. 2022

View full text Add to dashboard Cite

GCNDA: Graph Convolutional Networks with Dual Attention Mechanisms for Aspect Based Sentiment Analysis

Chen

Hou

Gao

et al. 2019

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hongxu Hou

Graph Convolutional Networks with Structural Attention Model for Aspect Based Sentiment Analysis

Neural Machine Translation Based on Improved Actor-Critic Method

Inside Importance Factors of Graph-Based Keyword Extraction on Chinese Short Text

Dynamic Mask Curriculum Learning for Non-Autoregressive Neural Machine Translation

GCNDA: Graph Convolutional Networks with Dual Attention Mechanisms for Aspect Based Sentiment Analysis

Contact Info

Product

Resources

About