Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1240
|View full text |Cite
|
Sign up to set email alerts
|

Topic-Aware Neural Keyphrase Generation for Social Media Language

Abstract: A huge volume of user-generated content is daily produced on social media. To facilitate automatic language understanding, we study keyphrase prediction, distilling salient information from massive posts. While most existing methods extract words from source posts to form keyphrases, we propose a sequence-to-sequence (seq2seq) based neural keyphrase generation framework, enabling absent keyphrases to be created. Moreover, our model, being topic-aware, allows joint modeling of corpus-level latent topic represen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
52
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
4
2

Relationship

1
9

Authors

Journals

citations
Cited by 69 publications
(60 citation statements)
references
References 28 publications
(42 reference statements)
0
52
0
Order By: Relevance
“…They report improvements on keyphrase diversity measured using distinct-1 and distinct-2 metrics described in [77]. In [78] they create another hybrid system that infuses topical information into the encoder-decoder framework. They use an NTM (Neural Topic Model) for grasping the latent topic aspects of the input text.…”
Section: B Enhanced and Hybrid Solutionsmentioning
confidence: 99%
“…They report improvements on keyphrase diversity measured using distinct-1 and distinct-2 metrics described in [77]. In [78] they create another hybrid system that infuses topical information into the encoder-decoder framework. They use an NTM (Neural Topic Model) for grasping the latent topic aspects of the input text.…”
Section: B Enhanced and Hybrid Solutionsmentioning
confidence: 99%
“…Moreover, to well align the images' semantics to texts', we adopt image wordings and define two forms for that -explicit optical characters (such as "NBA Finals" in post (b)) detected from the optical character reader (OCR) and implicit image attributes (Wu et al, 2006), high-level text labels predicted to summarize the image's semantic concepts (such as a "cat" label for post (a)). Furthermore, unlike prior work employing either classification or generation models (Wang et al, 2019a), we propose a unified framework to couple the advantages of keyphrase classification and generation. Specifically, in addition to the joint training of both modules, we further extend the copy mechanism (See et al, 2017) to explicitly aggregate classification outputs together with tokens from the source input.…”
Section: Introductionmentioning
confidence: 99%
“…In this work, we define two kinds of agents for two generation tasks which can interact and share useful information. We also notice that in other areas, there are also some works (Xing et al, 2017;Wang et al, 2019c) consider incorporating topic information.…”
Section: Case Studymentioning
confidence: 65%