Xingwu Sun scite author profile

In this paper, we focus on the problem of question generation (QG). Recent neural networkbased approaches employ the sequence-tosequence model which takes an answer and its context as input and generates a relevant question as output. However, we observe two major issues with these approaches: (1) The generated interrogative words (or question words) do not match the answer type. (2) The model copies the context words that are far from and irrelevant to the answer, instead of the words that are close and relevant to the answer. To address these two issues, we propose an answer-focused and position-aware neural question generation model. (1) By answerfocused, we mean that we explicitly model question word generation by incorporating the answer embedding, which can help generate an interrogative word matching the answer type. (2) By position-aware, we mean that we model the relative distance between the context words and the answer. Hence the model can be aware of the position of the context words when copying them to generate a question. We conduct extensive experiments to examine the effectiveness of our model. The experimental results show that our model significantly improves the baseline and outperforms the state-of-the-art system.

show abstract

Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval

Tang¹,

Sun²,

Jin³

et al. 2021

View full text Add to dashboard Cite

Recently, the retrieval models based on dense representations have been gradually applied in the first stage of the document retrieval tasks, showing better performance than traditional sparse vector space models. To obtain high efficiency, the basic structure of these models is Bi-encoder in most cases. However, this simple structure may cause serious information loss during the encoding of documents since the queries are agnostic. To address this problem, we design a method to mimic the queries on each of the documents by an iterative clustering process and represent the documents by multiple pseudo queries (i.e., the cluster centroids). To boost the retrieval process using approximate nearest neighbor search library, we also optimize the matching function with a two-step score calculation procedure. Experimental results on several popular ranking and QA datasets show that our model can achieve state-of-the-art results.

show abstract

Answer-Focused and Position-Aware Neural Network for Transfer Learning in Question Generation

Sun²,

Cao

et al. 2019

View full text Add to dashboard Cite

Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval

Tang

Sun

Jin

et al. 2021

Preprint

View full text Add to dashboard Cite

TABLE: A Task-Adaptive BERT-based ListwisE Ranking Model for Document Retrieval

Sun¹,

Tang

Zhang³

et al. 2020

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Xingwu Sun

Answer-focused and Position-aware Neural Question Generation

Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval

Answer-Focused and Position-Aware Neural Network for Transfer Learning in Question Generation

Improving Document Representations by Generating Pseudo Query Embeddings for Dense Retrieval

TABLE: A Task-Adaptive BERT-based ListwisE Ranking Model for Document Retrieval

Contact Info

Product

Resources

About