Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2021
DOI: 10.18653/v1/2021.acl-long.237

A Semantic-based Method for Unsupervised Commonsense Question Answering

Abstract: Unsupervised commonsense question answering is appealing since it does not rely on any labeled task data. Among existing work, a popular solution is to use pre-trained language models to score candidate choices directly conditioned on the question or context. However, such scores from language models can be easily affected by irrelevant factors, such as word frequencies, sentence structures, etc. These distracting factors may not only mislead the model to choose a wrong answer but also make it oversensitive to…

Cited by 8 publications (11 citation statements)
References 25 publications (44 reference statements)
“…Therefore, we report the experimental results on their development sets for a fair comparison (Shwartz et al., 2020). For COPA, which only provides development and test sets, we follow Niu et al. (2021) to train models on the development set and evaluate performance on the test set. For the commonsense KG, we adopt ConceptNet (Speer et al., 2017), a general-domain and task-agnostic CSKG, as our external knowledge source G for all the above models and tasks.…”
Section: Methods
confidence: 99%
“…This is computed as the conditional probability of the answer given a domain-specific prefix, such as "The sentiment of the movie is" for sentiment analysis or "The answer is" for general QA tasks. SEQA (Niu et al., 2021) mitigates the sensitivity to word choice by generating answers using GPT-2 and selecting the answer choice most similar to the generated answers.…”
Section: Plausibility Scoring
confidence: 99%
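To make the prefix-conditioned scoring concrete, below is a minimal sketch of scoring answer candidates by their conditional log-probability under GPT-2, assuming the HuggingFace transformers library; the prompt, candidates, and helper name are illustrative, not taken from the cited papers.

```python
# A minimal sketch of prefix-conditioned LM scoring (illustrative, not
# the cited papers' exact setup).
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def answer_log_prob(context: str, answer: str) -> float:
    """Sum of token log-probabilities of `answer` conditioned on `context`."""
    ctx_ids = tokenizer(context, return_tensors="pt").input_ids
    ans_ids = tokenizer(" " + answer, return_tensors="pt").input_ids
    input_ids = torch.cat([ctx_ids, ans_ids], dim=1)
    with torch.no_grad():
        logits = model(input_ids).logits
    # logits at position p predict the token at position p + 1.
    log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
    ans_positions = range(ctx_ids.size(1) - 1, input_ids.size(1) - 1)
    return sum(log_probs[p, input_ids[0, p + 1]].item() for p in ans_positions)

context = "The movie was dull and far too long. The sentiment of the movie is"
candidates = ["positive", "negative"]
scores = {c: answer_log_prob(context, c) for c in candidates}
print(max(scores, key=scores.get))  # expected: "negative"
```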
“…In multiple-choice question answering (MCQA) tasks, zero-shot methods typically rely on the language model (LM) probabilities as a proxy for plausibility, predicting the answer choice with the highest probability conditioned on the question. The LM score is a naïve proxy for plausibility, since it confounds factors such as length, unigram frequency, and more (Holtzman et al., 2021; Niu et al., 2021). Indeed, in Figure 1, a GPT-2 based LM score incorrectly predicts that the woman hired a lawyer because she decided to run for office, rather than because she decided to sue her employer.…”
Section: Introduction
confidence: 99%
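As a toy illustration of the length confound named above, the sketch below compares a raw sum of token log-probabilities with a per-token average for the two explanations in the passage's Figure 1 example; the model choice and the averaging are assumptions made for illustration, not the cited papers' method.

```python
# Toy illustration: a raw sum of log-probabilities penalizes longer
# candidates, which a per-token average partially corrects.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def candidate_scores(context: str, answer: str):
    ids = tokenizer(context + " " + answer, return_tensors="pt").input_ids
    n_ctx = tokenizer(context, return_tensors="pt").input_ids.size(1)
    with torch.no_grad():
        log_probs = torch.log_softmax(model(ids).logits[0, :-1], dim=-1)
    token_lps = [log_probs[p, ids[0, p + 1]].item()
                 for p in range(n_ctx - 1, ids.size(1) - 1)]
    return sum(token_lps), sum(token_lps) / len(token_lps)  # raw vs. per-token

ctx = "The woman hired a lawyer because"
for ans in ["she decided to sue her employer.",
            "she decided to run for office."]:
    raw, norm = candidate_scores(ctx, ans)
    print(f"{ans!r}: raw={raw:.2f}, per-token={norm:.2f}")
```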
“…Semantic similarity. Niu et al. (2021) show that semantic similarity matching can make PLMs robust against irrelevant factors such as word frequencies. Specifically, we first use PLMs to generate plausible answers, then compute the similarity between each generated answer and each of the provided answer candidates, and finally select the candidate with the highest similarity score as the correct answer.…”
Section: Linguistic Reasoning
confidence: 99%
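A minimal sketch of this generate-then-match idea follows, assuming the transformers and sentence-transformers libraries; the sampling settings, encoder choice, and example question are illustrative assumptions rather than the exact SEQA configuration of Niu et al. (2021).

```python
# Sketch: generate free-form answers with a PLM, then pick the candidate
# most semantically similar to the generated answers (illustrative settings).
from transformers import pipeline
from sentence_transformers import SentenceTransformer, util

generator = pipeline("text-generation", model="gpt2")
encoder = SentenceTransformer("all-MiniLM-L6-v2")

question = "What do people typically do when they feel tired?"
candidates = ["go to sleep", "buy a car", "paint the fence"]

# Step 1: sample several plausible free-form answers from the PLM.
generations = generator(question, max_new_tokens=10, num_return_sequences=5,
                        do_sample=True, pad_token_id=50256)
answers = [g["generated_text"][len(question):].strip() for g in generations]

# Step 2: pick the candidate most similar (on average) to the samples.
cand_emb = encoder.encode(candidates, convert_to_tensor=True)
ans_emb = encoder.encode(answers, convert_to_tensor=True)
sim = util.cos_sim(cand_emb, ans_emb).mean(dim=1)  # one score per candidate
print(candidates[int(sim.argmax())])
```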