Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
DOI: 10.18653/v1/2021.naacl-main.398

Factual Probing Is [MASK]: Learning vs. Learning to Recall

Abstract: Petroni et al. (2019) demonstrated that it is possible to retrieve world facts from a pretrained language model by expressing them as cloze-style prompts and interpret the model's prediction accuracy as a lower bound on the amount of factual information it encodes. Subsequent work has attempted to tighten the estimate by searching for better prompts, using a disjoint set of facts as training data. In this work, we make two complementary contributions to better understand these factual probing techniques. First…
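For concreteness, here is a minimal sketch of the cloze-style probing setup the abstract describes, using the Hugging Face fill-mask pipeline. The model (bert-base-cased) and the example fact are illustrative assumptions, not the paper's exact configuration:

```python
# Cloze-style factual probe in the spirit of Petroni et al. (2019): the fact
# (Dante, born-in, Florence) is posed as a [MASK] query and the model's top
# prediction is checked against the gold object.
from transformers import pipeline

fill = pipeline("fill-mask", model="bert-base-cased")  # assumed model choice
for pred in fill("Dante was born in [MASK]."):
    print(pred["token_str"], round(pred["score"], 3))
# If "Florence" ranks first, the fact counts as recalled; accuracy over many
# such queries lower-bounds the factual knowledge the pretrained LM encodes.
```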

Cited by 152 publications (113 citation statements)
References 22 publications (28 reference statements)

Citation statements:
“…The answer is simple. Typical language modeling corpora like Wikipedia are known to contain KB-like assertions about the world (Da and Kasai, 2019). LMs trained on enough such data can be expected to acquire some KB-like knowledge, even without targeted entity- or relation-level supervision.…”

[Figure residue from the citing paper: a taxonomy of knowledge-extraction methods — cloze prompting via prompt handcrafting (Petroni et al., 2019), automatic prompt engineering (Jiang et al., 2020b; Shin et al., 2020; Zhong et al., 2021; Qin and Eisner, 2021), adversarial prompt modification (Poerner et al., 2020), varying base prompts (Elazar et al., 2021; Heinzerling and Inui, 2021; Jiang et al., 2020a), and symbolic rule-based prompting (Talmor et al., 2020a); statement scores (§3.2) via single-LM scoring (Tamborrino et al., 2020) and dual-LM scoring (Davison et al., 2019; Shwartz et al., 2020).]

Section: Word-level Supervision
confidence: 99%
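The "statement scores" entry in the taxonomy above can be made concrete with a pseudo-log-likelihood sketch: mask each token of a candidate statement in turn and sum the masked LM's log-probabilities, so more plausible statements score higher. This is an illustrative single-LM variant in the spirit of Tamborrino et al. (2020); the model and example sentences are assumptions:

```python
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

tok = AutoTokenizer.from_pretrained("bert-base-cased")  # assumed model
mlm = AutoModelForMaskedLM.from_pretrained("bert-base-cased").eval()

@torch.no_grad()
def statement_score(text: str) -> float:
    """Pseudo-log-likelihood: sum of log P(token | rest) under the masked LM."""
    ids = tok(text, return_tensors="pt").input_ids
    total = 0.0
    for i in range(1, ids.size(1) - 1):       # skip [CLS] and [SEP]
        masked = ids.clone()
        masked[0, i] = tok.mask_token_id      # hide one token at a time
        logits = mlm(masked).logits[0, i]
        total += torch.log_softmax(logits, -1)[ids[0, i]].item()
    return total                              # higher = more plausible

print(statement_score("Dante was born in Florence."))  # expected: higher
print(statement_score("Dante was born in Beijing."))   # expected: lower
```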
“…Automatic prompt engineering is a promising alternative to prompt handcrafting for knowledge extraction from LMs (Liu et al., 2021a), as prompts engineered using discrete (Jiang et al., 2020b; Shin et al., 2020; Haviv et al., 2021) and continuous (Zhong et al., 2021; Qin and Eisner, 2021; Liu et al., 2021b) optimization have improved LMs' lower-bound performance on LAMA's underlying queries. Note, however, that optimized prompts are not always grammatical or intelligible (Shin et al., 2020).…”

Section: Cloze Prompting
confidence: 99%
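The continuous-optimization line of work this quote mentions, to which the cited paper's OptiPrompt belongs, replaces the hand-written relation template with trainable vectors and optimizes them by gradient descent while the LM stays frozen. A minimal sketch, assuming bert-base-cased, a 5-vector prompt, and a toy (subject, object) training pair; none of these are the authors' exact settings:

```python
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

model_name = "bert-base-cased"                 # assumed model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name).eval()
for p in model.parameters():
    p.requires_grad_(False)                    # freeze the LM entirely

embed = model.get_input_embeddings()
n_prompt = 5                                   # assumed prompt length
soft_prompt = torch.nn.Parameter(torch.randn(n_prompt, embed.embedding_dim) * 0.02)
optimizer = torch.optim.Adam([soft_prompt], lr=1e-3)

def loss_for_fact(subject: str, gold_object: str) -> torch.Tensor:
    """Cross-entropy of the gold object at [MASK], with the relation template
    expressed purely by trainable vectors: [CLS] subject <p1..pk> [MASK] [SEP]."""
    def special(tid: int) -> torch.Tensor:
        return embed(torch.tensor([[tid]]))
    subj = tokenizer(subject, add_special_tokens=False, return_tensors="pt").input_ids
    inputs_embeds = torch.cat(
        [special(tokenizer.cls_token_id), embed(subj), soft_prompt.unsqueeze(0),
         special(tokenizer.mask_token_id), special(tokenizer.sep_token_id)], dim=1)
    logits = model(inputs_embeds=inputs_embeds).logits
    mask_pos = 1 + subj.size(1) + n_prompt     # index of the [MASK] slot
    gold = tokenizer.convert_tokens_to_ids(gold_object)
    return torch.nn.functional.cross_entropy(logits[:, mask_pos], torch.tensor([gold]))

# One illustrative gradient step on a toy training fact:
optimizer.zero_grad()
loss_for_fact("Dante", "Florence").backward()
optimizer.step()
```

As the quote notes, the learned vectors need not correspond to grammatical or even intelligible text; only their effect on the [MASK] prediction is optimized.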
“…Recently, many prompt-based works have emerged, i.e., manually-designed (Schick and Schütze, 2021a,b; Mishra et al., 2021) or automatically-searched (Jiang et al., 2020; Shin et al., 2020; Gao et al., 2021) hard prompts, which are discrete tokens but not necessarily human-readable. Furthermore, soft prompts (Li and Liang, 2021; Hambardzumyan et al., 2021; Zhong et al., 2021; Liu et al., 2021) have been introduced: tunable embeddings rather than vocabulary tokens, which can be trained directly with task-specific supervision. Subsequent work demonstrates that this prompt tuning (PT) method can match the performance of full-parameter fine-tuning when the PLM is extremely large.…”

Section: Introduction
confidence: 99%
“…We harness the knowledge present in large-scale pre-trained language models (Davison et al., 2019; Zhou et al., 2020; Petroni et al., 2019; Zhong et al., 2021; Shin et al., 2020) to detect a rich set of biases. Our method prompts the LM with a textual post and labeled exemplars selected using a novel technique, along with instructions to detect bias in this post.…”

Section: Introduction
confidence: 99%
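A sketch of the instruction-plus-exemplars prompt format this quote describes; the instruction wording, labels, and exemplars are invented for illustration, and the cited paper's exemplar-selection technique itself is not reproduced here:

```python
def build_bias_prompt(post: str, exemplars: list[tuple[str, str]]) -> str:
    """Assemble an instruction, labeled exemplars, and the query post into a
    single prompt for a text-completion LM."""
    parts = ["Decide whether each post contains social bias. Answer Yes or No."]
    for text, label in exemplars:
        parts.append(f"Post: {text}\nBiased: {label}")
    parts.append(f"Post: {post}\nBiased:")    # the LM completes the label
    return "\n\n".join(parts)

prompt = build_bias_prompt(
    "People from that city are all lazy.",
    [("Everyone deserves equal respect.", "No"),
     ("Women are bad at math.", "Yes")],
)
print(prompt)  # feed this string to any autoregressive LM
```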