Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (2021)
DOI: 10.18653/v1/2021.eacl-main.295
Probing the Probing Paradigm: Does Probing Accuracy Entail Task Relevance?

Abstract: Although neural models have achieved impressive results on several NLP benchmarks, little is understood about the mechanisms they use to perform language tasks. Thus, much recent attention has been devoted to analyzing the sentence representations learned by neural encoders, through the lens of 'probing' tasks. However, to what extent is the information encoded in sentence representations, as discovered through a probe, actually used by the model to perform its task? In this work, we examine this probing paradigm…

Cited by 52 publications (53 citation statements)
References 47 publications
“…Ethayarajh, 2019; Mimno and Thompson, 2017), including the recently proposed DIRECTPROBE (Zhou and Srikumar, 2021), which we use in this work. Another line of probing work is to design control tasks (Ravichander et al., 2021; Lan et al., 2020) to reverse-engineer the internal mechanisms of representations (Kovaleva et al., 2019). However, in contrast to our work, most studies focused on the pre-trained representations, not the fine-tuned ones.…”
Section: Related Work (mentioning)
confidence: 82%
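
The probing setup quoted above trains a lightweight classifier on frozen representations; the control-task idea compares that probe against one trained on content-free labels. Below is a minimal Python sketch of this comparison under stated assumptions: random arrays stand in for encoder embeddings and property labels, a simple label permutation stands in for the word-type controls of the cited work, and the probe is a generic linear classifier rather than DIRECTPROBE itself.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Illustrative stand-ins: frozen sentence representations (e.g., from a
# pre-trained encoder) and a binary linguistic property to probe for.
n, d = 2000, 768
X = rng.normal(size=(n, d))        # placeholder embeddings
y = rng.integers(0, 2, size=n)     # placeholder property labels

# Control labels: same distribution, but any real signal is destroyed.
# (A permutation is a simplification of per-word-type random controls.)
y_control = rng.permutation(y)

def probe_accuracy(X, y):
    """Train a linear probe on the first half, score on the held-out half."""
    half = len(y) // 2
    clf = LogisticRegression(max_iter=1000).fit(X[:half], y[:half])
    return clf.score(X[half:], y[half:])

acc_task = probe_accuracy(X, y)
acc_control = probe_accuracy(X, y_control)

# Selectivity = task accuracy minus control accuracy: high probe accuracy
# with low selectivity suggests the probe, not the representation, is doing
# the work.
print(f"task {acc_task:.3f}  control {acc_control:.3f}  "
      f"selectivity {acc_task - acc_control:.3f}")
```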
“…Validity measures how well the test measures what it intends to measure. In a valid test, the result is right for the right reasons (McCoy et al., 2019; Ravichander et al., 2021). Robustness measures how well the results of a test can generalize from the experimental setting to real-world settings (Xing et al., 2020; Niu et al., 2020).…”
Section: Probing Methods (mentioning)
confidence: 99%
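
To make the validity notion above ("right for the right reasons") concrete, here is a hedged toy sketch: a classifier is trained on data where a spurious cue agrees with the label, then evaluated on a diagnostic split where the cue disagrees, in the spirit of McCoy et al.'s diagnostic sets. The data, dimensions, and the make_split helper are illustrative assumptions, not any cited dataset.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)

def make_split(n, d, cue_agrees):
    """Toy data: feature 0 carries the true signal; feature 1 is a spurious
    cue that either agrees with the label (training-like) or contradicts it
    (diagnostic)."""
    X = rng.normal(size=(n, d))
    y = (X[:, 0] > 0).astype(int)
    cue = y if cue_agrees else 1 - y
    X[:, 1] = cue + 0.1 * rng.normal(size=n)
    return X, y

X_train, y_train = make_split(1000, 10, cue_agrees=True)
X_iid, y_iid = make_split(1000, 10, cue_agrees=True)
X_diag, y_diag = make_split(1000, 10, cue_agrees=False)

clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# A valid test is passed for the right reasons: accuracy should hold up
# when the spurious cue no longer agrees with the label.
print("i.i.d. accuracy:    ", clf.score(X_iid, y_iid))
print("diagnostic accuracy:", clf.score(X_diag, y_diag))
```

A large gap between the two scores indicates the in-distribution result was obtained for the wrong reasons, which is exactly the failure mode a validity check is meant to expose.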
“…This has been a subject of recent criticism for probing methods, on the basis of "correlation does not equal causation": although probing methods infer that the model represents some concept, no guarantee is given on whether the model actually uses this concept to make its decisions [39,96,112]. This has led to the development of a causally-informed class of methods [32,38,115] that do provide a stronger guarantee that causality is correctly attributed, e.g., by showing that the model indeed changes its decision once it ceases to recognize the concept, via a derived counterfactual [28,32].…”
Section: Concept Attribution (mentioning)
confidence: 99%
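
The counterfactual-style check described above can be sketched with linear concept erasure: recover a concept direction with a probe, project it out of the representations, and test whether the downstream model's decisions change. The single null-space projection below is a one-step stand-in for iterative erasure methods such as INLP; the data and variable names are illustrative assumptions.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(2)

# Illustrative representations in which a linearly readable "concept" lives.
n, d = 1500, 50
Z = rng.normal(size=(n, d))
concept = (Z[:, 0] > 0).astype(int)   # the concept sits along dimension 0
task_label = concept                  # a downstream task that relies on it

task_clf = LogisticRegression(max_iter=1000).fit(Z, task_label)

# Step 1: probe for the concept and take the probe's weight direction.
probe = LogisticRegression(max_iter=1000).fit(Z, concept)
w = probe.coef_[0] / np.linalg.norm(probe.coef_[0])

# Step 2: counterfactual intervention by projecting the concept direction
# out of every representation (null-space projection).
Z_erased = Z - np.outer(Z @ w, w)

# Step 3: if the model actually uses the concept, its decisions should flip
# once the concept is no longer recoverable.
flip_rate = np.mean(task_clf.predict(Z) != task_clf.predict(Z_erased))
print(f"prediction flip rate after erasure: {flip_rate:.3f}")
```

A high flip rate supports a causal reading (the model used the concept), whereas unchanged predictions would suggest the probe detected information the model never relied on.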