Know What You Don’t Know: Modeling a Pragmatic Speaker that Refers to Objects of Unknown Categories

Zarrieß, Sina; Schlangen, David

doi:10.18653/v1/p19-1063

Cited by 13 publications

(12 citation statements)

References 25 publications

“…In the language and vision community, pragmatic aspects have been taken into account in the task of IC, where approaches building on Bayesian frameworks have been proposed to generate descriptions that contrastively refer to one but not another (similar) image (Achlioptas et al., 2019; Andreas & Klein, 2016; Cohn‐Gordon et al., 2018; Monroe et al., 2017). Similar approaches have been proposed for zero‐shot referring expression generation (Zarrieß & Schlangen, 2019).…”

Section: Revisiting the Wishes From The Past (mentioning)

confidence: 99%

Linguistic issues behind visual question answering

Bernardi

Pezzelle

2021

Language and Linguist. Compass

Answering a question that is grounded in an image is a crucial ability that requires understanding the question, the visual context, and their interaction at many linguistic levels: among others, semantics, syntax and pragmatics. As such, visually‐grounded questions have long been of interest to theoretical linguists and cognitive scientists. Moreover, they have inspired the first attempts to computationally model natural language understanding, where pioneering systems were faced with the highly challenging task—still unsolved—of jointly dealing with syntax, semantics and inference whilst understanding a visual context. Boosted by impressive advancements in machine learning, the task of answering visually‐grounded questions has experienced a renewed interest in recent years, to the point of becoming a research sub‐field at the intersection of computational linguistics and computer vision. In this paper, we review current approaches to the problem which encompass the development of datasets, models and frameworks. We conduct our investigation from the perspective of the theoretical linguists; we extract from pioneering computational linguistic work a list of desiderata that we use to review current computational achievements. We acknowledge that impressive progress has been made to reconcile the engineering with the theoretical view. At the same time, we claim that further research is needed to get to a unified approach which jointly encompasses all the underlying linguistic problems. We conclude the paper by sharing our own desiderata for the future.

“…Zarrieß and Schlangen [171] extend RSA-based reasoning to a zero-shot setting, where the speaker's task is to refer to target object of an "unknown" category that the literal speaker has not encountered during training. This resembles the set-up described in Anderson et al [161], where the decoding procedure extends the capabilities of the underlying language model to out-of-domain data, though Zarrieß and Schlangen [171]'s reasoning scheme does not widen the model's vocabulary but aims at leveraging the training vocabulary in efficient way for referring to unknown objects.…”

Section: Conversational Goals (mentioning)

confidence: 99%

Decoding Methods in Neural Language Generation: A Survey

2021

Self Cite

Neural encoder-decoder models for language generation can be trained to predict words directly from linguistic or non-linguistic inputs. When generating with these so-called end-to-end models, however, the NLG system needs an additional decoding procedure that determines the output sequence, given the infinite search space over potential sequences that could be generated with the given vocabulary. This survey paper provides an overview of the different ways of implementing decoding on top of neural network-based generation models. Research into decoding has become a real trend in the area of neural language generation, and numerous recent papers have shown that the choice of decoding method has a considerable impact on the quality and various linguistic properties of the generation output of a neural NLG system. This survey aims to contribute to a more systematic understanding of decoding methods across different areas of neural NLG. We group the reviewed methods with respect to the broad type of objective that they optimize in the generation of the sequence—likelihood, diversity, and task-specific linguistic constraints or goals—and discuss their respective strengths and weaknesses.

“…This assumption is reasonable for classification. It is also common practice when working with a finite set of intents in RSA (Monroe et al, 2017;Zarrieß and Schlangen, 2019). The column norm (Equation 6) describes a mathematical formulation aligned with how a pragmatic listener infers speaker expectations.…”

Section: Related Work (mentioning)

confidence: 99%

“…Previous work has applied RSA to systems that generate and understand language (Andreas and Klein, 2016;Mao et al, 2016;Vedantam et al, 2017;Cohn-Gordon et al, 2018;Zarrieß and Schlangen, 2019) in both referential games (Frank and Goodman, 2012;Goodman and Frank, 2016;Monroe et al, 2017) and sequential decisionmaking systems (Fried et al, 2018a,b). Our method departs from these applications by focusing on the ambiguity avoidance property of the listener agent as applied to generic classification tasks.…”

Section: Related Work (mentioning)

confidence: 99%

When in Doubt: Improving Classification Performance with Alternating Normalization

Jia

Reiter

Lim et al.

2021

Preprint

We introduce Classification with Alternating Normalization (CAN), a non-parametric postprocessing step for classification. CAN improves classification accuracy for challenging examples by re-adjusting their predicted class probability distribution using the predicted class distributions of high-confidence validation examples. CAN is easily applicable to any probabilistic classifier, with minimal computation overhead. We analyze the properties of CAN using simulated experiments, and empirically demonstrate its effectiveness across a diverse set of classification tasks.

Footnote 1: We use a classical pragmatic reasoning example for our illustration, highlighting our inspiration in the Rational Speech Act (RSA; Frank and Goodman, 2012) model, which we discuss in Section 6.
