Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2021
DOI: 10.18653/v1/2021.acl-long.511
A Novel Estimator of Mutual Information for Learning to Disentangle Textual Representations

Abstract: Learning disentangled representations of textual data is essential for many natural language tasks such as fair classification, style transfer and sentence generation, among others. The dominant existing approaches in the context of text data either rely on training an adversary (discriminator) that aims at making attribute values difficult to infer from the latent code, or rely on minimising variational bounds of the mutual information between the latent code and the attribute value. However, the available m…
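To make the second family of approaches concrete, here is a minimal PyTorch sketch of a variational (CLUB-style) upper bound on the mutual information I(Z; A) between a latent code Z and a discrete attribute A. This is an illustrative sketch, not the paper's actual estimator; the class and parameter names are ours.

```python
import torch
import torch.nn as nn

class MIUpperBound(nn.Module):
    """CLUB-style variational upper bound on I(Z; A) for a discrete
    attribute A, using an auxiliary classifier q(a | z).
    Illustrative sketch only; not the paper's proposed estimator."""

    def __init__(self, latent_dim: int, num_attr_values: int, hidden: int = 128):
        super().__init__()
        self.q_net = nn.Sequential(
            nn.Linear(latent_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, num_attr_values),
        )

    def forward(self, z: torch.Tensor, a: torch.Tensor) -> torch.Tensor:
        log_q = self.q_net(z).log_softmax(dim=-1)           # log q(. | z)
        pos = log_q.gather(1, a.unsqueeze(1)).mean()        # E_{p(z,a)}[log q(a|z)]
        perm = torch.randperm(z.size(0), device=z.device)   # break the (z, a) pairing
        neg = log_q.gather(1, a[perm].unsqueeze(1)).mean()  # E_{p(z)p(a)}[log q(a|z)]
        return pos - neg                                    # upper-bound estimate of I(Z; A)
```

In training, such an estimate is typically added to the task loss as a penalty on the encoder, while q(a | z) is fitted on paired samples in alternating steps.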

Cited by 16 publications (10 citation statements)
References 43 publications
“…Among available contrast measures, the Fisher-Rao distance is parameter-free and thus easy to use in practice, while the AB-divergence achieves better results but requires selecting α and β. Future work includes extending our metrics to new tasks such as SLU (Chapuis et al. 2020, 2021; Dinkar et al. 2020; Colombo, Clavel, and Piantanida 2021), controlled sentence generation (Colombo et al. 2019, 2021b) and multi-modal learning (Colombo et al. 2021a; Garcia et al. 2019).…”
Section: Summary and Concluding Remarks (mentioning)
confidence: 99%
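As background on why the statement calls the Fisher-Rao distance parameter-free: for categorical distributions it has a simple closed form with nothing to tune. A minimal NumPy sketch (the function name is ours):

```python
import numpy as np

def fisher_rao_distance(p: np.ndarray, q: np.ndarray) -> float:
    """Fisher-Rao (Rao geodesic) distance between two categorical
    distributions: d(p, q) = 2 * arccos(sum_i sqrt(p_i * q_i)).
    Parameter-free, unlike the AB-divergence, which needs alpha and beta."""
    bc = np.sum(np.sqrt(p * q))              # Bhattacharyya coefficient
    return float(2.0 * np.arccos(np.clip(bc, 0.0, 1.0)))

# Example: distance between two distributions over the same support.
print(fisher_rao_distance(np.array([0.5, 0.5]), np.array([0.9, 0.1])))
```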
“…Therefore, this objective minimizes an upper-bound estimate of the MI between each pair of latent spaces, following (Cheng et al., 2020a,b; Colombo et al., 2021).…”
Section: Mutual-information Minimization (Min): a Natural Measure Of ... (mentioning)
confidence: 99%
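For two continuous latent codes, the pairwise upper bound this statement refers to can be estimated with a Gaussian variational network q(z2 | z1), in the spirit of CLUB (Cheng et al., 2020). A rough sketch under that assumption, with all names ours:

```python
import torch
import torch.nn as nn

class PairwiseMIUpperBound(nn.Module):
    """CLUB-style upper bound on I(Z1; Z2) between two continuous latent
    codes, with a diagonal-Gaussian variational approximation q(z2 | z1)."""

    def __init__(self, dim1: int, dim2: int, hidden: int = 128):
        super().__init__()
        self.mu = nn.Sequential(nn.Linear(dim1, hidden), nn.ReLU(), nn.Linear(hidden, dim2))
        self.logvar = nn.Sequential(nn.Linear(dim1, hidden), nn.ReLU(), nn.Linear(hidden, dim2))

    def log_q(self, z1: torch.Tensor, z2: torch.Tensor) -> torch.Tensor:
        mu, logvar = self.mu(z1), self.logvar(z1)
        # log q(z2 | z1) up to an additive constant (the constant cancels in the bound)
        return -0.5 * (((z2 - mu) ** 2) / logvar.exp() + logvar).sum(dim=-1)

    def forward(self, z1: torch.Tensor, z2: torch.Tensor) -> torch.Tensor:
        pos = self.log_q(z1, z2).mean()                     # paired (joint) samples
        perm = torch.randperm(z1.size(0), device=z1.device)
        neg = self.log_q(z1, z2[perm]).mean()               # shuffled (product-of-marginals) samples
        return pos - neg                                    # upper-bound estimate of I(Z1; Z2)
```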
“…Still, no previous work has tested whether negation, uncertainty, and content can be disentangled, as linguistic theory suggests, although previous works have disentangled attributes such as syntax, semantics, and style (Balasubramanian et al., 2021; John et al., 2019; Cheng et al., 2020b; Bao et al., 2019; Hu et al., 2017; Colombo et al., 2021). To fill this gap, we aim to answer the following research questions:…”
Section: Introduction (mentioning)
confidence: 99%
“…Disentangled VAEs in language: Early approaches to text disentanglement use VAEs with multiple adversarial losses for style transfer (Hu et al., 2017; John et al., 2019). More recently, Cheng et al. (2020) propose a style transfer method which minimizes the mutual information between the latent and the observed variable, while Colombo et al. (2021) propose an upper bound of mutual information for fair text classification. Disentanglement of syntactic and semantic information in sentences is explored by , using multiple losses for word ordering and paraphrasing, and by Bao et al. (2019) with linearized constituency tree losses.…”
Section: Related Work (mentioning)
confidence: 99%
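For contrast with the MI-based penalties above, the adversarial-loss approach the statement mentions first (Hu et al., 2017; John et al., 2019) is commonly realized with a gradient-reversal layer. A hedged sketch of that common pattern, with names ours rather than from the cited works:

```python
import torch
import torch.nn as nn
from torch.autograd import Function

class GradReverse(Function):
    """Identity on the forward pass, negated gradient on the backward pass,
    so the encoder learns to make the attribute hard to predict."""

    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lambd * grad_output, None

def adversarial_attribute_logits(z: torch.Tensor, discriminator: nn.Module,
                                 lambd: float = 1.0) -> torch.Tensor:
    """Attribute logits whose cross-entropy loss trains the discriminator
    normally but pushes the encoder in the opposite direction."""
    return discriminator(GradReverse.apply(z, lambd))
```

A single cross-entropy loss on these logits then trains the discriminator to predict the attribute while the reversed gradient drives the encoder to scrub it from the latent code.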