2018
DOI: 10.1162/tacl_a_00019
Do latent tree learning models identify meaningful structure in sentences?

Abstract: Recent work on the problem of latent tree learning has made it possible to train neural networks that learn to both parse a sentence and use the resulting parse to interpret the sentence, all without exposure to ground-truth parse trees at training time. Surprisingly, these models often perform better at sentence understanding tasks than models that use parse trees from conventional parsers. This paper aims to investigate what these latent tree learning models learn. We replicate two such models in a shared codebase…


Cited by 99 publications (136 citation statements)
References 16 publications (30 reference statements)
“…We evaluate the induced constituency parse trees via the overall F1 score, as well as the recall of four types of constituents: noun phrases (NP), verb phrases (VP), prepositional phrases (PP), and adjective phrases (ADJP) (Table 1). We also evaluate the robustness of models trained with fixed data and hyperparameters but different random initializations, in two ways: via the standard deviation of performance across multiple runs, and via the self-agreement F1 score (Williams et al., 2018), which is the average F1 taken over pairs of different runs. Among all of the models that do not require extra labels, VG-NSL with the head-initial inductive bias (VG-NSL+HI) achieves the best F1 score.…”
Section: Results: Unsupervised Constituency Parsing (mentioning)
Confidence: 99%
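As a concrete illustration of the self-agreement F1 metric described above, the following is a minimal sketch, assuming each run's output is one set of unlabeled (start, end) constituent spans per sentence and that pairwise F1 is micro-averaged over the corpus. The data representation and aggregation details are assumptions for exposition, not the evaluation code of Williams et al. (2018).

```python
from itertools import combinations

def corpus_f1(parse_a, parse_b):
    """Micro-averaged unlabeled F1 between two parses of the same corpus.
    Each parse is a list with one set of (start, end) spans per sentence."""
    match = sum(len(a & b) for a, b in zip(parse_a, parse_b))
    total_a = sum(len(a) for a in parse_a)
    total_b = sum(len(b) for b in parse_b)
    if not total_a or not total_b:
        return 0.0
    p, r = match / total_a, match / total_b
    return 2 * p * r / (p + r) if p + r else 0.0

def self_agreement_f1(runs):
    """Average corpus-level F1 over all unordered pairs of restarts."""
    pairs = list(combinations(runs, 2))
    return sum(corpus_f1(a, b) for a, b in pairs) / len(pairs)

# Example: three random restarts parsing the same two-sentence corpus.
run1 = [{(0, 2), (0, 4)}, {(1, 3)}]
run2 = [{(0, 2), (2, 4)}, {(1, 3)}]
run3 = [{(0, 4), (0, 2)}, {(0, 3)}]
print(self_agreement_f1([run1, run2, run3]))
```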
“…Recent work has proposed several approaches for inducing latent syntactic structures, including constituency trees (Choi et al., 2018; Yogatama et al., 2017; Maillard and Clark, 2018; Havrylov et al., 2019; Kim et al., 2019; Drozdov et al., 2019) and dependency trees (Shi et al., 2019), from the distant supervision of downstream tasks. However, most of these methods are unable to produce linguistically sound structures, or even structures that are consistent across different random initializations with fixed data and hyperparameters (Williams et al., 2018).…”
Section: Related Work (mentioning)
Confidence: 99%
“…For example, the structured attention method (Kim et al., 2017; Liu and Lapata, 2018) does not sample entire trees but rather computes arc marginals, and hence does not faithfully represent higher-order statistics. Much other previous work either relies on reinforcement learning (Nangia and Bowman, 2018; Williams et al., 2018a) or does not treat the latent structure as a random variable (Peng et al., 2018). Niculae et al. (2018) marginalize over latent structures; however, this necessitates strong sparsity assumptions on the posterior distributions, which may inject undesirable biases into the model.…”
Section: Introduction (mentioning)
Confidence: 99%
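To make the contrast between sampling whole trees and computing arc marginals concrete, here is a brute-force sketch for a toy sentence: it enumerates every cycle-free head assignment, forms the exact distribution over trees induced by arbitrary arc scores, and collapses it to per-arc marginals. The enumeration and random scores are illustrative assumptions; structured attention obtains such marginals efficiently (e.g., via the matrix-tree theorem) rather than by enumeration, and the passage's point is that the marginals alone no longer determine the joint distribution over whole trees.

```python
import itertools
import numpy as np

# Toy illustration (hypothetical, for exposition): exact arc marginals
# for a 3-word sentence, via enumeration of all cycle-free head
# assignments rather than an efficient matrix-tree computation.
n = 3                                   # words; head index 0 denotes the root
rng = np.random.default_rng(0)
scores = rng.normal(size=(n + 1, n))    # scores[h, m]: head h -> modifier m

def is_tree(heads):
    """heads[m] in 0..n is the head of word m (0 = root, h = word h-1).
    A valid (possibly multi-root) tree is exactly a cycle-free assignment."""
    for m in range(n):
        seen, h = {m}, heads[m]
        while h != 0:
            if h - 1 in seen:
                return False            # found a cycle
            seen.add(h - 1)
            h = heads[h - 1]
    return True

trees = [h for h in itertools.product(range(n + 1), repeat=n) if is_tree(h)]
weights = np.array([np.exp(sum(scores[h[m], m] for m in range(n)))
                    for h in trees])
probs = weights / weights.sum()         # exact distribution over whole trees

# Collapse the joint distribution to per-arc marginals P(head(m) = h).
marginals = np.zeros((n + 1, n))
for p, heads in zip(probs, trees):
    for m, h in enumerate(heads):
        marginals[h, m] += p

print(marginals.round(3))               # each column sums to 1: one head per word
```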