Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
DOI: 10.18653/v1/2020.emnlp-main.392
Unsupervised Parsing with S-DIORA: Single Tree Encoding for Deep Inside-Outside Recursive Autoencoders

Abstract: The deep inside-outside recursive autoencoder (DIORA; Drozdov et al., 2019a) is a self-supervised neural model that learns to induce syntactic tree structures for input sentences without access to labeled training data. In this paper, we discover that while DIORA exhaustively encodes all possible binary trees of a sentence with a soft dynamic program, its vector averaging approach is locally greedy and cannot recover from errors when computing the highest-scoring parse tree in bottom-up chart parsing. To fix this…
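
A minimal sketch may help make the chart-parsing setup concrete. The code below runs a CKY-style bottom-up search over every binary bracketing of a sentence, which is the exhaustive encoding the abstract describes; the span scorer is a random stand-in, not DIORA's learned composition and compatibility functions:

    # Hedged sketch: exhaustive bottom-up chart parsing over all binary trees.
    # span_score is a hypothetical placeholder, NOT DIORA's learned model.
    import random

    def best_binary_tree(words, span_score):
        n = len(words)
        # chart[(i, j)] holds (best score, best tree) for the span words[i:j]
        chart = {(i, i + 1): (0.0, words[i]) for i in range(n)}
        for length in range(2, n + 1):
            for i in range(n - length + 1):
                j = i + length
                best = None
                for k in range(i + 1, j):  # try every split point of [i, j)
                    left_score, left_tree = chart[(i, k)]
                    right_score, right_tree = chart[(k, j)]
                    score = left_score + right_score + span_score(i, k, j)
                    if best is None or score > best[0]:
                        best = (score, (left_tree, right_tree))
                chart[(i, j)] = best
        return chart[(0, n)]

    random.seed(0)
    score, tree = best_binary_tree("the cat sat on the mat".split(),
                                   lambda i, k, j: random.random())
    print(tree)

Because the chart keeps only one (score, tree) pair per span, a model that composes each cell's vector greedily cannot revisit a low-level decision later, which is the failure mode the abstract attributes to DIORA's vector averaging.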

Cited by 28 publications (31 citation statements). References 50 publications.
“…We compared examples of trees inferred by our model with the corresponding ground-truth constituency trees (see Appendix), encountering reasonable structures that differ from the constituent structure posited by the manually defined gold trees. Experimental results of previous work (Drozdov et al., 2020; Kim et al., 2019a) also show significant variance across random seeds. Thus, we hypothesize that an isomorphy-focused F1 evaluation with respect to gold constituency trees is insufficient to evaluate how reasonable the induced structures are.…”
Section: Dependency Tree Compatibility
confidence: 79%
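For readers unfamiliar with the metric this statement questions, the following is a hedged sketch of unlabeled bracket F1 between an induced tree and a gold tree, with trees as nested tuples of words; conventions vary in practice (e.g., whether trivial single-word and whole-sentence spans are discarded):

    # Sketch of unlabeled bracket F1; the span conventions are an assumption.
    def spans(tree, start=0):
        if isinstance(tree, str):          # leaf: contributes no bracket
            return set(), 1
        out, width = set(), 0
        for child in tree:
            child_spans, w = spans(child, start + width)
            out |= child_spans
            width += w
        out.add((start, start + width))    # bracket covering this subtree
        return out, width

    def bracket_f1(pred_tree, gold_tree):
        pred, _ = spans(pred_tree)
        gold, _ = spans(gold_tree)
        overlap = len(pred & gold)
        prec, rec = overlap / len(pred), overlap / len(gold)
        return 2 * prec * rec / (prec + rec) if prec + rec else 0.0

    pred = ((("the", "cat"), "sat"), ("on", ("the", "mat")))
    gold = (("the", "cat"), ("sat", ("on", ("the", "mat"))))
    print(round(bracket_f1(pred, gold), 3))  # 0.8

The statement's point is that two trees can overlap poorly on spans, and hence score low F1, while both being linguistically defensible analyses.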
“…Analysis. In order to better understand why our model performs better when evaluated on word-piece-level gold trees, we compute the recall of constituents following Kim et al. (2019b) and Drozdov et al. (2020). Besides standard constituents, we also compare the recall of word-piece chunks and proper-noun chunks.…”
Section: Results
confidence: 99%
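The recall analysis referenced here can be pictured as follows; this is a hypothetical sketch of per-label constituent recall (the fraction of gold spans with each label that also appear in the unlabeled induced tree), not the exact evaluation code of Kim et al. (2019b) or Drozdov et al. (2020):

    # Sketch: recall of gold constituents, broken down by gold label.
    from collections import Counter

    def label_recall(pred_spans, gold_labeled_spans):
        hits, totals = Counter(), Counter()
        for (i, j), label in gold_labeled_spans:
            totals[label] += 1
            if (i, j) in pred_spans:
                hits[label] += 1
        return {label: hits[label] / totals[label] for label in totals}

    pred = {(0, 2), (0, 3), (3, 6), (0, 6)}                  # induced spans
    gold = [((0, 2), "NP"), ((3, 6), "PP"), ((2, 6), "VP")]  # gold spans
    print(label_recall(pred, gold))  # {'NP': 1.0, 'PP': 1.0, 'VP': 0.0}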
“…Grammar induction using neural networks: There is a recent resurgence of interest in unsupervised constituency parsing, mostly driven by neural-network-based methods (Shen et al., 2018a, 2019; Drozdov et al., 2019, 2020; Kim et al., 2019a,b; Jin et al., 2019; Zhu et al., 2020). These methods can be categorized into two major groups: those built on top of a generative grammar and those without a grammar component.…”
Section: Related Work
confidence: 99%
“…Unsupervised parsing (or grammar induction) trains syntax-dependent models to produce syntactic trees of natural language expressions without direct syntactic annotation (Klein and Manning, 2002; Bod, 2006; Ponvert et al., 2011; Pate and Johnson, 2016; Shen et al., 2018; Kim et al., 2019; Drozdov et al., 2020). Compared to them, our model learns both syntax and semantics jointly.…”
Section: Unsupervised Parsing
confidence: 99%