“…Following prior work (Kim et al., 2019a; Shen et al., 2018, 2019; Cao et al., 2020), we remove punctuation and collapse unary chains before evaluation, and we calculate F1 ignoring trivial spans, i.e., single-word spans and whole-sentence spans. We perform the averaging at the sentence level (macro average) rather than the span level (micro average): we compute F1 for each sentence and then average over all sentences. We also mention the oracle […].

Model                                   Scores
(Shen et al., 2019)                     47.7   49.4   63.9   –
Tree Transformer† (Wang et al., 2019)   50.5   52.0   66.2   –
Neural PCFG† (Kim et al., 2019a)        50.8   52.6   64.6   –
DIORA (Drozdov et al., 2019)            –      58.9   60.5   –
Compound PCFG† (Kim et al., 2019a)      55.2   60.1   70.5   –
S-DIORA† (Drozdov et al., 2020)         57.6   64.0   71.8   –
Constituency Test (Cao et al., 2020)    62.8   65.9   […]

We follow Kim et al. (2019a) and take the baseline numbers of certain models from Kim et al. (2019a) and Cao et al. (2020). † denotes models trained without punctuation and ‡ denotes models trained on additional data.…”
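The sentence-level (macro-averaged) span F1 described above can be sketched as follows. This is an illustrative reconstruction, not the cited authors' evaluation code: the function names and the `(start, end)` exclusive-end span encoding are assumptions.

```python
# Sketch of the evaluation protocol: unlabeled span F1 per sentence,
# ignoring trivial spans, then macro-averaged over sentences.
# Spans are (start, end) with `end` exclusive; names are illustrative.

def nontrivial_spans(spans, sent_len):
    """Drop single-word spans and the whole-sentence span."""
    return {(i, j) for (i, j) in spans
            if j - i > 1 and not (i == 0 and j == sent_len)}

def sentence_f1(pred_spans, gold_spans, sent_len):
    """Unlabeled F1 between predicted and gold span sets for one sentence."""
    pred = nontrivial_spans(pred_spans, sent_len)
    gold = nontrivial_spans(gold_spans, sent_len)
    if not pred and not gold:   # no non-trivial spans to score
        return 1.0
    if not pred or not gold:
        return 0.0
    overlap = len(pred & gold)
    if overlap == 0:
        return 0.0
    p = overlap / len(pred)
    r = overlap / len(gold)
    return 2 * p * r / (p + r)

def corpus_f1(corpus):
    """Macro average: F1 per sentence first, then mean over sentences."""
    scores = [sentence_f1(pred, gold, n) for pred, gold, n in corpus]
    return sum(scores) / len(scores)
```

Note that the micro-averaged alternative would instead pool span counts over the whole corpus before computing a single F1; the macro average used here weights every sentence equally regardless of length.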