Dependency Parsing with Bounded Block Degree and Well-nestedness via Lagrangian Relaxation and Branch-and-Bound

Corro, Caio; Roux, Joseph Le; Lacroix, Mathieu; Rozenknop, Antoine; Calvo, Roberto Wolfler

doi:10.18653/v1/p16-1034

Cited by 8 publications

(11 citation statements)

References 26 publications

(23 reference statements)


“…For example, a verbal phrase is not constrained to contain a verb. As such, they can be assimilated to the mainstream approach to bi-lexical dependency parsing where one considers candidate outputs only in a restricted class of graphs: non-projective (McDonald et al, 2005), projective (Eisner, 1997) or bounded block degree and well-nested spanning arborescences (Gómez-Rodríguez et al, 2009; Corro et al, 2016), among others (Kuhlmann and Nivre, 2006; Satta and Kuhlmann, 2013; Pitler et al, 2012). These approaches assume that intricate relations in the syntactic content can be implicitly learned by the scoring function.…”

Section: Introduction (mentioning)

confidence: 99%

Span-based discontinuous constituency parsing: a family of exact chart-based algorithms with time complexities from O(n^6) down to O(n^3)

Corro

2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Self Cite


We introduce a novel chart-based algorithm for span-based parsing of discontinuous constituency trees of block degree two, including ill-nested structures. In particular, we show that we can build variants of our parser with smaller search spaces and time complexities ranging from O(n^6) down to O(n^3). The cubic time variant covers 98% of constituents observed in linguistic treebanks while having the same complexity as continuous constituency parsers. We evaluate our approach on German and English treebanks (Negra, Tiger, and DPTB) and report state-of-the-art results in the fully supervised setting. We also experiment with pre-trained word embeddings and BERT-based neural networks.


“…Finally, we restricted our grammar to spinal trees but it could be possible to allow full lexicalized TAG-like trees, with substitution nodes and even obligatory adjunction sites. Derivations compatible with the TAG formalism (or more generally LCFRS) could be recovered by the use of a constrained version of MSA (Corro et al, 2016).…”

Section: Discussion (mentioning)

confidence: 99%


“…Lagrangian relaxation has been successfully applied to various NLP tasks (Koo et al, 2010; Le Roux et al, 2013; Almeida and Martins, 2013; Das et al, 2012; Corro et al, 2016). Intuitively, given an integer linear program, it consists in relaxing some linear constraints which make the program difficult to solve and penalizing their violation in the objective function.…”

Section: Efficient Decoding (mentioning)

confidence: 99%
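
The idea quoted above can be illustrated on a toy problem. The sketch below is not the decoder of any cited paper: it assumes a hypothetical 0/1 selection problem with a single coupling constraint (pick at most `k` items), relaxes that constraint with one multiplier `lam`, and minimizes the resulting dual bound with a plain subgradient method. The function name, the step size `1/t`, and the toy scores are all assumptions made for illustration.

```python
def lagrangian_relaxation(scores, k, iterations=100):
    """Maximize sum(scores[i] * x[i]) with x[i] in {0, 1} and sum(x) <= k.

    The constraint sum(x) <= k is relaxed and its violation is penalized
    in the objective with a multiplier lam >= 0.
    """
    lam = 0.0
    best_dual = float("inf")   # tightest upper bound found so far
    best_primal = None         # best feasible objective value found so far
    best_x = None
    for t in range(1, iterations + 1):
        # With the constraint relaxed, each variable is decided independently.
        x = [1 if s - lam > 0 else 0 for s in scores]
        dual = sum((s - lam) * xi for s, xi in zip(scores, x)) + lam * k
        best_dual = min(best_dual, dual)
        violation = sum(x) - k
        if violation <= 0:
            # The relaxed solution happens to be feasible: record it.
            value = sum(s * xi for s, xi in zip(scores, x))
            if best_primal is None or value > best_primal:
                best_primal, best_x = value, x
        # Subgradient step: raise the penalty when the constraint is violated,
        # lower it (never below zero) when there is slack.
        lam = max(0.0, lam + (1.0 / t) * violation)
    return best_dual, best_primal, best_x


dual_bound, primal_value, solution = lagrangian_relaxation([5, 4, 3, 1], k=2)
print(dual_bound, primal_value, solution)
```

On this toy instance the dual bound and the best feasible value coincide, which certifies optimality; in general a gap can remain, which is where branch-and-bound comes in.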

“…To accelerate the resolution, our method relies heavily on problem reduction (Beasley, 1993), which uses the primal/dual bounds to filter out suboptimal assignments. Exact pruning based on duality has already been studied in parsing, with branch and bound (Corro et al, 2016) or column generation (Riedel et al, 2012) and in machine translation with beam search (Rush et al, 2013).…”

Section: Related Work (mentioning)

confidence: 99%
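
As a rough illustration of filtering with primal/dual bounds, the sketch below reuses a hypothetical toy setting (choose at most `k` items, one Lagrangian multiplier `lam`); it is not the reduction procedure of the cited work. For each variable it computes the dual bound obtained when that variable is forced to 1. If this bound falls strictly below the value of a known feasible solution, no solution using that variable can beat the incumbent, so the variable can be fixed to 0. All names and numbers are assumptions.

```python
def dual_bound(scores, k, lam):
    """Upper bound on max sum(scores[i] * x[i]) s.t. sum(x) <= k, for lam >= 0."""
    return sum(max(0.0, s - lam) for s in scores) + lam * k


def prune_with_bounds(scores, k, lam, primal_lower_bound):
    """Return indices i such that forcing x[i] = 1 cannot beat the incumbent."""
    bound = dual_bound(scores, k, lam)
    pruned = []
    for i, s in enumerate(scores):
        # Replace the free contribution max(0, s - lam) by the forced one (s - lam).
        forced_bound = bound - max(0.0, s - lam) + (s - lam)
        if forced_bound < primal_lower_bound:
            pruned.append(i)
    return pruned


# Toy instance: the incumbent [1, 1, 0, 0] has value 9.
print(prune_with_bounds([5, 4, 3, 1], k=2, lam=3.5, primal_lower_bound=9))
```

The test is strict, so variables that could still appear in a solution tying with the incumbent are kept.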


Efficient Discontinuous Phrase-Structure Parsing via the Generalized Maximum Spanning Arborescence

Corro, Roux, Lacroix

2017

Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing

Self Cite


We present a new method for the joint task of tagging and non-projective dependency parsing. We demonstrate its usefulness with an application to discontinuous phrase-structure parsing where decoding lexicalized spines and syntactic derivations is performed jointly. The main contributions of this paper are (1) a reduction from joint tagging and non-projective dependency parsing to the Generalized Maximum Spanning Arborescence problem, and (2) a novel decoding algorithm for this problem through Lagrangian relaxation. We evaluate this model and obtain state-of-the-art results despite strong independence assumptions.


Please Mind the Root: Decoding Arborescences for Dependency Parsing

Zmigrod, Vieira, Cotterell

2020

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)


The connection between dependency trees and spanning trees is exploited by the NLP community to train and to decode graph-based dependency parsers. However, the NLP literature has missed an important difference between the two structures: only one edge may emanate from the root in a dependency tree. We analyzed the output of state-of-the-art parsers on many languages from the Universal Dependency Treebank: although these parsers are often able to learn that trees which violate the constraint should be assigned lower probabilities, their ability to do so unsurprisingly degrades as the size of the training set decreases. In fact, the worst constraint-violation rate we observe is 24%. Prior work has proposed an inefficient algorithm to enforce the constraint, which adds a factor of n to the decoding runtime. We adapt an algorithm due to Gabow and Tarjan (1984) to dependency parsing, which satisfies the constraint without compromising the original runtime.

… (2005)) opt for the simpler CLE algorithm (Chu and Liu, 1965; Bock, 1971; Edmonds, 1967), which has a worst-case bound of O(n^3), but is often fast in practice. A notable exception is the Prague Dependency Treebank (Bejček et al., 2013), which allows for multi-rooted trees.
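
The root constraint described in this abstract is easy to check on a parser's output. The sketch below is a minimal checker, not the decoding algorithm of the paper: it assumes a tree is given as a hypothetical `heads` list in which `heads[i]` is the head of token `i + 1` and `0` stands for the artificial root.

```python
def root_children(heads):
    """Return the 1-based token positions whose head is the artificial root (0)."""
    return [i + 1 for i, h in enumerate(heads) if h == 0]


def satisfies_root_constraint(heads):
    """True when exactly one edge emanates from the root."""
    return len(root_children(heads)) == 1


# Token 2 is the only root child, so the constraint holds.
print(satisfies_root_constraint([2, 0, 2]))
# Tokens 1 and 2 both attach to the root, so the constraint is violated.
print(satisfies_root_constraint([0, 0, 2]))
```

Counting the trees for which this check fails over a parsed corpus gives a violation rate of the kind the abstract reports.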
