Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
DOI: 10.18653/v1/d15-1158

Semi-supervised Dependency Parsing using Bilexical Contextual Features from Auto-Parsed Data

Abstract: We present a semi-supervised approach to improve dependency parsing accuracy by using bilexical statistics derived from auto-parsed data. The method is based on estimating the attachment potential of head-modifier words, by taking into account not only the head and modifier words themselves, but also the words surrounding the head and the modifier. When integrating the learned statistics as features in a graph-based parsing model, we observe nice improvements in accuracy when parsing various English datasets.
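
To make the idea concrete, the following is a minimal, hypothetical sketch of the kind of pipeline the abstract describes: collect (head, modifier) counts from auto-parsed data, turn them into an association score (PMI is used here as a stand-in for the paper's attachment statistics), and discretize the score into buckets that a graph-based parser could consume as arc features. Function names, thresholds, and the feature template are illustrative assumptions, not the paper's exact recipe.

```python
# Hypothetical sketch: PMI-style bilexical association scores from
# auto-parsed arcs, bucketed into arc features for a graph-based parser.
import math
from collections import Counter

def pmi_table(arcs):
    """arcs: list of (head_word, modifier_word) pairs from auto-parsed data."""
    pair_counts = Counter(arcs)
    head_counts = Counter(h for h, _ in arcs)
    mod_counts = Counter(m for _, m in arcs)
    total = len(arcs)
    table = {}
    for (h, m), c in pair_counts.items():
        p_pair = c / total
        p_head = head_counts[h] / total
        p_mod = mod_counts[m] / total
        table[(h, m)] = math.log(p_pair / (p_head * p_mod))
    return table

def bucket(score, edges=(-2.0, 0.0, 2.0, 4.0)):
    """Discretize an association score into a few buckets (assumed thresholds)."""
    for i, edge in enumerate(edges):
        if score < edge:
            return i
    return len(edges)

def arc_features(head, mod, table):
    """Feature strings for a candidate head -> modifier arc."""
    if (head, mod) in table:
        return ["PMI_BUCKET=%d" % bucket(table[(head, mod)])]
    return ["PMI_UNSEEN"]

# Toy usage on a handful of auto-parsed arcs.
arcs = [("ate", "pizza"), ("ate", "pizza"), ("ate", "quickly"),
        ("saw", "pizza"), ("saw", "movie"), ("saw", "movie")]
table = pmi_table(arcs)
print(arc_features("ate", "pizza", table))   # ['PMI_BUCKET=2']
print(arc_features("ate", "movie", table))   # ['PMI_UNSEEN']
```

The contextual part of the method could be approximated by keying the counts on tuples that also include the words adjacent to the head and the modifier, at the cost of sparser counts.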

Cited by 9 publications (12 citation statements)
References 16 publications

“…In parsing, bilexical preferences have been used by Van Noord (2007) to improve syntactic ambiguity resolution in a Maximum-Entropy parser for Dutch. Kiperwasser and Goldberg (2015) extended bilexical preferences to contextual association scores based on PMI and dependency embeddings (Levy and Goldberg, 2014a) in a graph-based parser. Mirroshandel and Nasr (2016) integrated selectional preferences into a graph-based dependency parser.…”
Section: Relation
confidence: 99%
“…In these approaches, word co-occurrences are defined in terms of dependency contexts (x is the governor of word w), instead of linear contexts (x appears within a range of s around word w). Embedding techniques have also started to be applied to objects other than words, namely on dependency relations (Bansal, 2015; Kiperwasser and Goldberg, 2015).…”
Section: Word Vectors For Dependency Parsing
confidence: 99%
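
The contrast between linear and dependency contexts in the statement above can be illustrated with a small sketch (hypothetical helper functions, not the cited papers' code): linear contexts pair each word with its neighbors within a window of s tokens, while dependency contexts pair each word with its governor from a parse tree.

```python
def linear_contexts(words, s=2):
    """(word, context) pairs from a symmetric window of s tokens."""
    pairs = []
    for i, w in enumerate(words):
        lo, hi = max(0, i - s), min(len(words), i + s + 1)
        for j in range(lo, hi):
            if j != i:
                pairs.append((w, words[j]))
    return pairs

def dependency_contexts(words, heads, labels):
    """(word, context) pairs where the context is the word's governor,
    tagged with the dependency relation; heads are 1-based, 0 = root."""
    pairs = []
    for i, w in enumerate(words):
        if heads[i] > 0:
            gov = words[heads[i] - 1]
            pairs.append((w, "%s/%s" % (gov, labels[i])))
    return pairs

# "the cat sat": cat <-det- the, sat <-nsubj- cat, sat = root
words  = ["the", "cat", "sat"]
heads  = [2, 3, 0]
labels = ["det", "nsubj", "root"]
print(linear_contexts(words, s=1))
print(dependency_contexts(words, heads, labels))
```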
“…A line of work is devoted to parsing with RNN models, including using RNNs (Miikkulainen, 1996; Mayberry and Miikkulainen, 1999; Legrand and Collobert, 2015; Watanabe and Sumita, 2015) and LSTM (Hochreiter and Schmidhuber, 1997) RNNs (Kiperwasser and Goldberg, 2016). Legrand and Collobert (2015) used RNNs to learn conditional distributions over syntactic rules; explored sequence-to-sequence learning (Sutskever et al., 2014) for parsing; utilized character-level representations; and Kiperwasser and Goldberg (2016) built an easy-first dependency parser using tree-structured compositional LSTMs. However, all these parsers use greedy search and are trained using the maximum likelihood criterion (except Kiperwasser and Goldberg (2016), who used a margin-based objective).…”
Section: Related Work
confidence: 99%
“…Legrand and Collobert (2015) used RNNs to learn conditional distributions over syntactic rules; explored sequence-to-sequence learning (Sutskever et al., 2014) for parsing; utilized character-level representations; and Kiperwasser and Goldberg (2016) built an easy-first dependency parser using tree-structured compositional LSTMs. However, all these parsers use greedy search and are trained using the maximum likelihood criterion (except Kiperwasser and Goldberg (2016), who used a margin-based objective). For learning global models, Watanabe and Sumita (2015) used a margin-based objective, which was not optimized for the evaluation metric; although not using RNNs, Weiss et al. (2015) proposed a method using the averaged perceptron with beam search (Collins, 2002; Collins and Roark, 2004; Zhang and Clark, 2008), which required fixing the neural network representations, and thus their model parameters were not learned using end-to-end backpropagation.…”
Section: Related Work
confidence: 99%