Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2016
DOI: 10.18653/v1/p16-1079
|View full text |Cite
|
Sign up to set email alerts
|

Coordination Annotation Extension in the Penn Tree Bank

Abstract: Coordination is an important and common syntactic construction which is not handled well by state of the art parsers. Coordinations in the Penn Treebank are missing internal structure in many cases, do not include explicit marking of the conjuncts and contain various errors and inconsistencies. In this work, we initiated manual annotation process for solving these issues. We identify the different elements in a coordination phrase and label each element with its function. We add phrase boundaries when these ar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
9

Relationship

1
8

Authors

Journals

citations
Cited by 20 publications
(16 citation statements)
references
References 7 publications
0
16
0
Order By: Relevance
“…We evaluate our models on their ability to identify conjunction boundaries in the extended Penn Treebank (Ficler and Goldberg, 2016) and Genia Treebank (Ohta et al, 2002) Berkeley parser (Petrov et al, 2006) and the discriminative Zpar parser (Zhang and Clark, 2011). When evaluating on the Genia treebank, we compare to the results of the discriminative coordination-prediction model of Hara et al (2009).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We evaluate our models on their ability to identify conjunction boundaries in the extended Penn Treebank (Ficler and Goldberg, 2016) and Genia Treebank (Ohta et al, 2002) Berkeley parser (Petrov et al, 2006) and the discriminative Zpar parser (Zhang and Clark, 2011). When evaluating on the Genia treebank, we compare to the results of the discriminative coordination-prediction model of Hara et al (2009).…”
Section: Methodsmentioning
confidence: 99%
“…However, using the Genia corpus is not ideal since it is in a specialized domain and much smaller than the PTB. In this work we rely on a version of the PTB released by Ficler and Goldberg (2016) in which the above deficiencies are manually resolved. In particular, coordinating elements, coordination phrases and conjunct boundaries are explicitly marked with specialized function labels.…”
Section: Coordination In the Ptbmentioning
confidence: 99%
“…NomBank (Meyers et al, 2004) extended the argument structure for instances of common nouns. Vadas and Curran (2007), and Ficler and Goldberg (2016) extended the PTB with noun phrase and coordination annotations, respectively.…”
Section: Related Workmentioning
confidence: 99%
“…We trained two additional RNNG models on 38,546 sentences from the Penn Treebank annotated with two different schemes: The first, RNNG (PTB-control) was trained with the original Penn Treebank annotation. The second, RNNG (PTB-coord), was trained on the same sentences, but with an extended coordination annotation scheme, meant to employ the scheme employed in the FTB, adapted from Ficler and Goldberg (2016). We stripped empty categories from their scheme and only kept the NP-COORD label for constituents inside a coordination structure.…”
Section: A the Effect Of Annotation Schemesmentioning
confidence: 99%