Proceedings of the Workshop on Grammar Engineering Across Frameworks - GEAF '08 2008
DOI: 10.3115/1611546.1611549
|View full text |Cite
|
Sign up to set email alerts
|

A more precise analysis of punctuation for broad-coverage surface realization with CCG

Abstract: This paper describes a more precise analysis of punctuation for a bi-directional, broad coverage English grammar extracted from the CCGbank (Hockenmaier and Steedman, 2007). We discuss various approaches which have been proposed in the literature to constrain overgeneration with punctuation, and illustrate how aspects of Briscoe's (1994) influential approach, which relies on syntactic features to constrain the appearance of balanced and unbalanced commas and dashes to appropriate sentential contexts, is unattr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
11
0

Year Published

2009
2009
2021
2021

Publication Types

Select...
6
1

Relationship

3
4

Authors

Journals

citations
Cited by 9 publications
(11 citation statements)
references
References 7 publications
0
11
0
Order By: Relevance
“…Generally speaking, the semantic dependency graphs are more abstract than unordered dependency trees, but more detailed than AMRs. The grammar is extracted from a version of the CCGbank (Hockenmaier and Steedman, 2007) enhanced for realization, where the enhancements include: better analyses of punctuation (White and Rajkumar, 2008); less error prone handling of named entities ; re-inserting quotes into the CCGbank; and assignment of consistent semantic roles across diathesis alternations (Boxwell and White, 2008), using PropBank (Palmer et al, 2005).…”
Section: Surface Realization With Openccgmentioning
confidence: 99%
“…Generally speaking, the semantic dependency graphs are more abstract than unordered dependency trees, but more detailed than AMRs. The grammar is extracted from a version of the CCGbank (Hockenmaier and Steedman, 2007) enhanced for realization, where the enhancements include: better analyses of punctuation (White and Rajkumar, 2008); less error prone handling of named entities ; re-inserting quotes into the CCGbank; and assignment of consistent semantic roles across diathesis alternations (Boxwell and White, 2008), using PropBank (Palmer et al, 2005).…”
Section: Surface Realization With Openccgmentioning
confidence: 99%
“…Our starting point is an enhanced version of the CCGbank (Hockenmaier and Steedman, 2007)-a corpus of CCG derivations derived from the Penn Treebank-with Propbank (Palmer et al, 2005) roles projected onto it (Boxwell and White, 2008). To engineer a grammar from this corpus suitable for realization with OpenCCG, the derivations are first revised to reflect the lexicalized treatment of coordination and punctuation assumed by the multi-modal version of CCG that is implemented in OpenCCG (White and Rajkumar, 2008). Further changes are necessary to support semantic dependencies rather than surface syntactic ones; in particular, the features and unification constraints in the categories related to semantically empty function words such complementizers, infinitivalto, expletive subjects, and case-marking prepositions are adjusted to reflect their purely syntactic status.…”
Section: Realization From An Enhanced Ccgbankmentioning
confidence: 99%
“…Note that the baseline for Section 23 uses 4-grams and a filter for balanced punctuation(White and Rajkumar, 2008), unlike the other reported configurations, which would explain the somewhat smaller increase seen with this section.6 Scripts for running these tests are available at http://projectile.sv.cmu.edu/research/ public/tools/bootStrap/tutorial.htm…”
mentioning
confidence: 99%
“…In the figure, nodes correspond to discourse referents labeled with lexical predicates, and dependency relations between nodes encode argument structure (gold standard CCG lexical categories are also shown); note that semantically empty function words such as infinitival-to are missing. The grammar is extracted from a version of the CCGbank (Hockenmaier and Steedman, 2007) enhanced for realization; the enhancements include: better analyses of punctuation (White and Rajkumar, 2008); less error prone handling of named entities Figure 1: Example OpenCCG semantic dependency input for he has a point he wants to make, with gold standard lexical categories for each node and assignment of consistent semantic roles across diathesis alternations (Boxwell and White, 2008), using PropBank (Palmer et al, 2005).…”
Section: Introductionmentioning
confidence: 99%