Pronolninalization has been related to tile idea of a local focusa set of discourse entities in the speaker's centre of attention, for exmnple ill Gundel et al. (1993)'s givenness hierarchy or in centering theory. Both accounts say that the determination of tile tbcus depends on syntactic as well as pragmatic factors, but have not been able to pin those factors down. In this paper, we uncover the major factors which determine the focus set in descriptive texts. This new tbcus definition has been ew, luated with respect to two corporm museum exhibit labels, mid newspaper mtieles. It provides an operationalizable basis for pronoun production, and has been implemented as the reusable module gnome-np. The algorithm l)ehind gnome-np is conlpared with the most recent pronoun generation algorithm of McCoy and Strube (1999).
In this paper we sketch the design, motivation and use of the GeM annotation scheme: an XML-based annotation framework for preparing corpora involving documents with complex layout of text, graphics, diagrams, layout and other navigational elements. We set out the basic organizational layers, contrast the technical approach with some other schemes for complex markup in the XML tradition, and indicate some of the applications we are pursuing.
In this paper we discuss some problems arising in German-Russian Machine Translation with regard to tense and aspect. Since the formal category of aspect is missing in German the information required for generating Russian aspect forms has to be extracted from different representation levels. A sentence based procedure for aspect choice in the MT system VIRTEX is presented which takes lexieal, morphological and semantic criteria into account. The limits of this approach are shown. To overcome these difficulties a human interaction component is proposed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.