We examine the role of discourse relations (relations between propositions) in the interpretation of evaluative or opinion words. Through a combination of Rhetorical Structure Theory or RST (Mann & Thompson, 1988) and Appraisal Theory (Martin & White, 2005), we analyze how different discourse relations modify the evaluative content of opinion words, and what impact the nucleus-satellite structure in RST has on the evaluation. We conduct a corpus study, examining and annotating over 3,000 evaluative words in 50 movie reviews in the SFU Review Corpus (Taboada, 2008) with respect to five parameters: word category (nouns, verbs, adjectives or adverbs), prior polarity (positive, negative or neutral), RST structure (both nucleus-satellite status and relation type) and change of polarity as a result of being part of a discourse relation (Intensify, Downtone, Reversal or No Change). Results show that relations such as Concession, Elaboration, Evaluation, Evidence and Restatement most frequently intensify the polarity of the opinion words, although the majority of evaluative words (about 70%) do not undergo changes in their polarity because of the relations they are a part of. We also find that most opinion words (about 70%) are positioned in the nucleus, confirming a hypothesis in the literature, that nuclei are the most important units when extracting evaluation automatically.
This paper investigates the pragmatic expressions of negative evaluation (negativity) in two corpora: (i) comments posted online in response to newspaper opinion articles; and (ii) online reviews of movies, books and consumer products. We propose a taxonomy of linguistic resources that are deployed in the expression of negativity, with two broad groups at the top level of the taxonomy: resources from the lexicogrammar or from discourse semantics. We propose that rhetorical figures can be considered part of the discourse semantic resources used in the expression of negativity. Using our taxonomy as starting point, we carry out a corpus analysis, and focus on three phenomena: adverb + adjective combinations; rhetorical questions; and rhetorical figures. Although the analysis in this paper is corpus-assisted rather than corpus-driven, the final goal of our research is to make it quantitative, in extracting patterns and resources that can be detected automatically.
Taboada et al. (2008) propose a word-based method for extracting sentiment from text that relies on the most relevant parts of a text. The method predicts that opinion words found in the nuclei (more important parts) of a document are more significant for the overall sentiment, whereas opinion words found in the satellites (less important parts) only potentially interfere with the overall sentiment. However, as pointed out by Taboada et al. (2008) and Narayanan et al. (2009), for certain discourse relations (for instance, Condition relations), the calculation of sentiment should involve both parts of the relation. Based on our analysis of the affective content expressed by automatically extracted discourse relations from the Simon Fraser University Corpus (Taboada 2008) and the Penn Discourse Treebank (Prasad et al. 2008), we propose to classify all the discourse relations into four categories: (1) relations that reverse polarity, (2) intensify polarity, (3) downtone polarity, or (4) produce no change in polarity. We compare the performance of a sentiment analysis system (SO-CAL, Taboada et al. 2011) when opinion words are detected only in the nuclei with its performance when both parts of the relation are analyzed in combination with the opinion words. The results of the experiment show that extraction of both the nucleus and the satellite parts of texts does not improve the performance of a sentiment extraction system.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.