Abstract-Web intelligence applications track online sources with economic relevance such as customer reviews, news articles and social media postings. Automated sentiment analysis based on lexical methods or machine learning identifies the polarity of opinions expressed in these sources to assess how stakeholders perceive a topic. This paper introduces a hybrid approach that combines the throughput of lexical analysis with the flexibility of machine learning to resolve ambiguity and consider the context of sentiment terms. The context-aware method identifies ambiguous terms that vary in polarity depending on the context and stores them in contextualized sentiment lexicons. In conjunction with semantic knowledge bases, these lexicons help ground ambiguous sentiment terms to concepts that correspond to their polarity. This grounding paves the way for interlinking, extending, or even replacing contextualized sentiment lexicons with semantic knowledge bases. An extensive evaluation applies the method to user reviews across three domains (movies, products and hotels).
This paper presents a novel method for contextualizing and enriching large semantic knowledge bases for opinion mining with a focus on Web intelligence platforms and other high-throughput big data applications. The method is not only applicable to traditional sentiment lexicons, but also to more comprehensive, multi-dimensional affective resources such as SenticNet. It comprises the following steps: (i) identify ambiguous sentiment terms, (ii) provide context information extracted from a domain-specific training corpus, and (iii) ground this contextual information to structured background knowledge sources such as ConceptNet and WordNet. A quantitative evaluation shows a significant improvement when using an enriched version of SenticNet for polarity classification. Crowdsourced gold standard data in conjunction with a qualitative evaluation sheds light on the strengths and weaknesses of the concept grounding, and on the quality of the enrichment process.
The advantages and positive effects of multiple coordinated views on search performance have been documented in several studies. This paper describes the implementation of multiple coordinated views within the Media Watch on Climate Change, a domain-specific news aggregation portal available at www.ecoresearch.net/climate that combines a portfolio of semantic services with a visual information exploration and retrieval interface. The system builds contextualized information spaces by enriching the content repository with geospatial, semantic and temporal annotations, and by applying semi-automated ontology learning to create a controlled vocabulary for structuring the stored information. Portlets visualize the different dimensions of the contextualized information spaces, providing the user with multiple views on the latest news media coverage. Context information facilitates access to complex datasets and helps users navigate large repositories of Web documents. Currently, the system synchronizes information landscapes, domain ontologies, geographic maps, tag clouds and just-in-time information retrieval agents that suggest similar topics and nearby locations.
This paper presents a method to integrate external knowledge sources such as DBpedia and OpenCyc into an ontology learning system that automatically suggests labels for unknown relations in domain ontologies based on large corpora of unstructured text. The method extracts and aggregates verb vectors from semantic relations identified in the corpus. It composes a knowledge base which consists of (i) verb centroids for known relations between domain concepts, (ii) mappings between concept pairs and the types of known relations, and (iii) ontological knowledge retrieved from external sources. Applying semantic inference and validation to this knowledge base yields a refined relation label suggestion. A formal evaluation compares the accuracy and average ranking precision of this hybrid method with the performance of methods that solely rely on corpus data and those that are only based on reasoning and external data sources.
ABSTRACT. This paper presents the U.S. Election 2004 Web Monitor, a public Web portal that captured trends in political media coverage before and after the 2004 U.S. presidential election. Developed by the authors of this article, the webLyzard suite of Web mining tools provided the required functionality to aggregate and analyze about a half-million documents in weekly intervals. The study paid particular attention to the editorial slant, which is defined as the quantity and tone of a Web site's coverage as influenced by its editorial position. The observable attention and attitude toward the candidates served as proxies of editorial slant. The system identified attention by determining the frequency of candidate references and measured attitude towards the candidate by looking for positive and negative expressions that co-occur with these references. Keywords and perceptual maps summarized the most important topics associated with the candidates, placing special emphasis on environmental issues. KEYWORDS. U.S. presidential elections, media monitoring, Web mining, natural language processing, semantic orientation, keyword analysis Arno Scharl is the Vice President of MODUL University Vienna where he also heads the Department of New Media Technology (www.modul.ac.at/nmt). Prior to this appointment, he was a Key Researcher at the Austrian Competence Center for Knowledge Management, held professorships at Graz University of Technology and the University of Western Australia, and had joined Curtin University of Technology and the University of California at Berkeley as a Visiting Fellow. His current research projects focus on the integration of semantic and geospatial technology, human-computer interaction, computer-mediated collaboration, ontology learning, and the various aspects of environmental online communication.Albert Weichselbraun is an Assistant Professor at the Department of Information Systems and Operations at Vienna University of Economics and Business Administration (ai.wu-wien.ac.at). After completing two Master's degrees in Economics and Chemical Engineering, his doctoral thesis focused on ontology-based text classification. Dr. Weichselbraun leads the technical development of webLyzard (www.webLyzard.com) and the IDIOM Project (www.idiom.at) with a special focus on the analytical methods involved. His current research focuses on ontology evolution and learning, text mining and the application of semantic technologies to information retrieval.
Highlights“Westeros Sentinel” – a visual analytics dashboard for Game of Thrones.Extraction of affective and factual knowledge from news and social media coverage.Emotional categories from semantic knowledge bases.Automated annotation services for contextualized information spaces.Interactive visualizations to explore context features.
Given the intense attention that environmental topics such as climate change attract in news and social media coverage, scientists and communication professionals want to know how different stakeholders perceive observable threats and policy options, how specific media channels react to new insights, and how journalists present scientific knowledge to the public. This paper investigates the potential of semantic technologies to address these questions. After summarizing methods to extract and disambiguate context information, we present visualization techniques to explore the lexical, geospatial, and relational context of topics and entities referenced in these repositories. The examples stem from the Media Watch on Climate Change, the Climate Resilience Toolkit and the NOAA Media Watch-three applications that aggregate environmental resources from a wide range of online sources. These systems not only show the value of providing comprehensive information to the public, but also have helped to develop a novel communication success metric that goes beyond bipolar assessments of sentiment.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.