Context and Domain Knowledge Enhanced Entity Spotting in Informal Text

Gruhl, Daniel; Nagarajan, Meena; Pieper, Jan; Robson, Christine; Sheth, Amit P.

doi:10.1007/978-3-642-04930-9_17

Cited by 29 publications

(18 citation statements)

References 11 publications

“…Our collection module uses the Twitter Streaming API 5 . The Twitter Streaming API allows near-realtime access to various subsets of Twitter public statuses.…”

Section: Architecture (mentioning)

confidence: 99%

“…Entities such as Obama, Senate and Health Care Bill are mentioned within the text in microposts and represent finer grained semantic units that can be extracted. The task of Named Entity Recognition has been studied in casual text [5] and in more general form following both unsupervised and supervised machine learning approaches [9]. The best performing systems achieve up to 90.8 F1 score [12] through supervised approaches, i.e.…”

Section: A Extracting Semantic Descriptors (mentioning)

confidence: 99%

Linked Open Social Signals

Mendes, Passant, Kapanipathi, et al. (2010)

2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology

Abstract: In this paper we discuss the collection, semantic annotation and analysis of real-time social signals from microblogging data. We focus on users interested in analyzing social signals collectively for sensemaking. Our proposal enables flexibility in selecting subsets for analysis, alleviating information overload. We define an architecture that is based on state-of-the-art Semantic Web technologies and a distributed publish-subscribe protocol for real-time communication. In addition, we discuss our method and application in a scenario related to the health care reform in the United States.

“…Kleb et al [14] used concept-dependant text patterns for the disambiguation of text information. Gruhl et al [9] trained an SVM classifier in order to spot ontology entities. Here, many common ideas from information retrieval (IR) have been transferred to this domain.…”

Section: Related Work (mentioning)

confidence: 99%

“…Also, many try to transfer NLP approaches to this domain [26,14,9], mostly focusing on a specific domain, using domain-specific measures. So far, in the field of ontology-based entity disambiguation, domain-independent complex structures in semantic graphs have not been exploited.…”

Section: Introduction (mentioning)

confidence: 99%

Entity Reference Resolution via Spreading Activation on RDF-Graphs

Kleb, Abecker (2010)

Lecture Notes in Computer Science

Abstract. The use of natural language identifiers as reference for ontology elements, in addition to the URIs required by the Semantic Web standards, is of utmost importance because of their predominance in the human everyday life, i.e. speech or print media. Depending on the context, different names can be chosen for one and the same element, and the same element can be referenced by different names. Here homonymy and synonymy are the main cause of ambiguity in perceiving which concrete unique ontology element ought to be referenced by a specific natural language identifier describing an entity. We propose a novel method to resolve entity references under the aspect of ambiguity which explores only formal background knowledge represented in RDF graph structures. The key idea of our domain-independent approach is to build an entity network with the most likely referenced ontology elements by constructing Steiner graphs based on spreading activation. In addition to exploiting complex graph structures, we devise a new ranking technique that characterises the likelihood of entities in this network, i.e. interpretation contexts. Experiments in a highly polysemic domain show the ability of the algorithm to retrieve the correct ontology elements in almost all cases.

“…However when dealing with social media sites, performing NLP can be particularly difficult due to the typically informal nature of user posts, which tend to contain a lot of slang and context-dependant terms, with little attention given to spelling and grammar (Gruhl et al, 2009). Thus, while NLP algorithms are potentially very useful tools for investigating SNSs, there are challenges particular to user-generated content which must be handled.…”

Section: Current Approaches For Data Mining and Analysis (mentioning)

confidence: 99%

Understanding Online Communities by Using Semantic Web Technologies

Passant, Kinsella, Bojārs, et al.

Handbook of Research on Methods and Techniques for Studying Virtual Communities

During the last few years, the Web that we used to know as a read-only medium shifted to a read-write Web, often known as Web 2.0 or the Social Web, in which people interact, share and build content collaboratively within online communities. In order to clearly understand how these online communities are formed, evolve, share and produce content, a first requirement is to gather related data. In this chapter, we give an overview of how Semantic Web technologies can be used to provide a unified layer of representation for Social Web data in an open and machine-readable manner thanks to common models and shared semantics, facilitating data gathering and analysis. Through a comprehensive state of the art review, we describe the various models that can be applied to online communities and give an overview of some of the new possibilities offered by such a layer in terms of data querying and community analysis.
