Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval 2005
DOI: 10.1145/1076034.1076132

A testbed for people searching strategies in the WWW

Abstract: This paper describes the creation of a testbed to evaluate people searching strategies on the World-Wide-Web. This task involves resolving person names' ambiguity and locating relevant information characterising every individual under the same name.

Citations: cited by 43 publications (46 citation statements)
References: 1 publication

“…Since the goal of this work is aimed towards evaluating the person clustering hypothesis in a very general setting, we have selected these clustering methods because they are representative of the types already tried. For instance, Artiles et al [2] use a similar representation of documents with agglomerative clustering technique to obtain a baseline for a pilot test collection for this task. However, our work differs because we focus on exploring how document clustering performs for this task.…”
Section: Related Work
Citation type: mentioning
confidence: 99%
“…The harmonic mean (α = 0.5) was used for the final ranking of systems at SemEval, and F_0.2 was also reported as an additional measure, which gives more importance to the inverse purity aspect (α = 0.2). Artiles et al [2] argue that the rationale for using F_0.2, from a user's point of view, is that "it is easier to discard a few incorrect web pages in a cluster which has all the information needed, than having to collect the relevant information across many different clusters." We decided to also report on F_0.8, a measure which gives more importance to the purity aspect (α = 0.8).…”
Section: Performance Measures
Citation type: mentioning
confidence: 99%
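
The statement above names purity, inverse purity, and an α-weighted F-measure without spelling out the formulas. The following is a minimal sketch, assuming the commonly used definitions: purity as the size-weighted best overlap of each system cluster with any gold category, inverse purity as the same quantity with the roles swapped, and F_α = 1 / (α / purity + (1 − α) / inverse purity). The function names and the toy data are hypothetical and are not taken from the cited papers.

```python
def purity(clusters, categories):
    """Size-weighted best overlap of each cluster with any category.

    Both arguments are lists of sets of document ids (assumed layout).
    """
    n = sum(len(c) for c in clusters)
    if n == 0:
        return 0.0
    best = (max((len(c & g) for g in categories), default=0) for c in clusters)
    return sum(best) / n


def inverse_purity(clusters, categories):
    """Purity with the roles of clusters and categories swapped."""
    return purity(categories, clusters)


def f_alpha(p, ip, alpha=0.5):
    """Weighted harmonic mean; alpha is the weight on purity (assumed form)."""
    if p == 0 or ip == 0:
        return 0.0
    return 1.0 / (alpha / p + (1.0 - alpha) / ip)


# Toy example: the system puts all four pages in one cluster, while the
# gold standard says they belong to two different people.
system = [{"d1", "d2", "d3", "d4"}]
gold = [{"d1", "d2"}, {"d3", "d4"}]

p = purity(system, gold)
ip = inverse_purity(system, gold)
scores = {alpha: f_alpha(p, ip, alpha) for alpha in (0.2, 0.5, 0.8)}

for alpha, score in scores.items():
    print(f"F_{alpha} = {score:.3f}")
```

In this toy case the single merged cluster has low purity but perfect inverse purity, so the score at α = 0.2 comes out higher than at α = 0.5 or α = 0.8, which matches the quoted rationale that an over-merged cluster is penalized less when inverse purity is weighted more heavily.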
“…Many celebrities and experts from various fields are referred by their original names on web. Most of the queries to web search engines include person names [1] [2]. For example, people might use "Michel Jackson" as a query on search engine to know about him.…”
Section: Introduction 11 Information Retrieval
Citation type: mentioning
confidence: 99%
“…Around 30% of search engine queries include personal names [1]. However, retrieving information about a person merely using his or her real names is insufficient when that person has nicknames.…”
Section: Introduction
Citation type: mentioning
confidence: 99%