2008
DOI: 10.1353/lib.0.0000

Effects of Inconsistent Relevance Judgments on Information Retrieval Test Results: A Historical Perspective

"Only by continuous self-appraisal can a large information system make itself responsive to the needs of the scientific community." Concluding sentence in Lancaster (1969)

Abstract: The main objective of information retrieval (IR) systems is to retrieve information or information objects relevant to user requests and possible needs. In IR tests, retrieval effectiveness is established by comparing IR systems' retrievals (systems relevance) with users' or user surrogates' assessments (user relevance), where user rele…

Citations: cited by 40 publications (37 citation statements)
References: 38 publications
“…Therefore, a reliable RJ is critical for a precise and effective evaluation. As these evaluation techniques become ubiquitous, the accuracy and meaningfulness of the result is subjective [36]. Due to conflicts in human judgement, research has been conducted to determine how to obtain reliable relevance judgements [37,38].…”
Section: Discussion
Citation type: mentioning
confidence: 99%
“…(SARACEVIC, 2009. For some selected publications, see Saracevic ( , 2008, Spink et al (2001), Jansen et al (1998). Further details at:…”
Section: General Summary, Summary of Chapter 10
Citation type: unclassified
“…Interassessor reliability is a controversial issue when it comes to the evaluation of automatically generated document similarity models. Some (e.g., Harman & Vorhees, 2006) point to strong consistency while others (e.g., Saracevic, 2008; Morris, 2010) point to substantial individual differences. Research evidence on which conclusions can be drawn is rather limited (Saracevic, 2008).…”
Section: Introduction
Citation type: mentioning
confidence: 99%
“…Some (e.g., Harman & Vorhees, 2006) point to strong consistency while others (e.g., Saracevic, 2008; Morris, 2010) point to substantial individual differences. Research evidence on which conclusions can be drawn is rather limited (Saracevic, 2008). In this paper we report two experiments that contribute to this issue.…”
Section: Introduction
Citation type: mentioning
confidence: 99%