2000
DOI: 10.1002/(sici)1097-4571(2000)51:9<834::aid-asi60>3.0.co;2-1

When information retrieval measures agree about the relative quality of document rankings

Abstract: The variety of performance measures available for information retrieval systems, search engines, and network filtering agents can be confusing to both practitioners and scholars. Most discussions about these measures address their theoretical foundations and the characteristics of a measure that make it desirable for a particular application. In this work, we consider how measures of performance at a point in a search may be formally compared. Criteria are developed that allow one to determine the percent of t…

Cited by 27 publications (15 citation statements)
References 19 publications
“…Other performance measures could be used for such a study, or the relationship between performance with this measure and performance with other measures might be studied (Losee, 2000). An earlier study on distributed information retrieval provides methods using Average Search Length as a performance measure that can address issues such as the clustering problems discussed above (Losee & Church, 2004).…”
Section: Discussion (citation type: mentioning)
confidence: 99%
“…Determining the relative performance of two different characteristics used in ordering allows one to make decisions about which types of characteristics should be incorporated into a specific system. While ordering performance measures take many forms (Losee, 2000), e.g. precision, recall, average search length, or any of a number of measures of ordering performance, a linear measure that can be directly interpreted as an occurrence of characteristic X produces performance at the same level as occurrences of characteristic Y can be particularly valuable when making decisions about whether characteristic X or characteristic Y should be used, along with their associated costs and benefits.…”
Section: Ordering Performance as a Utility Measure (citation type: mentioning)
confidence: 99%
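
To make the ordering measures named in that statement concrete, here is a minimal Python sketch. The ranking, the relevance judgments, and the function names are illustrative assumptions, and the average search length shown is a simplified stand-in for Losee's measure rather than code from any of the cited papers.

```python
# Illustrative sketch only; binary relevance and the example data are assumed.

def precision_at_k(ranking, relevant, k):
    """Fraction of the top-k ranked documents that are relevant."""
    return sum(1 for doc in ranking[:k] if doc in relevant) / k

def recall_at_k(ranking, relevant, k):
    """Fraction of all relevant documents that appear in the top k."""
    return sum(1 for doc in ranking[:k] if doc in relevant) / len(relevant)

def average_search_length(ranking, relevant):
    """Mean rank position of the relevant documents (lower is better);
    a simplified stand-in for Losee's Average Search Length."""
    positions = [i + 1 for i, doc in enumerate(ranking) if doc in relevant]
    return sum(positions) / len(positions)

ranking = ["d3", "d1", "d7", "d2", "d5"]   # a system's document ordering
relevant = {"d1", "d2"}                    # assumed relevance judgments

print(precision_at_k(ranking, relevant, 3))       # 1/3
print(recall_at_k(ranking, relevant, 3))          # 1/2
print(average_search_length(ranking, relevant))   # (2 + 4) / 2 = 3.0
```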
“…Since different evaluation measures evaluate different aspects of retrieval behavior, many evaluation measures have come to proliferate, and these measures have been deeply analyzed and criticized [11,8]. For example, Buckley and Voorhees compare evaluation measures based on their query stability [5], and Losee [10] proposes criteria to determine under which conditions different measures agree or disagree.…”
Section: Introduction (citation type: mentioning)
confidence: 99%
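
As a small illustration of the agreement question raised in that statement, the following sketch uses hypothetical rankings, relevance judgments, and simplified measures (not taken from the cited papers) to check whether two measures prefer the same one of two document orderings:

```python
# Illustrative sketch only; the data and the simplified measures are assumed.

def precision_at_k(ranking, relevant, k):
    """Fraction of the top-k ranked documents that are relevant (higher is better)."""
    return sum(1 for doc in ranking[:k] if doc in relevant) / k

def average_search_length(ranking, relevant):
    """Mean rank position of the relevant documents (lower is better)."""
    positions = [i + 1 for i, doc in enumerate(ranking) if doc in relevant]
    return sum(positions) / len(positions)

relevant = {"d1", "d2"}
ranking_a = ["d1", "d3", "d4", "d5", "d2"]  # relevant documents at ranks 1 and 5
ranking_b = ["d3", "d1", "d2", "d4", "d5"]  # relevant documents at ranks 2 and 3

# Precision at rank 1 prefers ranking A; average search length prefers ranking B.
print(precision_at_k(ranking_a, relevant, 1), precision_at_k(ranking_b, relevant, 1))          # 1.0 0.0
print(average_search_length(ranking_a, relevant), average_search_length(ranking_b, relevant))  # 3.0 2.5

agree = (precision_at_k(ranking_a, relevant, 1) > precision_at_k(ranking_b, relevant, 1)) == \
        (average_search_length(ranking_a, relevant) < average_search_length(ranking_b, relevant))
print("The two measures agree on the better ranking:", agree)  # False: they disagree here
```

The disagreement in this example arises because precision at a shallow cutoff rewards only the single top-ranked document, while average search length accounts for the position of every relevant document.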