2017
DOI: 10.1002/asi.23910
|View full text |Cite
|
Sign up to set email alerts
|

Toward an anatomy of IR system component performances

Abstract: Information retrieval (IR) systems are the prominent means for searching and accessing huge amounts of unstructured information on the web and elsewhere. They are complex systems, constituted by many different components interacting together, and evaluation is crucial to both tune and improve them. Nevertheless, in the current evaluation methodology, there is still no way to determine how much each component contributes to the overall performances and how the components interact together. This hampers the poss… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
1

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
4
1

Relationship

4
6

Authors

Journals

citations
Cited by 26 publications
(25 citation statements)
references
References 26 publications
0
18
1
Order By: Relevance
“…The effectiveness of IR systems heavily depends on a large number of configurations that need to be tuned [28,57]. Configurations range from the choice of different system components, e.g., stopword lists, stemming methods, retrieval models, to model parameters.…”
Section: Optimizationmentioning
confidence: 99%
“…The effectiveness of IR systems heavily depends on a large number of configurations that need to be tuned [28,57]. Configurations range from the choice of different system components, e.g., stopword lists, stemming methods, retrieval models, to model parameters.…”
Section: Optimizationmentioning
confidence: 99%
“…The factors used in an ANOVA analysis do not have to be the components of a test collection. Ferro and Silvello [21,22] systematically varied the components of an IR system: stop list, stemmer, ranking model, and so on, by using the grid-of-points approach proposed by Ferro and Harman [19]. The analysis allowed the researchers to understand the relative impact of each system component on performance.…”
Section: Anovamentioning
confidence: 99%
“…the Grid of Points (GoP) 1 -arising from the combinatorial composition of several open-source publicly available components such as stop lists, stemmers, and IR models, and run against 6 different public test collections shared by the Text REtrieval Conference (TREC) international evaluation initiative. Thanks to this GoP, in [8] we presented the deep statistical analyses we run and the insights we gathered about the individual contributions of single IR components to the overall performances of fully working IR systems.…”
Section: Motivationsmentioning
confidence: 99%