2020
DOI: 10.1007/s13222-020-00335-x

Evaluation Infrastructures for Academic Shared Tasks

Abstract: Academic search systems aid users in finding information covering specific topics of scientific interest and have evolved from early catalog-based library systems to modern web-scale systems. However, evaluating the performance of the underlying retrieval approaches remains a challenge. An increasing amount of requirements for producing accurate retrieval results have to be considered, e.g., close integration of the system's users. Due to these requirements, small to mid-size academic search systems cannot eva…

Cited by 5 publications (4 citation statements)
References 22 publications (21 reference statements)
“…Following previous living lab experiments, we implement the interleaving method with the Team-Draft-Interleaving algorithm [12]. More specifically, we refactored exactly the same implementation for the highest degree of comparability. Furthermore, we follow Gingstad et al.'s proposal of a weighted score based on click events [5] and define the Reward as…”
Section: Evaluation Metrics (mentioning, confidence: 99%)
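The interleaving step quoted above can be illustrated with a minimal sketch of Team-Draft Interleaving. The code below is an illustration only, assuming two ranked lists of document ids; it is not the refactored implementation referenced in the citing paper, and all function and variable names are made up.

```python
# Minimal sketch of Team-Draft Interleaving over two ranked lists of document
# ids. Illustrative only; not the implementation used in the cited experiments.
import random

def team_draft_interleave(ranking_a, ranking_b, length=10, rng=random):
    """Merge two rankings, remembering which system contributed each result."""
    interleaved = []
    team_a, team_b = set(), set()
    while len(interleaved) < length:
        rem_a = [d for d in ranking_a if d not in interleaved]
        rem_b = [d for d in ranking_b if d not in interleaved]
        if not rem_a and not rem_b:
            break
        # The system with fewer picks chooses next; ties are broken by coin flip.
        a_turn = len(team_a) < len(team_b) or (
            len(team_a) == len(team_b) and rng.random() < 0.5
        )
        if a_turn and rem_a:
            doc = rem_a[0]
            team_a.add(doc)
        elif rem_b:
            doc = rem_b[0]
            team_b.add(doc)
        else:  # only ranking A has candidates left
            doc = rem_a[0]
            team_a.add(doc)
        interleaved.append(doc)
    return interleaved, team_a, team_b

# Example: interleave two hypothetical result lists for one query.
ranking, team_a, team_b = team_draft_interleave(
    ["d1", "d2", "d3", "d4"], ["d3", "d5", "d1", "d6"], length=6
)
print(ranking)
```

The returned team_a and team_b sets record which system contributed each shown document, which is what later allows clicks on the interleaved list to be credited back to the contributing system.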
“…Progress in the field of academic search and its related domains is usually evaluated by means of shared tasks that are based on the principles of Cranfield/TREC-style studies [13]. Most recently, the TREC-COVID evaluation campaign run by NIST attracted a large number of participants and showed the strong impact of scientific retrieval tasks in the community.…”
Section: Introduction (mentioning, confidence: 99%)
“…Through the STELLA infrastructure, experimental systems of both types could be integrated into the live systems. Further, STELLA creates an interleaved ranking by systematically combining the results from two systems [8, 22]. More lifelike results and insights are expected from using real user interactions to assess search systems.…”
Section: Related Work (mentioning, confidence: 99%)
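To sketch how real user interactions can then be turned into a head-to-head comparison, the example below credits each click to the system whose team contributed the clicked document and counts per-session outcomes. This is a simplified illustration under assumed data, not STELLA's actual scoring; the weighted click-based Reward mentioned in the first citation statement is not reproduced here, and all document ids and sessions are invented.

```python
from collections import Counter

def session_outcome(clicked_docs, team_a, team_b):
    """Decide which system 'wins' a session by counting credited clicks."""
    clicks_a = sum(1 for d in clicked_docs if d in team_a)
    clicks_b = sum(1 for d in clicked_docs if d in team_b)
    if clicks_a > clicks_b:
        return "A"
    if clicks_b > clicks_a:
        return "B"
    return "tie"

# Toy log of three sessions: (clicked documents, team A's picks, team B's picks).
logged_sessions = [
    ({"d1", "d4"}, {"d1", "d2"}, {"d3", "d4"}),  # one credited click each -> tie
    ({"d3"},       {"d1", "d2"}, {"d3", "d4"}),  # B wins
    ({"d1", "d2"}, {"d1", "d2"}, {"d3", "d4"}),  # A wins
]
outcomes = Counter(session_outcome(c, a, b) for c, a, b in logged_sessions)
print(outcomes)  # e.g. Counter({'tie': 1, 'B': 1, 'A': 1})
```

Aggregating such outcomes over many live sessions is the basic way an interleaving experiment decides which of the two systems performs better for real users.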
“…The Living Labs for Academic Search (LiLAS) workshop fosters the discussion, research, and evaluation of academic search systems, and it applies the concept of living labs to the domain of academic search [13]. The goal is to expand knowledge on improving search for academic resources such as literature and research data, and on the interlinking between these resources.…”
Section: Introduction (mentioning, confidence: 99%)