Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval 2020
DOI: 10.1145/3397271.3401395
|View full text |Cite
|
Sign up to set email alerts
|

A Lightweight Environment for Learning Experimental IR Research Practices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
8

Relationship

4
4

Authors

Journals

citations
Cited by 10 publications
(7 citation statements)
references
References 7 publications
0
7
0
Order By: Relevance
“…The list goes on. These myriad details often trip up new researchers who are just learning systems-oriented IR evaluation methodology (motivating work such as Akkalyoncu Yilmaz et al [2]), and occasionally subtle issues confuse experienced researchers as well. 1 The explicit goal of Pyserini is to make these "easy things" easy, supporting common tasks and reducing the possibility of confusion as much as possible.…”
Section: Design Philosophymentioning
confidence: 99%
See 2 more Smart Citations
“…The list goes on. These myriad details often trip up new researchers who are just learning systems-oriented IR evaluation methodology (motivating work such as Akkalyoncu Yilmaz et al [2]), and occasionally subtle issues confuse experienced researchers as well. 1 The explicit goal of Pyserini is to make these "easy things" easy, supporting common tasks and reducing the possibility of confusion as much as possible.…”
Section: Design Philosophymentioning
confidence: 99%
“…A number of these features for sparse retrieval are illustrated in Figure 5 and available via the IndexReader object, which can be initialized with pre-built indexes in the same way as the searcher classes. 2 In (L7-9), we illustrate how to iterate over all terms in a corpus (i.e., its dictionary) and access each term's document frequency and collection frequency. Here, we use standard Python tools to select and print out the first 10 terms alphabetically.…”
Section: Access To System Internalsmentioning
confidence: 99%
See 1 more Smart Citation
“…More broadly, with its diverse and mature ecosystem, Python has emerged as the language of choice for most data scientists today. Anticipating this gap, we have been working on Pyserini, 2 Python bindings for Anserini, since late 2019 (Yilmaz et al, 2020). Pyserini is released as a well-documented, easy-to-use Python module distributed via PyPI and easily installable via pip.…”
Section: Abstractearchmentioning
confidence: 99%
“…However, Python is the main language for PyTorch (Paszke et al, 2019) and TensorFlow (Abadi et al, 2016), the two most popular neural network toolkits today, and more broadly, Python has emerged as the language of choice for applied machine learning today in part due to its diverse and mature ecosystem. Pyserini (Yilmaz et al, 2020) 2 bridges the gap between the JVM and Python by providing a Python interface to Anserini. Together, Anserini and Pyserini provide basic keyword search capabilities to arbitrary corpora, which include tools to fetch raw document texts as well as utilities to access various term statistics.…”
Section: The Covidex Stackmentioning
confidence: 99%