“…Running topic searches across databases to calculate the overlap, uniqueness, recall, and/or precision of results in each database is frequently used to test performance. Over the past few decades, studies have tested database performance for comparing discovery services (Ciccone and Vickery, 2015; Hanneke and O'Brien, 2016) and retrieval of the literature in a variety of fields, such as agricultural sciences (Brooks, 1980), education (Finch, 2010), geography (Ştirbu et al., 2015), health sciences (Roberts, 1999; Snow and Ifshin, 1984; Shultz, 2007; Stokes et al., 2009; Tober, 2011), history (Newton and Tellman, 2010), sociology (Todd, 2006), and toxicology (Bawden and Brock, 1985).…”
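
The four measures named in the passage can be illustrated with a minimal sketch. The definitions used here are common ones, but they are assumptions, since the cited studies may define each measure differently. Overlap is taken as the number of a database's records that also appear in at least one other database, uniqueness as the number found in no other database, recall as the share of all known relevant records that the database retrieved, and precision as the share of retrieved records that are relevant. The function name `database_metrics`, the database labels, and the record IDs are hypothetical.

```python
def database_metrics(results, relevant):
    """Compute overlap, uniqueness, recall, and precision per database.

    results:  dict mapping a database name to the set of record IDs it retrieved
    relevant: set of record IDs judged relevant to the topic (assumed known)
    """
    metrics = {}
    for name, retrieved in results.items():
        # Records retrieved by any of the other databases.
        others = set().union(*(r for n, r in results.items() if n != name))
        # Relevant records that this database retrieved.
        hits = retrieved & relevant
        metrics[name] = {
            "overlap": len(retrieved & others),   # also found elsewhere
            "unique": len(retrieved - others),    # found only here
            "recall": len(hits) / len(relevant) if relevant else 0.0,
            "precision": len(hits) / len(retrieved) if retrieved else 0.0,
        }
    return metrics


# Hypothetical search results for one topic in two databases.
results = {"db_a": {1, 2, 3, 4}, "db_b": {3, 4, 5}}
relevant = {2, 3, 5, 6}

for name, values in database_metrics(results, relevant).items():
    print(name, values)
```

In this made-up example both databases retrieve two of the four relevant records, so both have a recall of 0.5, while `db_b` has the higher precision because it retrieved fewer records overall.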