How search engines disseminate information about COVID-19 and why they should do better

Makhortykh, Mykola; Urman, Aleksandra; Ulloa, Roberto

doi:10.37016/mr-2020-017

Cited by 42 publications

(46 citation statements)

References 7 publications

(8 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To collect data, we utilized a set of virtual agents -that is software simulating user browsing behavior (e.g., scrolling web pages and entering queries) and recording its outputs. The benefits of this approach, which extends algorithmic auditing methodology introduced by Haim et al [24], is that it allows controlling for personalization [25] and randomization [35] factors influencing outputs of web search. In contrast to human actors, virtual agents can be easily synchronized (i.e., to isolate the effect of time at which the search actions are conducted) and deployed in a controlled environment (e.g., a network of virtual machines using the same IP range, the same type of operating system (OS) and the same browsing software) to limit the effects of personalization that might lead to skewed outputs.…”

Section: Methodsmentioning

confidence: 99%

Advances in Bias and Fairness in Information Retrieval

2021

Communications in Computer and Information Science

View full text Add to dashboard Cite

Web search engines influence perception of social reality by filtering and ranking information. However, their outputs are often subjected to bias that can lead to skewed representation of subjects such as professional occupations or gender. In our paper, we use a mixed-method approach to investigate presence of race and gender bias in representation of artificial intelligence (AI) in image search results coming from six different search engines. Our findings show that search engines prioritize anthropomorphic images of AI that portray it as white, whereas non-white images of AI are present only in non-Western search engines. By contrast, gender representation of AI is more diverse and less skewed towards a specific gender that can be attributed to higher awareness about gender bias in search outputs. Our observations indicate both the the need and the possibility for addressing bias in representation of societally relevant subjects, such as technological innovation, and emphasize the importance of designing new approaches for detecting bias in information retrieval systems.

show abstract

Section: Methodsmentioning

confidence: 99%

Advances in Bias and Fairness in Information Retrieval

2021

Communications in Computer and Information Science

View full text Add to dashboard Cite

show abstract

“…This overlap increased to 92% for Google and 86% for Yandex when search histories and cookies were deleted on the participants' computers in Moscow. Against this backdrop, and with reference to recent research that indicates that both Google and Yandex randomize significant proportions of their results (Makhortykh et al, 2020), we consider the results collected for this study as broadly representative of results obtained on standard Moscow computers.…”

Section: Data Collectionmentioning

confidence: 99%

Gauging reference and source bias over time: how Russia’s partially state-controlled search engine Yandex mediated an anti-regime protest event

Kravets

Toepfl

2021

Information, Communication & Society

View full text Add to dashboard Cite

Massive efforts have been dedicated to studying political search engine bias in democratic contexts, and a growing body of literature has scrutinized search engine censorship in authoritarian China. By contrast, very little is known still about political search engine bias within Russia's slightly more open authoritarian media system. In order to fill in this gap, this study asks: How has Russia's leading partially state-controlled search engine Yandex mediated a large-scale anti-regime protest event (anti-corruption protests on 26 March 2017) in the twenty months thereafter? The study analyzes a data set of 30,471 results, retrieved in regular intervals for nine query terms from four platforms:Yandex.ru, News.yandex.ru, Google.ru, and News.google.ru. As the findings demonstrate, both Yandex algorithms (by comparison with their Google counterparts) referred users to significantly fewer websites that contained information about the protest event (reference bias). In a similar vein, both Yandex algorithms forwarded users to fewer websites that regularly featured criticism of Russia's authoritarian leadership (source bias). Moreover, Yandex algorithms tended to be particularly biased in the event's immediate aftermath. In the first week after the protests, for instance, the difference in reference ratios between Google and Yandex Web searches were massive (mean difference: 23% points), while they were less pronounced during the remaining time period studied (mean difference: 7% points). These findings are highly politically relevant because rapid diffusion of information about protest events can be considered of key importance to further protest mobilization.

show abstract

“…Our decision to rely on the top-10 results only is motivated by the fact that users tend to pay the most attention to the first few results -i.e., those on the first results page [27]. A comparison of search results by browser has demonstrated that there are no major between-browser differences -a finding in contrast with those observed for text search results [23] -thus for the analysis we have proceeded aggregating the results for both browsers.…”

Section: Data Collectionmentioning

confidence: 99%

“…One form of retrieval bias, which the current paper focuses on, is source diversity bias. Originally discussed in the context of search engines' tendency to prioritize web pages with the highest number arXiv:2106.02715v1 [cs.IR] 4 Jun 2021 of visitors [16], source diversity bias is currently investigated in the context of prioritization of certain categories of sources in response to particular types of queries (e.g., [23]). A disproportionate visibility of specific types of web resources can diminish overall quality of search results [6] and provide unfair advantage to companies and individuals that own specific search engines [16] -e.g., through own-content bias, -or direct most of the traffic to a handful of well-established sources, a phenomenon also known as search concentration [19].…”

Section: Related Workmentioning

confidence: 99%

Auditing Source Diversity Bias in Video Search Results Using Virtual Agents

Urman

Makhortykh

Ulloa

2021

Companion Proceedings of the Web Conference 2021

Self Cite

View full text Add to dashboard Cite

We audit the presence of domain-level source diversity bias in video search results. Using a virtual agent-based approach, we compare outputs of four Western and one non-Western search engines for English and Russian queries. Our findings highlight that source diversity varies substantially depending on the language with English queries returning more diverse outputs. We also find disproportionately high presence of a single platform, YouTube, in top search outputs for all Western search engines except Google. At the same time, we observe that Youtube's major competitors such as Vimeo or Dailymotion do not appear in the sampled Google's video search results. This finding suggests that Google might be downgrading the results from the main competitors of Google-owned Youtube and highlights the necessity for further studies focusing on the presence of own-content bias in Google's search results.

show abstract

How search engines disseminate information about COVID-19 and why they should do better

Cited by 42 publications

References 7 publications

Advances in Bias and Fairness in Information Retrieval

Advances in Bias and Fairness in Information Retrieval

Gauging reference and source bias over time: how Russia’s partially state-controlled search engine Yandex mediated an anti-regime protest event

Auditing Source Diversity Bias in Video Search Results Using Virtual Agents

Contact Info

Product

Resources

About