The risk of racial bias while tracking influenza-related content on social media using machine learning

Lwowski, Brandon; Rios, Anthony

doi:10.1093/jamia/ocaa326

Cited by 19 publications

(11 citation statements)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Namely, the likelihood of segregation in networks, an effect known as filter bubble [32,34], and the ranking signals that govern how message contents become available to users in a network [3,21,29]. Other issues have been identified derived from online recruitment, especifically participant retention [28], age bias [17] and racial and economical bias [19,31,50]. Some studies have shown that community-based methods, including personal references, local advertisements and pamphlets, indeed recruit more racially diverse samples than other methods [41], and may ensure better chances of participant retention [2,7,44,46,49].…”

Section: Discussionmentioning

confidence: 99%

“…In this sense, differences in age have be found between participants recruited via social networks * E-mail: alvaropastor@uoc.edu and other methods [17]. As well as a risk for racial and economical bias has been reported when using social networks data [19,31,50]. Moreover, the literature from a diversity of disciplines, including machine learning and data mining [6,23,36], and marketing [20,27,48], sheds light on the various management layers that aim to optimise links between users and the flow of messages.…”

Section: But How Do Social Network Work?mentioning

confidence: 99%

See 1 more Smart Citation

A case against social networks as the only means for recruitment in scientific studies with humans

Pastor¹

2022

Preprint

View full text Add to dashboard Cite

Social networks have been suggested as key media in the advertisement and recruitment of participants for human studies. They are appreciated due to their cost-effectiveness and apparent large-scale reach. Indeed, for a number of underfunded scientific studies, social networks effectively replace all other media in recruitment efforts.But, the organisation and operation of social networks not necessarily reflects the connections of a population in the real world. And, in contrast to other media, in social networks the rules that govern the spread of messages follow unstable ranking methods that sit entirely out of the control of investigators. This article reviews how social networks are managed environments that implement a variety of opportunistic regulations which control links between-users and the messages every user sees. Thus, their use as sole media in recruitment of participants for scientific trials would yield a non-representative biased sample of the population.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: But How Do Social Network Work?mentioning

confidence: 99%

A case against social networks as the only means for recruitment in scientific studies with humans

Pastor¹

2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Machine learning starts with raw data harvested from an ever-growing collection of data sources. Electronic health records, administrative health records, data warehouses, social media data, 6 as well as population health data are collected and stored with various entities. 7 If the raw data available for training and validation is biased, the analytical results will be biased.…”

Section: Raw Data Can Be Racially Biasedmentioning

confidence: 99%

Misguided Artificial Intelligence: How Racial Bias is Built Into Clinical Models

Jindal

2022

Journal of Brown Hospital Medicine

View full text Add to dashboard Cite

Artificial Intelligence is being used today to solve a myriad of problems. While there is significant promise that AI can help us address many healthcare issues, there is also concern that health inequities can be exacerbated. This article looks specifically at predictive models in regards to racial bias. Each phase of the model building process including raw data collection and processing, data labelling, and implementation of the model can be subject to racial bias. This article aims to explore some of the ways in which this occurs.

show abstract

“…As a characteristic example, models trained to predict intelligence [ 16 , 17 ] might provide a statistically significant predictive performance by picking up solely on age-related variance [ 18 , 19 ]. Moreover, various types of systematic sampling bias, as well as stochastic group differences in the training sample, can result in confounded models (e.g., racially biased machine learning models [ 6 , 20 , 21 ]).…”

Section: Introductionmentioning

confidence: 99%

Statistical quantification of confounding bias in machine learning models

Spisák¹

2022

GigaScience

View full text Add to dashboard Cite

Background The lack of nonparametric statistical tests for confounding bias significantly hampers the development of robust, valid, and generalizable predictive models in many fields of research. Here I propose the partial confounder test, which, for a given confounder variable, probes the null hypotheses of the model being unconfounded. Results The test provides a strict control for type I errors and high statistical power, even for nonnormally and nonlinearly dependent predictions, often seen in machine learning. Applying the proposed test on models trained on large-scale functional brain connectivity data (N= 1,865) (i) reveals previously unreported confounders and (ii) shows that state-of-the-art confound mitigation approaches may fail preventing confounder bias in several cases. Conclusions The proposed test (implemented in the package mlconfound; https://mlconfound.readthedocs.io) can aid the assessment and improvement of the generalizability and validity of predictive models and, thereby, fosters the development of clinically useful machine learning biomarkers.

show abstract

The risk of racial bias while tracking influenza-related content on social media using machine learning

Cited by 19 publications

References 22 publications

A case against social networks as the only means for recruitment in scientific studies with humans

A case against social networks as the only means for recruitment in scientific studies with humans

Misguided Artificial Intelligence: How Racial Bias is Built Into Clinical Models

Statistical quantification of confounding bias in machine learning models

Contact Info

Product

Resources

About