Deep learning models have reached or surpassed human-level performance in the field of medical imaging, especially in disease diagnosis using chest x-rays. However, prior work has found that such classifiers can exhibit biases in the form of gaps in predictive performance across protected groups. In this paper, we question whether striving to achieve zero disparities in predictive performance (i.e., group fairness) is the appropriate fairness definition in the clinical setting, as compared to minimax fairness, which focuses on maximizing the performance of the worst-case group. We benchmark the performance of nine methods in improving classifier fairness across these two definitions. We find, consistent with prior work on non-clinical data, that methods which strive to achieve better worst-group performance do not outperform simple data balancing. We also find that methods which achieve group fairness do so by worsening performance for all groups. In light of these results, we discuss the utility of fairness definitions in the clinical setting, advocating for an investigation of the bias-inducing mechanisms in the underlying data generating process whenever possible.
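The contrast between the two fairness definitions above can be made concrete with a small sketch. The per-group AUC values below are purely illustrative (not results from this paper): group fairness targets the *gap* between groups, while minimax fairness targets the *worst-group* performance.

```python
# Illustrative per-group AUC values for a classifier evaluated on three
# protected groups (hypothetical numbers, for exposition only).
per_group_auc = {"group_A": 0.86, "group_B": 0.81, "group_C": 0.84}

# Group fairness: the objective is to drive the performance gap
# across groups to zero.
gap = max(per_group_auc.values()) - min(per_group_auc.values())

# Minimax fairness: the objective is to maximize the performance
# of the worst-case group, regardless of the gap.
worst_group_auc = min(per_group_auc.values())

print(f"performance gap (group-fairness objective): {gap:.2f}")
print(f"worst-group AUC (minimax objective): {worst_group_auc:.2f}")
```

Note that a model can satisfy group fairness with uniformly poor performance (all groups at, say, 0.70 AUC yields a zero gap), which is precisely the failure mode the benchmark investigates.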
Data and Code Availability
We make use of two chest x-ray datasets: MIMIC-CXR (Johnson et al., 2019) and CheXpert (Irvin et al., 2019). Both datasets are publicly available pending appropriate data usage agreements. Demographic data for patients in MIMIC-CXR were obtained from MIMIC-IV (Johnson et al., 2021), available through PhysioNet (Goldberger et al., 2000). We analyze an additional radiologist-labelled dataset in this paper. We recruit a board-certified radiologist co-author to manually label 1,200 reports in MIMIC-CXR which have been labelled as No Finding by the CheXpert labeller, an automatic rule-based NLP model (Irvin et al., 2019). This dataset, along with code to reproduce our results, can be found at https://github.com/MLforHealth/CXR_Fairness.