2023
DOI: 10.1146/annurev-biodatasci-020722-020704
|View full text |Cite
|
Sign up to set email alerts
|

Addressing the Challenge of Biomedical Data Inequality: An Artificial Intelligence Perspective

Abstract: Artificial intelligence (AI) and other data-driven technologies hold great promise to transform healthcare and confer the predictive power essential to precision medicine. However, the existing biomedical data, which are a vital resource and foundation for developing medical AI models, do not reflect the diversity of the human population. The low representation in biomedical data has become a significant health risk for non-European populations, and the growing application of AI opens a new pathway for this he… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 11 publications
(5 citation statements)
references
References 120 publications
0
5
0
Order By: Relevance
“…Therefore, how to utilize data from different ancestry groups is crucial in a multi-ancestral machine learning strategy. We have categorized multi-ancestral (or multi-ethnic) machine learning schemes based on the way they utilize data from different subpopulations [ 7 , 8 ] (Table 2 ). Mixture learning indistinctly uses data from all ancestral populations for model training and testing.…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…Therefore, how to utilize data from different ancestry groups is crucial in a multi-ancestral machine learning strategy. We have categorized multi-ancestral (or multi-ethnic) machine learning schemes based on the way they utilize data from different subpopulations [ 7 , 8 ] (Table 2 ). Mixture learning indistinctly uses data from all ancestral populations for model training and testing.…”
Section: Methodsmentioning
confidence: 99%
“…Recent advances in high-throughput genotyping and genome sequencing technologies have enabled genome-wide association studies (GWAS) in large cohorts, providing the foundation for genomic disease prediction. However, more than 80% of the existing GWAS data were acquired from individuals of European descent [ 1 7 ], and the ancestral (or ethnic) diversity in GWAS has not improved in recent years [ 1 , 5 ]. The lack of adequate genomic data for non-European populations, who make up approximately 84% of the world’s population, results in low-quality artificial intelligence (AI) models for these data-disadvantaged populations (DDPs).…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…An important element of our approach is the cyber security of the proposed solution 100,101 . The rapid development of artificial intelligence, including machine learning, is suggesting to us more and more new methods and techniques falling within the framework of computational intelligence that can accelerate and increase the accuracy of analyze, inference and prediction from data [102][103][104] .…”
Section: Study Limitationsmentioning
confidence: 99%