2022
DOI: 10.1101/2022.09.22.509055
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deep transfer learning provides aParetoimprovement for multi-ancestral clinico-genomic prediction of diseases

Abstract: Accurate predisposition assessment is essential for the prevention and early detection of diseases. Polygenic scores and machine learning models have been developed for disease prediction based on genetic variants and other risk factors. However, over 80% of genomic data were acquired from individuals of European descent. Other ethnic groups comprise the vast majority of the world population and have a severe data disadvantage. Due to the lack of suitable training data, clinico-genomic risk prediction is less … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 68 publications
(89 reference statements)
0
4
0
Order By: Relevance
“…The synthetic datasets are available from https:// figsh are. com/ artic les/ media/ TLGP_ GM/ 25377 532 [107]. The source code is available from https:// github.…”
Section: Supplementary Informationmentioning
confidence: 99%
“…The synthetic datasets are available from https:// figsh are. com/ artic les/ media/ TLGP_ GM/ 25377 532 [107]. The source code is available from https:// github.…”
Section: Supplementary Informationmentioning
confidence: 99%
“…The current prevalent machine learning scheme for multiethnic data, the mixture learning scheme, and its main alternative, the independent learning scheme, have major obstacles in training optimal machine learning models for data-disadvantaged subpopulations (19,(93)(94)(95). The two Multiethnic machine learning schemes.…”
Section: Multiethnic Machine Learningmentioning
confidence: 99%
“…In transfer learning, a machine learning model trained on a data-rich subpopulation (source domain) can aid in training a model for a data-disadvantaged subpopulation (target domain) without affecting its own prediction accuracy. Thus, transfer learning provides a Pareto improvement (112) for multiethnic machine learning (95). Pareto improvement is a generally desired scenario in which some parties are better off without negatively impacting other parties in the system.…”
Section: Loss Function: the Difference Between Estimated And True Out...mentioning
confidence: 99%
See 1 more Smart Citation