2021
DOI: 10.1016/j.neuroimage.2021.118044

Resample aggregating improves the generalizability of connectome predictive modeling

Abstract: It is a longstanding goal of neuroimaging to produce reliable, generalizable models of brain-behavior relationships. More recently, data-driven predictive models have become popular. However, overfitting is a common problem with statistical models, which impedes model generalization. Cross-validation (CV) is often used to estimate expected model performance within sample. Yet, the best way to generate brain-behavior models, and apply them out-of-sample on an unseen dataset, is unclear. As a solution, this stu…


Cited by 11 publications (10 citation statements)
References 73 publications (96 reference statements)
“…Cross-ethnicity/race biases were not investigated for cross-dataset prediction considering the length of this article. However, it should be acknowledged that the generalizability of behavioral prediction models across datasets is a crucial research topic and is still under intensive investigation ( 20 , 45 ). How predictive models trained in one dataset could generalize to multiple ethnic/racial groups in another dataset should be examined in the future.…”
Section: Discussion
confidence: 99%
“…Because the HCP 7T dataset is composed of data from individuals of varying degrees of genetic relatedness (monozygotic and dizygotic twins, non-twin siblings, and un-related individuals; 93 unique families), all individuals from the same family were randomly assigned to one of two groups of 88 (i.e., split-half cross-validation), with one group being used to train a model that would then be tested on the other (and vice versa). The following approach was then applied to 100 of these random splits of the data to assess the performance of rCPM across different training/testing sets and to build a bagged model that is more robust to overfitting ( O’Connor et al, 2021 ).…”
Section: Methods
confidence: 99%
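The family-aware split-half bagging procedure quoted above can be sketched roughly as follows. Everything here is illustrative: the data are random stand-ins, a simple ridge fit replaces the actual rCPM model, and the function name `family_split_half` is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in data: 176 subjects from 93 families, a
# connectivity feature matrix X (subjects x edges), and behavior y.
# The actual HCP 7T data and rCPM model are not reproduced here.
n_subjects, n_edges, n_families = 176, 200, 93
family_id = rng.integers(0, n_families, size=n_subjects)
X = rng.standard_normal((n_subjects, n_edges))
y = rng.standard_normal(n_subjects)

def family_split_half(family_id, rng):
    """Assign whole families to one of two halves, so that related
    subjects never straddle the train/test boundary."""
    families = rng.permutation(np.unique(family_id))
    half_a = families[: len(families) // 2]
    in_a = np.isin(family_id, half_a)
    return np.where(in_a)[0], np.where(~in_a)[0]

# 100 random split-half iterations; a ridge fit stands in for the
# rCPM model trained on each split's training half.
coefs = []
for _ in range(100):
    train, _test = family_split_half(family_id, rng)
    Xt, yt = X[train], y[train]
    w = np.linalg.solve(Xt.T @ Xt + 10.0 * np.eye(n_edges), Xt.T @ yt)
    coefs.append(w)

# Bagged model: average the coefficients across the 100 splits.
w_bagged = np.mean(coefs, axis=0)
```

Assigning whole families to a half (rather than individual subjects) is what prevents twin or sibling relatedness from leaking information across the train/test boundary.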
“…It could then be the case that the CV models rely on stimulus-specific signals and may fail to predict gISC during viewing of a different movie. A bootstrap aggregating, or “bagging,” approach was used to test whether the 200 linear models trained on Day 1 movie watching and resting state data could predict Day 2 gISC (derived from a different set of stimuli) from Day 2 RSFC, as previous work has shown bagged CPM models to be more accurate and more generalizable than their non-bagged counterparts ( O’Connor et al., 2021 ). To construct the bagged model, RSFC edges that passed the P < .01 feature selection step in at least 10% (20/200, reflecting the 100 iterations of split-half cross-validation) of iterations were identified, yielding 1437 edges total.…”
Section: Methods
confidence: 99%
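The consensus feature-selection rule in this excerpt (an edge enters the bagged model if it passed P < .01 in at least 10%, i.e. 20 of 200, of the trained models) might look like this in outline. The p-values are simulated and all names are illustrative, not the cited work's code.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical p-values from the edge-wise feature-selection step of
# 200 models (100 split-half iterations x 2 halves) over 1000 edges.
n_models, n_edges = 200, 1000
pvals = rng.uniform(size=(n_models, n_edges))
pvals[:, :50] *= 0.02  # simulate a block of reliably informative edges

# Consensus rule: keep an edge for the bagged model if it passed
# P < .01 in at least 10% (20/200) of the models.
passed = pvals < 0.01                  # models x edges, boolean
selection_counts = passed.sum(axis=0)  # models selecting each edge
consensus_edges = np.where(selection_counts >= 0.10 * n_models)[0]
```

Requiring an edge to survive feature selection across many resamples filters out edges whose significance depends on a particular train/test split, which is the stability argument behind the bagged CPM approach.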
“…On the held-out set, unique subject-wise predictions were obtained by averaging across folds and occasional duplicate predictions due to Monte Carlo sampling, which could produce multiple predictions per participant (we ensured prior to computation that with 100 CV-splits, predictions were available for all participants). Such a strategy is known as CV-bagging [ 105 , 106 ] and can improve both performance and stability of results (the use of CV-bagging can explain why in Figs 3 and 4 and Fig. 3 - Figure supplement 1 the performance was sometimes slightly better on the held-out set compared to the cross-validation on the validation test).…”
Section: Methods
confidence: 99%
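The CV-bagging step described above, collapsing duplicate Monte Carlo predictions into one prediction per held-out subject by averaging, can be sketched with `np.bincount`. The data, split sizes, and names here are hypothetical stand-ins, not the cited pipeline.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: over 100 Monte Carlo CV splits, each split
# holds out 10 of 50 subjects and yields one prediction per held-out
# subject, so a subject typically accrues several predictions.
n_subjects, n_splits, held_out_size = 50, 100, 10
all_ids, all_preds = [], []
for _ in range(n_splits):
    held_out = rng.choice(n_subjects, size=held_out_size, replace=False)
    all_ids.append(held_out)
    all_preds.append(rng.standard_normal(held_out_size))  # stand-in outputs
ids = np.concatenate(all_ids)
preds = np.concatenate(all_preds)

# CV-bagging: one prediction per subject, averaging every fold-level
# prediction that subject received across the splits.
sums = np.bincount(ids, weights=preds, minlength=n_subjects)
counts = np.bincount(ids, minlength=n_subjects)
bagged_pred = sums / counts  # assumes each subject was predicted at least once
```

As the excerpt notes, one should verify beforehand that every participant receives at least one prediction across the splits; otherwise the average is undefined for the missed subjects.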