2022
DOI: 10.1101/2022.09.22.508637
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Comprehensive evaluation of harmonization on functional brain imaging for multisite data-fusion

Abstract: To embrace big-data neuroimaging, harmonization of site effect in resting-state functional magnetic resonance imaging (R-fMRI) data fusion is a fundamental challenge. Comprehensive evaluation of potentially effective harmonization strategies, particularly with specifically collected data has been rare, especially for R-fMRI metrics. Here, we comprehensively assess harmonization strategies from multiple perspectives, including efficiency, individual identification, test-retest reliability and replicability of g… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 67 publications
(92 reference statements)
0
1
0
Order By: Relevance
“…We evaluated seven site handling strategies after partitioning the discovery sample into different age bins as follows: (i) a single bin with the full sample age range (5–90 years); (ii) nine bins each covering sequential 10‐year intervals, that is, age ≤ 10 years, 10 < age ≤ 20 years, 20 < age ≤ 30 years, 30 < age ≤ 40 years, 40 < age ≤ 50 years, 50 < age ≤ 60 years, 60 < age ≤ 70 years, 70 < age ≤ 80 years, and 80 < age ≤ 90 years; (iii) four bins each covering sequential 20‐year intervals, that is, age ≤ 20 years, 20 < age ≤ 40 years, 40 < age ≤ 60 years, and 60 < age ≤ 80 years; (iv) three bins each covering sequential 30‐year intervals, that is, age ≤ 30 years, 30 < age ≤ 60 years, and 60 < age ≤ 90 years; (v) two age bins each covering sequential 40‐year intervals, that is, age ≤ 40 years and 40 < age ≤ 90 years. Seven site handling strategies were separately applied to each bin to perform data residualization with respect to site using: (i) Combat‐GAM (Pomponio et al, 2020); (ii) CovBat without age variability preservation (Chen et al, 2021); (iii) CovBat with age variability preservation (Chen et al, 2021); (iv) Subsampling Maximum‐mean‐distance Algorithm (SMA) (Wang et al, 2023; Zhou et al, 2018); (v) Invariant Conditional Variational Auto‐Encoder (ICVAE) (Moyer et al, 2020; Wang et al, 2023); (vi) generalized linear model (de Lange et al, 2022); and (vii) no site harmonization. In the case of Combat‐GAM, age was specified as the smooth term in the model while the empirical Bayes estimates were used for site effects, without custom boundaries for the smoothing terms.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We evaluated seven site handling strategies after partitioning the discovery sample into different age bins as follows: (i) a single bin with the full sample age range (5–90 years); (ii) nine bins each covering sequential 10‐year intervals, that is, age ≤ 10 years, 10 < age ≤ 20 years, 20 < age ≤ 30 years, 30 < age ≤ 40 years, 40 < age ≤ 50 years, 50 < age ≤ 60 years, 60 < age ≤ 70 years, 70 < age ≤ 80 years, and 80 < age ≤ 90 years; (iii) four bins each covering sequential 20‐year intervals, that is, age ≤ 20 years, 20 < age ≤ 40 years, 40 < age ≤ 60 years, and 60 < age ≤ 80 years; (iv) three bins each covering sequential 30‐year intervals, that is, age ≤ 30 years, 30 < age ≤ 60 years, and 60 < age ≤ 90 years; (v) two age bins each covering sequential 40‐year intervals, that is, age ≤ 40 years and 40 < age ≤ 90 years. Seven site handling strategies were separately applied to each bin to perform data residualization with respect to site using: (i) Combat‐GAM (Pomponio et al, 2020); (ii) CovBat without age variability preservation (Chen et al, 2021); (iii) CovBat with age variability preservation (Chen et al, 2021); (iv) Subsampling Maximum‐mean‐distance Algorithm (SMA) (Wang et al, 2023; Zhou et al, 2018); (v) Invariant Conditional Variational Auto‐Encoder (ICVAE) (Moyer et al, 2020; Wang et al, 2023); (vi) generalized linear model (de Lange et al, 2022); and (vii) no site harmonization. In the case of Combat‐GAM, age was specified as the smooth term in the model while the empirical Bayes estimates were used for site effects, without custom boundaries for the smoothing terms.…”
Section: Methodsmentioning
confidence: 99%
“…The CovBat approach was implemented using R scripts (version 3.6.0). The SMA method was implemented using Matlab (version R2021a) with the largest sample as the target site, in accordance with recommendations of Wang et al (2023) The ICVAE was implemented using Python (version 3.8.10). To prevent data leakage, the harmonization process was applied separately to the training and test datasets during cross‐validation.…”
Section: Methodsmentioning
confidence: 99%
“…The interaction of depression and MCI was included in all GLM analyses in order to study the possible different effects of depressive symptoms on network functionality in MCI and cognitively healthy subjects. Furthermore, the analyses included MRI scanner type [60], sex, age, education, and total connectivity strength as covariates. The same model was applied to analyze differences in network metrics and structural measures (e.g., cortical thickness and volumes).…”
Section: Statistical Analyses Including Sensitivity Analysesmentioning
confidence: 99%
“…This challenge is not unique to dMRI data and affects other MRI quantities such as cortical thickness assessed from T1-weighted and T2-weighted imaging, 236 as well as statistics derived from fMRI measurements. 237 , 238 Significant research effort has been invested in developing methods for signal harmonization across different scanners 239 241 but the state of the art in these methods still requires measurements from many different subjects in each scanner. 242 , 243 This makes the clinical application of dMRI tractometry challenging because comparing an individual’s brain against a normative sample would have to happen either on the same scanner as the one on which the normative sample was acquired or on a scanner that has been harmonized with the scanner used for normative measurements (there are some important exceptions to this rule; for example, where a disease affects lateralization of the measurement 244 ).…”
Section: Future Perspectives and Open Questionsmentioning
confidence: 99%