“…We evaluated seven site handling strategies after partitioning the discovery sample into different age bins as follows: (i) a single bin with the full sample age range (5–90 years); (ii) nine bins each covering sequential 10‐year intervals, that is, age ≤ 10 years, 10 < age ≤ 20 years, 20 < age ≤ 30 years, 30 < age ≤ 40 years, 40 < age ≤ 50 years, 50 < age ≤ 60 years, 60 < age ≤ 70 years, 70 < age ≤ 80 years, and 80 < age ≤ 90 years; (iii) four bins each covering sequential 20‐year intervals, that is, age ≤ 20 years, 20 < age ≤ 40 years, 40 < age ≤ 60 years, and 60 < age ≤ 80 years; (iv) three bins each covering sequential 30‐year intervals, that is, age ≤ 30 years, 30 < age ≤ 60 years, and 60 < age ≤ 90 years; (v) two age bins each covering sequential 40‐year intervals, that is, age ≤ 40 years and 40 < age ≤ 90 years. Seven site handling strategies were separately applied to each bin to perform data residualization with respect to site using: (i) Combat‐GAM (Pomponio et al, 2020); (ii) CovBat without age variability preservation (Chen et al, 2021); (iii) CovBat with age variability preservation (Chen et al, 2021); (iv) Subsampling Maximum‐mean‐distance Algorithm (SMA) (Wang et al, 2023; Zhou et al, 2018); (v) Invariant Conditional Variational Auto‐Encoder (ICVAE) (Moyer et al, 2020; Wang et al, 2023); (vi) generalized linear model (de Lange et al, 2022); and (vii) no site harmonization. In the case of Combat‐GAM, age was specified as the smooth term in the model while the empirical Bayes estimates were used for site effects, without custom boundaries for the smoothing terms.…”