“…These trials feature large samples of students within schools randomly assigned to condition (12,486 and 26,406, respectively), large numbers of school sites intentionally sampled to permit cross-site comparisons (65 and 21, with the latter further divided into 365 Race × First-Generation Status × College × Cohort groups), and preregistered hypotheses and analyses. Such precautions are necessary because heterogeneity findings can be unreliable (Bloom & Michalopoulos, 2013), especially with small samples (Sherman & Pashler, 2019). Moreover, each intervention was homogeneously persuasive across sites, as assessed by manipulation checks.…”