There is a burgeoning body of evidence on the average impacts of educational programs. Yet, for many local decision makers, because impacts can vary across sites, the question of whether a certain program will work in their particular district or school remains. This article addresses the question of the generalizability of large-scale average causal effects to specific locales, that is, from “large to small.” Our method evaluates whether impacts measured on the large-scale generalize to individual sites and whether adjusting for variation among sites in their characteristics accounts for impact heterogeneity that limits generalization. Our application to a multisite experimental evaluation in education shows that the average impact does not readily generalize to sites, with discrepancies between 0.17 and 0.38 standard deviations, and that characteristics of sites account only minimally for this variation. This emphasizes need for caution by local decision makers in accepting average impact findings as generalizable to their specific locales.