ObjectivesIt is important to evaluate the quality of linkages between cohort and administrative data to discern the likely reliability of research using the linked data resource. We examined the quality and sample representativeness of linked 1958 National Child Development Study (NCDS) and Hospital Episode Statistics (HES) data.
ApproachWe examined the quality of the linkage in terms of the associations between key cohort member sociodemographic characteristics and successful linkage, and compared the levels of successful linkage within strata of NCDS variables which may be expected to be associated with hospital attendance (self-reported hospital attendance, self-rated general health, self-reported long-term illness), and hence with successful HES linkage. We additionally evaluated the population representativeness of the linked sample using external population statistics (hospital admission rates in the general population).
ResultsSome sociodemographic characteristics were found to be associated with successful HES linkage, with females slightly more likely than males (risk ratio 1.03; 95% confidence interval 1.00, 1.06) and those who had ≤ 1 person per room at birth more likely than those who had > 1.5 (1.12; 1.05, 1.19) to have linked data. There was a high level of correspondence between NCDS variables and HES linkage – e.g., 86.0% of cohort members who reported no day patient or in-patient attendance had no linked HES Admitted Patient Care (APC) data and 76.3% of those who reported having day patient or in-patient attendance did have linked HES APC data. The level and pattern of hospital admission rates over time was similar in the linked NCDS-HES data to the population statistics.
ConclusionThe quality and representativeness of the linked NCDS-HES data appears to be high. We hope that these analyses will both improve the quality and transparency of research using this linked data resource and encourage providers and users of other linked data resources to undertake and publish similarly thorough evaluations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.