Meta-analysis methods involve combining and analysing quantitative evidence from related studies to produce results based on a whole body of research. As such, metaanalyses are an integral part of evidence based medicine. Traditional methods for meta-analysis synthesise aggregate study level data obtained from study publications or study authors, such as a treatment effect estimate (for example, an odds ratio) and its associated uncertainty (for example, a standard error or confidence interval). An alternative but increasingly popular approach is meta-analysis of individual participant data, or individual patient data, in which the raw individual level data for each study are obtained and used for synthesis.1 In this article we describe the rationale for individual participant data meta-analysis and illustrate through applied examples why this strategy offers numerous advantages, both clinically and statistically, over the aggregate data approach.1 2 We outline when and how to initiate an individual participant data meta-analysis, the statistical issues in conducting one, how the findings should be reported, and what challenges this approach may bring. What are individual participant data?The term "individual participant data" relates to the data recorded for each participant in a study. In a hypertension trial, for example, the individual participant data could be the pre-treatment and post-treatment blood pressure, a treatment group indicator, and important baseline clinical characteristics such as age and sex, for each patient in each study (table). A set of individual participant data from multiple studies often comprises thousands of patients; this is the case in the table, so for brevity we do not show all rows of data here. This concept is in contrast to the term "aggregate data," which relates to information averaged or estimated across all individuals in a study, such as the mean treatment effect on blood pressure, the mean age, or the proportion of participants who are male. Such aggregate data are derived from the individual participant data themselves, so individual participant data can be considered the original source material. What is an individual participant data meta-analysis?As with any meta-analysis, an individual participant data meta-analysis aims to summarise the evidence on a particular clinical question from multiple related studies, such as whether a treatment is effective. The statistical implementation of an individual participant data meta-analysis crucially must preserve the clustering of patients within studies; it is inappropriate to simply analyse individual participant data as if they all came from a single study. Clusters can be retained during analysis by using a two step or a one step approach. 3 In the two step approach, the individual participant data are first analysed in each separate study independently by using a statistical method appropriate for the type of data being analysed; for example, a linear regression model might be fitted for continuous responses such as blo...
Many routinely used summary methods provide widely ranging estimates when applied to sparse data with high imbalance between the size of the studies' arms. A sensitivity analysis using several methods and continuity correction factors is advocated for routine practice.
Royston and Parmar (2002, Statistics in Medicine 21: 2175-2197) developed a class of flexible parametric survival models that were programmed in Stata with the stpm command (Royston, 2001, Stata Journal 1: 1-28). In this article, we introduce a new command, stpm2, that extends the methodology. New features for stpm2 include improvement in the way time-dependent covariates are modeled, with these effects far less likely to be over parameterized; the ability to incorporate expected mortality and thus fit relative survival models; and a superior predict command that enables simple quantification of differences between any two covariate patterns through calculation of time-dependent hazard ratios, hazard differences, and survival differences. The ideas are illustrated through a study of breast cancer survival and incidence of hip fracture in prostate cancer patients.
Background Population-based cancer survival estimates provide valuable insights into the effectiveness of cancer services and can reflect the prospects of cure. As part of the second phase of the International Cancer Benchmarking Partnership (ICBP), the Cancer Survival in High-Income Countries (SURVMARK-2) project aims to provide a comprehensive overview of cancer survival across seven high-income countries and a comparative assessment of corresponding incidence and mortality trends. Methods In this longitudinal, population-based study, we collected patient-level data on 3•9 million patients with cancer from population-based cancer registries in 21 jurisdictions in seven countries (
There has been a recent growth in the use of Bayesian methods in medical research. The main reasons for this are the development of computer intensive simulation based methods such as Markov chain Monte Carlo (MCMC), increases in computing power and the introduction of powerful software such as WinBUGS. This has enabled increasingly complex models to be fitted. The ability to fit these complex models has led to MCMC methods being used as a convenient tool by frequentists, who may have no desire to be fully Bayesian. Often researchers want 'the data to dominate' when there is no prior information and thus attempt to use vague prior distributions. However, with small amounts of data the use of vague priors can be problematic. The results are potentially sensitive to the choice of prior distribution. In general there are fewer problems with location parameters. The main problem is with scale parameters. With scale parameters, not only does one have to decide the distributional form of the prior distribution, but also whether to put the prior distribution on the variance, standard deviation or precision. We have conducted a simulation study comparing the effects of 13 different prior distributions for the scale parameter on simulated random effects meta-analysis data. We varied the number of studies (5, 10 and 30) and compared three different between-study variances to give nine different simulation scenarios. One thousand data sets were generated for each scenario and each data set was analysed using the 13 different prior distributions. The frequentist properties of bias and coverage were investigated for the between-study variance and the effect size. The choice of prior distribution was crucial when there were just five studies. There was a large variation in the estimates of the between-study variance for the 13 different prior distributions. With a large number of studies the choice of prior distribution was less important. The effect size estimated was not biased, but the precision with which it was estimated varied with the choice of prior distribution leading to varying coverage intervals and, potentially, to different statistical inferences. Again there was less of a problem with a larger number of studies. There is a particular problem if the between-study variance is close to the boundary at zero, as MCMC results tend to produce upwardly biased estimates of the between-study variance, particularly if inferences are based on the posterior mean. The choice of 'vague' prior distribution can lead to a marked variation in results, particularly in small studies. Sensitivity to the choice of prior distribution should always be assessed.
Purpose A dramatic improvement in the survival of patients with chronic myeloid leukemia (CML) occurred after the introduction of imatinib mesylate, the first tyrosine kinase inhibitor (TKI). We assessed how these changes affected the life expectancy of patients with CML and life-years lost as a result of CML between 1973 and 2013 in Sweden. Materials and Methods Patients recorded as having CML in the Swedish Cancer Registry from 1973 to 2013 were included in the study and followed until death, censorship, or end of follow-up. The life expectancy and loss in expectation of life were predicted from a flexible parametric relative survival model. Results A total of 2,662 patients with CML were diagnosed between 1973 and 2013. Vast improvements in the life expectancy of these patients were seen over the study period; larger improvements were seen in the youngest ages. The great improvements in life expectancy translated into great reductions in the loss in expectation of life. Patients of all ages diagnosed in 2013 will, on average, lose < 3 life-years as a result of CML. Conclusion Imatinib mesylate and new TKIs along with allogeneic stem cell transplantation and other factors have contributed to the life expectancy in patients with CML approaching that of the general population today. This will be an important message to convey to patients to understand the impact of a CML diagnosis on their life. In addition, the increasing prevalence of patients with CML will have a great effect on future health care costs as long as continuous TKI treatment is required.
SUMMARYMeta-analysis of individual patient data (IPD) is the gold-standard for synthesizing evidence across clinical studies. However, for some studies IPD may not be available and only aggregate data (AD), such as a treatment effect estimate and its standard error, may be obtained. In this situation, methods for combining IPD and AD are important to utilize all the available evidence. In this paper, we develop and assess a range of statistical methods for combining IPD and AD in meta-analysis of continuous outcomes from randomized controlled trials.The methods take either a one-step or a two-step approach. The latter is simple, with IPD reduced to AD so that standard AD meta-analysis techniques can be employed. The one-step approach is more complex but offers a flexible framework to include both patient-level and trial-level parameters. It uses a dummy variable to distinguish IPD trials from AD trials and to constrain which parameters the AD trials estimate. We show that this is important when assessing how patient-level covariates modify treatment effect, as aggregate-level relationships across trials are subject to ecological bias and confounding. We thus develop models to separate within-trial and across-trials treatment-covariate interactions; this ensures that only IPD trials estimate the former, whilst both IPD and AD trials estimate the latter in addition to the pooled treatment effect and any between-study heterogeneity. Extension to multiple correlated outcomes is also considered. Ten IPD trials in hypertension, with blood pressure the continuous outcome of interest,
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.