Jiming Liu scite author profile

BackgroundIn systematic reviews and meta-analysis, researchers often pool the results of the sample mean and standard deviation from a set of similar clinical trials. A number of the trials, however, reported the study using the median, the minimum and maximum values, and/or the first and third quartiles. Hence, in order to combine results, one may have to estimate the sample mean and standard deviation for such trials.MethodsIn this paper, we propose to improve the existing literature in several directions. First, we show that the sample standard deviation estimation in Hozo et al.’s method (BMC Med Res Methodol 5:13, 2005) has some serious limitations and is always less satisfactory in practice. Inspired by this, we propose a new estimation method by incorporating the sample size. Second, we systematically study the sample mean and standard deviation estimation problem under several other interesting settings where the interquartile range is also available for the trials.ResultsWe demonstrate the performance of the proposed methods through simulation studies for the three frequently encountered scenarios, respectively. For the first two scenarios, our method greatly improves existing methods and provides a nearly unbiased estimate of the true sample standard deviation for normal data and a slightly biased estimate for skewed data. For the third scenario, our method still performs very well for both normal data and skewed data. Furthermore, we compare the estimators of the sample mean and standard deviation under all three scenarios and present some suggestions on which scenario is preferred in real-world applications.ConclusionsIn this paper, we discuss different approximation methods in the estimation of the sample mean and standard deviation and propose some new estimation methods to improve the existing literature. We conclude our work with a summary table (an Excel spread sheet including all formulas) that serves as a comprehensive guidance for performing meta-analysis in different situations.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2288-14-135) contains supplementary material, which is available to authorized users.

show abstract

Optimally estimating the sample mean from the sample size, median, mid-range, and/or mid-quartile range

Luo

Wan

Liu

et al. 2016

Stat Methods Med Res

1,788

1,126

View full text Add to dashboard Cite

The era of big data is coming, and evidence-based medicine is attracting increasing attention to improve decision making in medical practice via integrating evidence from well designed and conducted clinical research. Meta-analysis is a statistical technique widely used in evidence-based medicine for analytically combining the findings from independent clinical trials to provide an overall estimation of a treatment effectiveness. The sample mean and standard deviation are two commonly used statistics in meta-analysis but some trials use the median, the minimum and maximum values, or sometimes the first and third quartiles to report the results. Thus, to pool results in a consistent format, researchers need to transform those information back to the sample mean and standard deviation. In this article, we investigate the optimal estimation of the sample mean for meta-analysis from both theoretical and empirical perspectives. A major drawback in the literature is that the sample size, needless to say its importance, is either ignored or used in a stepwise but somewhat arbitrary manner, e.g. the famous method proposed by Hozo et al. We solve this issue by incorporating the sample size in a smoothly changing weight in the estimators to reach the optimal estimation. Our proposed estimators not only improve the existing ones significantly but also share the same virtue of the simplicity. The real data application indicates that our proposed estimators are capable to serve as ''rules of thumb'' and will be widely applied in evidence-based medicine.

show abstract

Social Collaborative Filtering by Trust

Yang

Liu

et al. 2017

IEEE Trans. Pattern Anal. Mach. Intell.

450

286

View full text Add to dashboard Cite

Recommender systems are used to accurately and actively provide users with potentially interesting information or services. Collaborative filtering is a widely adopted approach to recommendation, but sparse data and cold-start users are often barriers to providing high quality recommendations. To address such issues, we propose a novel method that works to improve the performance of collaborative filtering recommendations by integrating sparse rating data given by users and sparse social trust network among these same users. This is a model-based method that adopts matrix factorization technique that maps users into low-dimensional latent feature spaces in terms of their trust relationship, and aims to more accurately reflect the users reciprocal influence on the formation of their own opinions and to learn better preferential patterns of users for high-quality recommendations. We use four large-scale datasets to show that the proposed method performs much better, especially for cold start users, than state-of-the-art recommendation algorithms for social collaborative filtering based on trust.

show abstract

Community Mining from Signed Social Networks

Yang

Cheung

Liu

2007

IEEE Trans. Knowl. Data Eng.

343

203

View full text Add to dashboard Cite

Clonal Integration Affects Growth, Photosynthetic Efficiency and Biomass Allocation, but not the Competitive Ability, of the Alien Invasive Alternanthera philoxeroides under Severe Stress

et al. 2008

View full text Add to dashboard Cite

show abstract

What are the underlying transmission patterns of COVID-19 outbreak? An age-specific social contact characterization

Yang

Shang³

et al. 2020

EClinicalMedicine

135

102

View full text Add to dashboard Cite

Background COVID-19 has spread to 6 continents. Now is opportune to gain a deeper understanding of what may have happened. The findings can help inform mitigation strategies in the disease-affected countries.Methods In this work, we examine an essential factor that characterizes the disease transmission patterns: the interactions among people. We develop a computational model to reveal the interactions in terms of the social contact patterns among the population of different age-groups. We divide a city's

show abstract

NFATc1 autoregulation: a crucial step for cell-fate determination

Serfling

Chuvpilo

Liu

et al. 2006

Trends in Immunology

View full text Add to dashboard Cite

Particle Competition and Cooperation in Networks for Semi-Supervised Learning

Breve

Zhao

Quiles

et al. 2012

IEEE Trans. Knowl. Data Eng.

View full text Add to dashboard Cite

Semi-supervised learning is one of the important topics in machine learning, concerning with pattern classification where only a small subset of data is labeled. In this paper, a new network-based (or graph-based) semi-supervised classification model is proposed. It employs a combined random-greedy walk of particles, with competition and cooperation mechanisms, to propagate class labels to the whole network. Due to the competition mechanism, the proposed model has a local label spreading fashion, i.e., each particle only visits a portion of nodes potentially belonging to it, while it is not allowed to visit those nodes definitely occupied by particles of other classes. In this way, a "divide-and-conquer" effect is naturally embedded in the model. As a result, the proposed model can achieve a good classification rate while exhibiting low computational complexity order in comparison to other network-based semi-supervised algorithms. Computer simulations carried out for synthetic and real-world data sets provide a numeric quantification of the performance of the method.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jiming Liu

Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range

Optimally estimating the sample mean from the sample size, median, mid-range, and/or mid-quartile range

Social Collaborative Filtering by Trust

Community Mining from Signed Social Networks

Clonal Integration Affects Growth, Photosynthetic Efficiency and Biomass Allocation, but not the Competitive Ability, of the Alien Invasive Alternanthera philoxeroides under Severe Stress

What are the underlying transmission patterns of COVID-19 outbreak? An age-specific social contact characterization

NFATc1 autoregulation: a crucial step for cell-fate determination

Particle Competition and Cooperation in Networks for Semi-Supervised Learning

Contact Info

Product

Resources

About