There are several indices for measuring the similarity of two populations, including the ratio of the number of shared species to the number of distinct species (Jaccard's index) and the conditional probability of observing a shared species (Smith et al., (1)). However, these indices only take into account the number of species and species proportions of shared species.In this paper, we propose a new similarity index which includes the species proportions of both the shared and non-shared species in each population, and also propose a Nonparametric Maximum Likelihood Estimator (NPMLE) for this index. Bootstrap and delta methods are used to evaluate the standard errors of the NPMLE. Based on a loss function, we also compare a class of nonparametric estimators for the proposed index in various situations.
BackgroundGeographically weighted regression (GWR) is a modelling technique designed to deal with spatial non-stationarity, e.g., the mean values vary by locations. It has been widely used as a visualization tool to explore the patterns of spatial data. However, the GWR tends to produce unsmooth surfaces when the mean parameters have considerable variations, partly due to that all parameter estimates are derived from a fixed- range (bandwidth) of observations. In order to deal with the varying bandwidth problem, this paper proposes an alternative approach, namely Conditional geographically weighted regression (CGWR).MethodsThe estimation of CGWR is based on an iterative procedure, analogy to the numerical optimization problem. Computer simulation, under realistic settings, is used to compare the performance between the traditional GWR, CGWR, and a local linear modification of GWR. Furthermore, this study also applies the CGWR to two empirical datasets for evaluating the model performance. The first dataset consists of disability status of Taiwan’s elderly, along with some social-economic variables and the other is Ohio’s crime dataset.ResultsUnder the positively correlated scenario, we found that the CGWR produces a better fit for the response surface. Both the computer simulation and empirical analysis support the proposed approach since it significantly reduces the bias and variance of data fitting. In addition, the response surface from the CGWR reviews local spatial characteristics according to the corresponded variables.ConclusionsAs an explanatory tool for spatial data, producing accurate surface is essential in order to provide a first look at the data. Any distorted outcomes would likely mislead the following analysis. Since the CGWR can generate more accurate surface, it is more appropriate to use it exploring data that contain suspicious variables with varying characteristics.Electronic supplementary materialThe online version of this article (doi:10.1186/s12942-017-0085-9) contains supplementary material, which is available to authorized users.
SUMMARYFor two communities, species overlap has been defined by Smith et al. (1996) as the probability that a randomly selected species is present in both communities, given that it is present in at least one community. Species overlap can thus be used to describe the similarity of two communities. In contrast to the parametric estimator of Smith et al., we propose a Nonparametric Maximum Likelihood Estimator (NPMLE). We prove that the NPMLE is consistent and asymptotically normally distributed, and show that computation of the NPMLE and its standard error is straightforward. We also compare the NPMLE and the estimator of Smith et al. for a variety of situations.
Statewide discharge abstract data can be used by a hospital to quantify how its surgical practice differs from that of other hospitals (e.g., to show that it provides a more diverse, comprehensive, and physiologically complex selection of procedures in younger patients).
We studied anesthesia times for diagnostic and interventional radiology using anesthesia billing data and paper radiology logbooks. For computerized tomography and magnetic resonance imaging procedures, we tried to predict future anesthesia times by using historical anesthesia times classified by Current Procedural Terminology (CPT) codes. By this method, anesthesia times were estimated even less accurately than operating room cases. Computerized tomography and magnetic resonance imaging had many different CPT codes, most rare, and CPT codes reflected organs imaged, not scanning times. However, when, anesthesia times were estimated by expert judgment, face validity and accuracy were good. Lower and upper prediction bounds were also estimated from the expert estimates. For interventional radiology, predicting anesthesia times was challenging because few CPT codes accounted for most cases. Because interventional radiologists scheduled their elective cases into allocated time, the necessary goal was not to estimate the time to complete each case but rather the time to complete each day's entire series of elective cases including turnover times. We determined the time of day (e.g., 4 pm) up to when interventional radiology could schedule so that on 80% of days the anesthesia team finishes no later than a specified time (e.g., 6 pm). Both diagnostic and interventional radiology results were similarly less accurate when Version 9 of the International Classifications of Diseases' procedure codes was used instead of CPT.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.