This paper proposes new methods of estimating missing values in time series data while comparing them with existing methods. The new methods are based on the row, column and overall averages of time series data arranged in a Buys-Ballot table with m rows and s columns. The methods assume that 1) only one value is missing at a time, 2) the trending curve may be linear, quadratic or exponential and 3) the decomposition method is either Additive or Multiplicative. The performances of the methods are assessed by comparing accuracy measures (MAE, MAPE and RMSE) computed from the deviations of estimates of the missing values from the actual values used in simulation. Results show that, under the stated assumptions, estimates from the new method based on full decomposition of a series is the best (in terms of the accuracy measures) when compared with other two new and the existing methods.
This paper is an extension and generalization of the study carried out by [1] on the estimation of the population ratio (R) of the population means of two variables (y and x) under Simple Random Sampling (SRS) scheme, using a variable transformation of the auxiliary variable, x. All the six estimators proposed by [1] are easily identified as special cases of the proposed class of estimators. Asymptotic properties of the proposed class of estimators are derived theoretically and subsequently verified using empirical illustrations. Some of the proposed estimators are found to have relatively large gains in efficiency over the customary ratio estimator, ˆ/ R y x = for the given data set.
This paper presents some tests for seasonality in a time series data which considers the model structure and the nature of trending curve. The tests were applied to the row variances of the Buys Ballot
This paper discusses comparison of two time series decomposition methods: The Least Squares Estimation (LSE) and Buys-Ballot Estimation (BBE) methods. As noted by Iwueze and Nwogu (2014), there exists a research gap for the choice of appropriate model for decomposition and detection of presence of seasonal effect in a series model. Estimates of trend parameters and seasonal indices are all that are needed to fill the research gap. However, these estimates are obtainable through the Least Squares Estimation (LSE) and Buys-Ballot Estimation (BBE) methods. Hence, there is need to compare estimates of the two methods and recommend. The comparison of the two methods is done using the Accuracy Measures (Mean Error (ME)), Mean Square Error (MSE), the Mean Absolute Error (MAE), and the Mean Absolute Percentage Error (MAPE). The results from simulated series show that for the additive model; the summary statistics (ME, MSE and MAE) for the two estimation methods and for all the selected trending curves are equal in all the simulations both in magnitude and direction. For the multiplicative model, results show that when a series is dominated by trend, the estimates of the parameters by both methods become less precise and differ more widely from each other. However, if conditions for successful transformation (using the logarithmic transform in linearizing the multiplicative model to additive model) are met, both of them give similar results.
Motivation Probabilistic Identification of bacterial essential genes using TraDIS data based on Tn5 libraries has received relatively little attention in the literature; most methods are designed for mariner transposon insertions. Analysis of Tn5 transposon-based genomic data is challenging due to the high insertion density and genomic resolution. We present a novel probabilistic Bayesian approach for classifying bacterial essential genes using transposon insertion density derived from transposon insertion sequencing data. We implement a Markov chain Monte Carlo sampling procedure to estimate the posterior probability that any given gene is essential. We implement a Bayesian decision theory approach to selecting essential genes. We assess the effectiveness of our approach via analysis of both simulated data and three previously published Escherichia coli, Salmonella Typhimurium and Staphylococcus aureus datasets. These three bacteria have relatively well characterised essential genes which allows us to test our classification procedure using receiver operating characteristic curves and area under the curves. We compare the classification performance with that of Bio-Tradis, a standard tool for bacterial gene classification. Results Our method is able to classify genes in the three datasets with areas under the curves between 0.967 and 0.983. Our simulated synthetic datasets show that both the number of insertions and the extent to which insertions are tolerated in the distal regions of essential genes are both important in determining classification accuracy. Importantly our method gives the user the option of classifying essential genes based on the user-supplied costs of false discovery and false non-discovery. Availability An R package that implements the method presented in this paper is available for download from https://github.com/Kevin-walters/insdens Supplementary information Supplementary data are available at Bioinformatics online.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.