Optimal estimator of hypothesis probability for data mining problems with small samples

Piegat, Andrzej; Landowski, Marek

doi:10.2478/v10006-012-0048-z

Cited by 3 publications

(9 citation statements)

References 23 publications

(12 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As an error measure of a probability estimation method we use the mean absolute error (abbreviated as MAE in the paper) for easier comparison with the findings reported in Piegat and Landowski (2012). Also, the preliminary experiments with another measure of error, root mean squared error (RMSE), revealed that the general observations and conclusions remain the same regardless of the error measure used.…”

Section: Historical Background and Related Workmentioning

confidence: 98%

“…Formula (4) is in their paper denoted by Ep ha and has one parameter a. The theoretical optimization of the mean absolute error (MAE) with the proposed formula (4) yielded the optimal value of a = √ 2 (Piegat and Landowski, 2012). After the replacement with the optimized value of a, the following formula, denoted by Ep h √ 2 in their paper, was obtained:…”

Section: Historical Background and Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Revisiting the Optimal Probability Estimator from Small Samples for Data Mining

Cestnik

2019

International Journal of Applied Mathematics and Computer Science

View full text Add to dashboard Cite

Estimation of probabilities from empirical data samples has drawn close attention in the scientific community and has been identified as a crucial phase in many machine learning and knowledge discovery research projects and applications. In addition to trivial and straightforward estimation with relative frequency, more elaborated probability estimation methods from small samples were proposed and applied in practice (e.g., Laplace’s rule, the m-estimate). Piegat and Landowski (2012) proposed a novel probability estimation method from small samples Eph√2 that is optimal according to the mean absolute error of the estimation result. In this paper we show that, even though the articulation of Piegat’s formula seems different, it is in fact a special case of the m-estimate, where pa =1/2 and m = √2. In the context of an experimental framework, we present an in-depth analysis of several probability estimation methods with respect to their mean absolute errors and demonstrate their potential advantages and disadvantages. We extend the analysis from single instance samples to samples with a moderate number of instances. We define small samples for the purpose of estimating probabilities as samples containing either less than four successes or less than four failures and justify the definition by analysing probability estimation errors on various sample sizes.

show abstract

Section: Historical Background and Related Workmentioning

confidence: 98%

Section: Historical Background and Related Workmentioning

confidence: 99%

Revisiting the Optimal Probability Estimator from Small Samples for Data Mining

Cestnik

2019

International Journal of Applied Mathematics and Computer Science

View full text Add to dashboard Cite

show abstract

“…Figure 3 shows the Category distribution based on λ, where the operating costs are classified into "low" and "high" by setting λ ¼ 0:69 based on Tables 2 and 4. The "high" operating cost set is denoted by columns 1, 2, 3, 4, 6,7,8,9,10,11,12,13,14,15,18, and 21 in Table 4. The "low" operating cost is represented by columns 5, 16, 17, 19, and 20 in Table 4.…”

Section: Case Studymentioning

confidence: 99%

“…In other words, the focus of the problem is how to estimate the lifecycle costs using small sample data. Relatively recent publications have provided some in-depth discussions regarding small sample estimation [4][5][6][7][8], where fuzzy clustering and support vector machine (SVM) have received special attentions [9][10][11]. Fuzzy clustering and SVM have been applied to address various problems through progression as the methodologies themselves advance, such as classification, regression, image classification, human activity, geo-marketing analysis, and drug discovery [12][13][14][15][16][17][18][19].…”

Section: Introductionmentioning

confidence: 99%

Methodologies for assessing costs of rail transit systems based on small sample data

Wang

Zhang

Chen

et al. 2015

International Journal of Rail Transportation

View full text Add to dashboard Cite

China has developed plans to build 87 mass transit rail lines, totalling 2500 km, in 25 cities from 2009 to 2015. The life-cycle costs of the urban rail transit systems have become the focus of both the government and the private sector involved in these large-scale investments. However, the availability of quality data has posed a major challenge to such life-cycle cost analyses; in other words, for any methodology to be effective, it must have the capability of working with very limited amount of available data, or small sample data. In this article, two cost assessment methodologies, fuzzy cluster and support vector machine, are proposed to analyse the life-cycle cost of urban rail transit systems based on small sample data. A case study featuring Line 1 of the Shijiazhuang urban rail transit system was employed to demonstrate the validity of the proposed methodologies. The analysis results indicate that the two assessment methodologies are valid for the life-cycle cost assessment of urban rail transit systems when only small sample data are available.

show abstract

“…Problems concerning the small sample size and pseudoinverse appear in the most recent works (Piegat and Landowski, 2012;Röbenack and Reinschke, 2011). We propose an extension of the last approach.…”

Section: Introductionmentioning

confidence: 99%

Linear discriminant analysis with a generalization of the Moore–Penrose pseudoinverse

Górecki

Łuczak

2013

International Journal of Applied Mathematics and Computer Science

View full text Add to dashboard Cite

The Linear Discriminant Analysis (LDA) technique is an important and well-developed area of classification, and to date many linear (and also nonlinear) discrimination methods have been put forward. A complication in applying LDA to real data occurs when the number of features exceeds that of observations. In this case, the covariance estimates do not have full rank, and thus cannot be inverted. There are a number of ways to deal with this problem. In this paper, we propose improving LDA in this area, and we present a new approach which uses a generalization of the Moore-Penrose pseudoinverse to remove this weakness. Our new approach, in addition to managing the problem of inverting the covariance matrix, significantly improves the quality of classification, also on data sets where we can invert the covariance matrix. Experimental results on various data sets demonstrate that our improvements to LDA are efficient and our approach outperforms LDA.

show abstract

Optimal estimator of hypothesis probability for data mining problems with small samples

Cited by 3 publications

References 23 publications

Revisiting the Optimal Probability Estimator from Small Samples for Data Mining

Revisiting the Optimal Probability Estimator from Small Samples for Data Mining

Methodologies for assessing costs of rail transit systems based on small sample data

Linear discriminant analysis with a generalization of the Moore–Penrose pseudoinverse

Contact Info

Product

Resources

About