Tirthankar Roy scite author profile

Tirthankar Roy

3 Publications

1 Citation Statement Received

63 Citation Statements Given

Affiliations

University of Nebraska–Lincoln, Indian Statistical Institute, Belgoprocess (Belgium)

Publications

Computing Accurate Probabilistic Estimates of One-D Entropy from Equiprobable Random Samples

Gupta, Ehsani, Roy, et al. 2021

Entropy

We develop a simple Quantile Spacing (QS) method for accurate probabilistic estimation of one-dimensional entropy from equiprobable random samples, and compare it with the popular Bin-Counting (BC) and Kernel Density (KD) methods. In contrast to BC, which uses equal-width bins with varying probability mass, the QS method uses estimates of the quantiles that divide the support of the data generating probability density function (pdf) into equal-probability-mass intervals. And, whereas BC and KD each require optimal tuning of a hyper-parameter whose value varies with sample size and shape of the pdf, QS only requires specification of the number of quantiles to be used. Results indicate, for the class of distributions tested, that the optimal number of quantiles is a fixed fraction of the sample size (empirically determined to be ~0.25–0.35), and that this value is relatively insensitive to distributional form or sample size. This provides a clear advantage over BC and KD since hyper-parameter tuning is not required. Further, unlike KD, there is no need to select an appropriate kernel-type, and so QS is applicable to pdfs of arbitrary shape, including those with discontinuous slope and/or magnitude. Bootstrapping is used to approximate the sampling variability distribution of the resulting entropy estimate, and is shown to accurately reflect the true uncertainty. For the four distributional forms studied (Gaussian, Log-Normal, Exponential and Bimodal Gaussian Mixture), expected estimation bias is less than 1% and uncertainty is low even for samples of as few as 100 data points; in contrast, for KD the small sample bias can be as large as -10% and for BC as large as -50%. We speculate that estimating quantile locations, rather than bin-probabilities, results in more efficient use of the information in the data to approximate the underlying shape of an unknown data generating pdf.
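The core idea in the abstract, replacing equal-width bins with equal-probability-mass intervals, can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `quantile_spacing_entropy`, the use of `numpy.quantile` with its default interpolation, and the choice of the sample minimum and maximum as the outer interval edges are all assumptions made for illustration. With K intervals each holding mass 1/K and widths Δ_i, the density in interval i is approximated by 1/(K Δ_i), which gives the estimate H ≈ log K + (1/K) Σ log Δ_i. This simplified version is not expected to reproduce the bias figures reported in the abstract.

```python
import numpy as np


def quantile_spacing_entropy(samples, fraction=0.25):
    """Rough quantile-spacing entropy estimate in nats (illustrative only).

    `fraction` sets the number of quantile intervals as a share of the
    sample size; 0.25 is taken from the ~0.25-0.35 range in the abstract.
    """
    x = np.asarray(samples, dtype=float)
    if x.ndim != 1 or x.size < 4:
        raise ValueError("expected a 1-D array with at least 4 samples")

    # Number of equal-probability-mass intervals (hypothetical rounding rule).
    k = max(2, int(round(fraction * x.size)))

    # Interval edges are the empirical quantiles at 0, 1/k, ..., 1.
    edges = np.quantile(x, np.linspace(0.0, 1.0, k + 1))
    widths = np.diff(edges)
    if np.any(widths <= 0):
        raise ValueError("tied values produce zero-width intervals")

    # Each interval has mass 1/k, so density ~ 1 / (k * width) and
    # H ~ -sum (1/k) * log(1 / (k * width)) = log(k) + mean(log(width)).
    return float(np.log(k) + np.mean(np.log(widths)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    data = rng.normal(size=10_000)
    exact = 0.5 * np.log(2.0 * np.pi * np.e)  # entropy of a standard normal
    print("estimate:", quantile_spacing_entropy(data))
    print("exact:   ", exact)
```

Because the only setting is the number of intervals, the sketch has no bin width or kernel bandwidth to tune, which is the practical advantage the abstract claims for QS over BC and KD.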

Measurement of fuzziness: A general approach

Chakravarty, Roy 1985

Theor Decis

Fuzzy optimization and nuclear production processes

Trauwaert, Reynders, Roy 1995

Fuzzy Sets and Systems
