Abstract:A finite set is "hidden" if its elements are not directly enumerable or if its size cannot be ascertained via a deterministic query. In public health, epidemiology, demography, ecology and intelligence analysis, researchers have developed a wide variety of indirect statistical approaches, under different models for sampling and observation, for estimating the size of a hidden set. Some methods make use of random sampling with known or estimable sampling probabilities, and others make structural assumptions abo… Show more
“…The German tank problem in other contexts. The Bayesian probability theory used to solve the German tank problem applies (perhaps with modification) to many other contexts in which we wish to estimate the size of some finite hidden set [ 9 ], such as the number of taxicabs in a city [ 19 , 23 ], racing cars on a track [ 48 ], accounts at a bank [ 25 ], pieces of furniture purchased by a university [ 22 ], aircraft operations at an airport [ 34 ], cases in court [ 50 ], or electronic devices produced by a company [ 2 ]. And also the extent of leaked classified government communications [ 18 ], the time needed to complete a project deadline [ 16 ], the time-coverage of historical records of extreme events like floods [ 39 ], the length of a short-tandem repeat allele [ 51 ], the size of a social network [ 28 ], the lifetime of a flower of a plant [ 38 ], or the duration of existence of a species [ 42 ].…”
“…The German tank problem in other contexts. The Bayesian probability theory used to solve the German tank problem applies (perhaps with modification) to many other contexts in which we wish to estimate the size of some finite hidden set [ 9 ], such as the number of taxicabs in a city [ 19 , 23 ], racing cars on a track [ 48 ], accounts at a bank [ 25 ], pieces of furniture purchased by a university [ 22 ], aircraft operations at an airport [ 34 ], cases in court [ 50 ], or electronic devices produced by a company [ 2 ]. And also the extent of leaked classified government communications [ 18 ], the time needed to complete a project deadline [ 16 ], the time-coverage of historical records of extreme events like floods [ 39 ], the length of a short-tandem repeat allele [ 51 ], the size of a social network [ 28 ], the lifetime of a flower of a plant [ 38 ], or the duration of existence of a species [ 42 ].…”
“…To investigate this, we assume a model for link formation in the general population, then consider the distribution of the estimators over simple random samples from that population without replacement. The binomial model can be viewed as an approximation to the Erdős-Rényi network model, in which the presence or absence of a link between each pair of nodes is independently drawn from the same Bernoulli (p) distribution (for details, see the online supplement and Cheng, Eck, and Crawford 2020). In this study, we generalize this model to incorporate barrier effects by using a stochastic blockmodel (SBM) that partitions the general population into two mutually exclusive groups, the hard-to-reach group of interest (H) and everyone else (L).…”
Section: A Framework For Studying Estimator Behavior Under Barrier Ef...mentioning
The network scale-up method (NSUM) is a cost-effective approach to estimating the size or prevalence of a group of people that is hard to reach through a standard survey. The basic NSUM involves two steps: estimating respondents’ degrees and estimating the prevalence of the hard-to-reach population of interest using respondents’ estimated degrees and the number of people they report knowing in the hard-to-reach group. Each of these two steps involves taking either an average of ratios or a ratio of averages. Using the ratio of averages for each step has so far been the most common approach. However, the authors present theoretical arguments that using the average of ratios at the second, prevalence-estimation step often has lower mean squared error when the random mixing assumption is violated, which seems likely in practice; this estimator was proposed early in NSUM development but has largely been unexplored and unused. Simulation results using an example network data set also support these findings. On the basis of this theoretical and empirical evidence, the authors suggest that future surveys that use a simple estimator may want to use this mixed estimator, and estimation methods based on this estimator may produce new improvements.
“…The German tank problem in other contexts. The Bayesian probability theory to solve the German tank problem applies (perhaps, with modification) to many other contexts where we wish to estimate the size of some finite, hidden set [27], eg. : the number of taxicabs in a city [12], the number of accounts at a bank [15], the number of furniture pieces purchased by a university [10], the number of aircraft operations at an airport [28], the extent of leaked classified government communications [29], the time needed to complete a project deadline [30], the time-coverage of historical records of extreme events like floods [31], the length of a short-tandem repeat allele [32], the size of a social network [33], the number of cases in court [34], the lifetime of a flower of a plant [35], or the duration of existence of a species [36].…”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.