argues convincingly for a paradigm shift in opinion polling. The current paradigm works on the basis that (truly) random samples are available, despite the fact that nonresponse and nonprobability samples have essentially consigned them to history. To ease this tension, the current paradigm has incorporated weighting adjustments on the premise that weighted nonrandom samples possess the useful properties of random ones. The problem, Bailey writes, is that weighting and related methods make strong assumptions, assumptions that are untenable in a polling context. Reading Bailey's article, I was struck by the relevance of his arguments to my line of work: biodiversity monitoring. Although there are rare cases where random samples are available to biodiversity monitors, the prevalence of nonprobability samples and incomplete uptake of sites (equivalent to nonresponse in polling) mean that they are the exception rather than the rule. Weighting and other adjustments are less common than in polling, but where they are used, their assumptions are similarly untenable. In this commentary, I expand on these points and pose the question of whether biodiversity monitoring, like polling, is due a paradigm shift.
Descriptive Inference as a Unifying ConceptAlthough very different subjects, opinion polling and biodiversity monitoring are both exercises in descriptive statistical inference, so similar statistical challenges arise. The first step is (or should be) to define a finite population comprising many 'observation units.' In polling, the units are people, and the population is all people in some geographic or political boundary (or a subset like registered voters). In biodiversity monitoring, it is simplest to think of the population as a landscape and the observation units as patches of land (sites) within that landscape. The goal in both disciplines is to estimate some parameter describing a variable of interest in the population. Pollsters are interested in parameters like the average political opinion in the electorate; for biodiversity monitors, it is parameters like the average abundance of some species across the landscape that are of interest. Calculating these parameters would be simple if data were available on every observation unit.Typically, however, it is not possible to survey every observation unit, so analysts must estimate them from data on a sample of observation units. Estimating population parameters from a sample-that is, inference-is most straightforward where the sample was selected randomly from the population.
Why Random Sampling Permits Straightforward InferenceRandom sampling has two useful properties that permit straightforward inference. The first is that it ensures that the distribution of the variable of interest in the sample mirrors its distribution in the population in expectation; that is, when averaged over many possible (but never realized) samples. Intuitively, if the distribution of the variable of interest in the sample mirrors its population distribution, then sample-based statisti...