Zeros (i.e. events that do not happen) are the source of two common phenomena in count data: overdispersion and zero‐inflation. Zeros have multiple origins in a dataset: false zeros occur due to errors in the experimental design or the observer; structural zeros are related to the ecological or evolutionary restrictions of the system under study; and random zeros are the result of the sampling variability. Identifying the type of zeros and their relation with overdispersion and/or zero inflation is key to select the most appropriate statistical model.
Here we review the different modelling options in relation to the presence of overdispersion and zero inflation, tested through the dispersion and zero inflation indices. We then examine the theory of the zero‐inflated (ZI) models and the use of the score tests to assess overdispersion and zero inflation over a model.
In order to choose an adequate model when analysing count data we suggest the following protocol: Step 1) classify the zeros and minimize the presence of false zeros; Step 2) identify suitable covariates; Step 3) test the data for overdispersion and zero‐inflation and Step 4) choose the most adequate model based on the results of step 3 and use score tests to determine whether more complex models should be implemented.
We applied the recommended protocol on a real dataset on plant–herbivore interactions to evaluate the suitability of six different models (Poisson, NB and their zero‐inflated versions—ZIP, ZINB). Our data were overdispersed and zero‐inflated, and the ZINB was the model with the best fit, as predicted.
Ignoring overdispersion and/or zero inflation during data analyses caused biased estimates of the statistical parameters and serious errors in the interpretation of the results. Our results are a clear example on how the conclusions of an ecological hypothesis can change depending on the model applied. Understanding how zeros arise in count data, for example identifying the potential sources of structural zeros, is essential to select the best statistical design. A good model not only fits the data correctly but also takes into account the idiosyncrasies of the biological system.
Summary• We studied the effects of Cistus albidus leaf leachates on nitrogen-cycling processes in two siliceous soils (granite and schist) and one calcareous soil. We compared those effects with gross N-transformation rates in soils sampled underneath Cistus .• Soils amended with leachates and soils sampled under Cistus had higher NH 4 + immobilization and lower nitrification compared with control soils. Gross N mineralization increased under Cistus but decreased in soils amended with leachates. These effects were especially strong in granite soil.• To determine whether phenolic compounds were causing those effects, we incubated granite soils with leachate and a leachate fraction containing only nonphenolic compounds. Nonphenolic compounds increased NH 4 + immobilization and decreased gross nitrification, while decreases in gross N mineralization were estimated to be caused by phenolic compounds.• Our results show that although phenolic compounds leached from green foliage changed gross N mineralization, their effects on net N rates were eclipsed by the changes produced by polar nonphenolic compounds such as carbohydrates. Plant nonphenolic compounds may drive N cycling under Cistus .
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.