Plot of 95-percent parametric and nonparametric prediction intervals for an individual estimate of total dissolved solids concentration resulting from the regression model fit between total dissolved solids and the log of discharge for the Cuyahoga River data from example 9.
We consider the appropriateness of "rating curves" and other log linear models to estimate the fluvial transport of nutrients. Split-sample studies using data from tributaries to the Chesapeake Bay reveal that a minimum variance unbiased estimator (MVUE), based on a simple log linear model, provides satisfactory load estimates, even in some cases where the model exhibited significant lack of fit. For total nitrogen (TN) the average difference between the MVUE estimates and the observed loads ranges from -8% to +2% at the four sites. The corresponding range for total phosphorus (TP) is -6% to +5%. None of these differences is statistically significant. The observed variability of the MVUE load estimates for TN and TP, which ranges from 7% to 25% depending on the case, is accurately predicted by statistical theory.
Several recent articles have called attention to the problem of retransformation bias, which can arise when log linear regression models are used to estimate sediment or other constituent loads. In some cases the bias can lead to underestimation of constituent loads by as much as 50%, and several procedures have been suggested for reducing or eliminating it. However, some of the procedures recommended for reducing the bias can actually increase it. This paper compares the bias and variance of three procedures that can be used with log linear regression models: the traditional rating curve estimator, a modified rating curve method, and a minimum variance unbiased estimator (MVUE). Analytical derivations of the bias and efficiency of all three estimators are presented. It is shown that for many conditions the traditional and the modified estimator can provide satisfactory estimates. However, other conditions exist where they have substantial bias and a large mean square error. These conditions commonly occur when sample sizes are small, or when loads are estimated during high-flow conditions. The MVUE, however, is unbiased and always performs nearly as well or better than the rating curve estimator or the modified estimator provided that the hypothesis of the log linear model is correct. Since an efficient unbiased estimator is available, there seems to be no reason to employ biased estimators.
Three methods of fitting straight lines to data are described and their purposes are discussed and contrasted in terms of their applicability in various water resources contexts. The three methods are ordinary least squares (OLS), least normal squares (LNS), and the line of organic correlation (OC). In all three methods the parameters are based on moment statistics of the data. When estimation of an individual value is the objective, OLS is the most appropriate. When estimation of many values is the objective and one wants the set of estimates to have the appropriate variance, then OC is most appropriate. When one wishes to describe the relationship between two variables and measurement error is unimportant, then OC is most appropriate. Whee the error is important in descriptive problems or in calibration problems, then structural analysis techniques may be most appropriate. Finally, if the problem is one of describing some geographic trajectory, then LNS is most appropriate.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.