Abstract:Multivariate statistical approaches have been increasingly applied in hemorrhagic stroke data analysis. Nevertheless, several aspects regarding their relevance and validity in respect of the application of data transformations have not been studied in details. This paper examines the effects of different data transformations in the standard statistical methods of the multivariate analysis of the intracerebral hemorrhage (ICH) parameters in small group samples. Two different methods for data transformations (lo… Show more
“…Logarithmic transformation is a simpler and more common solution for analyzing heteroscedastic variables for which the variance increases with the mean (Snedecor and Cochran 1989, Draper and Smith 1998), and log–log regression relating one such variable to another is widely applied in water quality analyses (Helsel and Hirsch 2002). However, statisticians have documented general problems with log transformation, including failure to eliminate heteroscedasticity and difficulty applying parameter estimates or hypothesis tests back to the untransformed variables (Feng et al 2013, 2014, Choi 2016, Greenacre 2016, Rendevski et al 2016, Curran‐Everett 2018, Ekwaru and Veugelers 2018). We chose the bootstrapping approach instead of log–log regression because it handled our heteroscedastic variables without causing these problems and because bootstrapping gave other benefits, such as quantifying sampling uncertainty.…”
Stream water quality data are essential for understanding watershed processes and managing water pollution, but the effort and expense of stream monitoring limit how many watersheds can be studied. For 59 small watersheds in the Chesapeake Bay drainage, we compared water quality measurements from inexpensive spot sampling to data from costly automated monitoring that used 1–3 yr of continuous flow measurement and weekly, temporally composited water sampling. Mean nitrogen (N) levels ranged from 0.01 to 16 mg N/L among streams. There were important temporal variations in N concentrations at each site, but the differences among sites were much greater. Spot samples were very effective at accurately and precisely placing average stream N levels within the N gradient among streams draining N‐enriched watersheds. Among watersheds, nitrate (NO3) and total N concentrations from spot samples were very strongly correlated with means from weekly composite sampling (R2 > 97%). We confirmed this result for independent data for 85 larger watersheds in the Chesapeake Bay Non‐tidal Network. NO3 concentration from a single March spot sample was highly correlated (R2 > 92%) with flow‐weighted average total N concentration synthesized from five years of monitoring. Spot sampling effectively quantifies average N status across N‐enriched watersheds because most N moves as dissolved NO3 in subsurface flow and that flux is much less variable than the episodic surface transport of particulate materials. For questions answered by quantifying average N levels, spot sampling can assess more watersheds at much lower cost than automated sampling, so it should be more widely used to support cost‐effective N research and management. For materials that are mainly bound to particulates, such as phosphorus, spot sampling is much less effective.
“…Logarithmic transformation is a simpler and more common solution for analyzing heteroscedastic variables for which the variance increases with the mean (Snedecor and Cochran 1989, Draper and Smith 1998), and log–log regression relating one such variable to another is widely applied in water quality analyses (Helsel and Hirsch 2002). However, statisticians have documented general problems with log transformation, including failure to eliminate heteroscedasticity and difficulty applying parameter estimates or hypothesis tests back to the untransformed variables (Feng et al 2013, 2014, Choi 2016, Greenacre 2016, Rendevski et al 2016, Curran‐Everett 2018, Ekwaru and Veugelers 2018). We chose the bootstrapping approach instead of log–log regression because it handled our heteroscedastic variables without causing these problems and because bootstrapping gave other benefits, such as quantifying sampling uncertainty.…”
Stream water quality data are essential for understanding watershed processes and managing water pollution, but the effort and expense of stream monitoring limit how many watersheds can be studied. For 59 small watersheds in the Chesapeake Bay drainage, we compared water quality measurements from inexpensive spot sampling to data from costly automated monitoring that used 1–3 yr of continuous flow measurement and weekly, temporally composited water sampling. Mean nitrogen (N) levels ranged from 0.01 to 16 mg N/L among streams. There were important temporal variations in N concentrations at each site, but the differences among sites were much greater. Spot samples were very effective at accurately and precisely placing average stream N levels within the N gradient among streams draining N‐enriched watersheds. Among watersheds, nitrate (NO3) and total N concentrations from spot samples were very strongly correlated with means from weekly composite sampling (R2 > 97%). We confirmed this result for independent data for 85 larger watersheds in the Chesapeake Bay Non‐tidal Network. NO3 concentration from a single March spot sample was highly correlated (R2 > 92%) with flow‐weighted average total N concentration synthesized from five years of monitoring. Spot sampling effectively quantifies average N status across N‐enriched watersheds because most N moves as dissolved NO3 in subsurface flow and that flux is much less variable than the episodic surface transport of particulate materials. For questions answered by quantifying average N levels, spot sampling can assess more watersheds at much lower cost than automated sampling, so it should be more widely used to support cost‐effective N research and management. For materials that are mainly bound to particulates, such as phosphorus, spot sampling is much less effective.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.