2An experiment is designed to evaluate and compare precipitation forecasts from a 5-member, 4-km grid-spacing (ENS4) and a 15-member, 20-km grid-spacing (ENS20) Weather Research and Forecasting (WRF) model ensemble, which cover a similar domain over the central United States.The ensemble forecasts are initialized at 2100 UTC on 23 different dates and cover forecast lead times up to 33 hours. Previous work has demonstrated that simulations using convection-allowing resolution (CAR; dx ~ 4-km) have a better representation of the spatial and temporal statistical properties of convective precipitation than coarser models using convective parameterizations. In addition, higher resolution should lead to greater ensemble spread as smaller scales of motion are resolved. Thus, CAR ensembles should provide more accurate and reliable probabilistic forecasts than parameterized-convection resolution (PCR) ensembles.Computation of various precipitation skill metrics for probabilistic and deterministic forecasts reveals that ENS4 generally provides more accurate precipitation forecasts than ENS20, with the differences tending to be statistically significant for precipitation thresholds above 0.25 inches at forecast lead times of 9 to 21 hours (0600 -1800 UTC) for all accumulation intervals analyzed (1-, 3-, and 6-hr). In addition, an analysis of rank histograms and statistical consistency reveals that faster error growth in ENS4 eventually leads to more reliable precipitation forecasts in ENS4 than in ENS20. For the cases examined, these results imply that the skill gained by increasing to CAR outweighs the skill lost by decreasing the ensemble size. Thus, when computational capabilities become available, it will be highly desirable to increase the ensemble resolution from PCR to CAR, even if the size of the ensemble has to be reduced.
The diurnal cycles of rainfall in 5-km grid-spacing convection-resolving and 22-km grid-spacing nonconvection-resolving configurations of the Weather Research and Forecasting (WRF) model are compared to see if significant improvements can be obtained by using fine enough grid spacing to explicitly resolve convection. Diurnally averaged Hovmöller diagrams, spatial correlation coefficients computed in Hovmöller space, equitable threat scores (ETSs), and biases for forecasts conducted from 1 April to 25 July 2005 over a large portion of the central United States are used for the comparisons. A subjective comparison using Hovmöller diagrams of diurnally averaged rainfall show that the diurnal cycle representation in the 5-km configuration is clearly superior to that in the 22-km configuration during forecast hours 24-48. The superiority of the 5-km configuration is validated by much higher spatial correlation coefficients than in the 22-km configuration. During the first 24 forecast hours the 5-km model forecasts appear to be more adversely affected by model "spinup" processes than the 22-km model forecasts, and it is less clear, subjectively, which configuration has the better diurnal cycle representation, although spatial correlation coefficients are slightly higher in the 22-km configuration. ETSs in both configurations have diurnal oscillations with relative maxima occurring in both configurations at forecast hours corresponding to 0000-0300 LST, while biases also have diurnal oscillations with relative maxima (largest errors) in the 22-km (5-km) configuration occurring at forecast hours corresponding to 1200 (1800) LST. At all forecast hours, ETSs from the 22-km configuration are higher than those in the 5-km configuration. This inconsistency with some of the results obtained using the aforementioned spatial correlation coefficients reinforces discussion in past literature that cautions against using "traditional" verification statistics, such as ETS, to compare high-to low-resolution forecasts. ABSTRACTThe diurnal cycles of rainfall in 5-km grid-spacing convection-resolving and 22-km grid-spacing nonconvection-resolving configurations of the Weather Research and Forecasting (WRF) model are compared to see if significant improvements can be obtained by using fine enough grid spacing to explicitly resolve convection. Diurnally averaged Hovmöller diagrams, spatial correlation coefficients computed in Hovmöller space, equitable threat scores (ETSs), and biases for forecasts conducted from 1 April to 25 July 2005 over a large portion of the central United States are used for the comparisons. A subjective comparison using Hovmöller diagrams of diurnally averaged rainfall show that the diurnal cycle representation in the 5-km configuration is clearly superior to that in the 22-km configuration during forecast hours 24-48. The superiority of the 5-km configuration is validated by much higher spatial correlation coefficients than in the 22-km configuration. During the first 24 forecast hours the 5-km model for...
Probabilistic quantitative precipitation forecasts (PQPFs) from the storm-scale ensemble forecast system run by the Center for Analysis and Prediction of Storms during the spring of 2009 are evaluated using area under the relative operating characteristic curve (ROC area). ROC area, which measures discriminating ability, is examined for ensemble size n from 1 to 17 members and for spatial scales ranging from 4 to 200 km.Expectedly, incremental gains in skill decrease with increasing n. Significance tests comparing ROC areas for each n to those of the full 17-member ensemble revealed that more members are required to reach statistically indistinguishable PQPF skill relative to the full ensemble as forecast lead time increases and spatial scale decreases. These results appear to reflect the broadening of the forecast probability distribution function (PDF) of future atmospheric states associated with decreasing spatial scale and increasing forecast lead time. They also illustrate that efficient allocation of computing resources for convection-allowing ensembles requires careful consideration of spatial scale and forecast length desired.
Despite an increased understanding of the environments that favor tornado formation, a high false-alarm rate for tornado warnings still exists, suggesting that tornado formation could be a volatile process that is largely internal to each storm. To assess this, an ensemble of 30 supercell simulations was constructed based on small variations to the nontornadic and tornadic environmental profiles composited from the second Verification of the Origins of Rotation in Tornadoes Experiment (VORTEX2). All simulations produce distinct supercells despite occurring in similar environments. Both the tornadic and nontornadic ensemble members possess ample subtornadic surface vertical vorticity; the determinative factor is whether this vorticity can be converged and stretched by the low-level updraft. Each of the 15 members in the tornadic VORTEX2 ensemble produces a long-track, intense tornado. Although there are notable differences in the precipitation and near-surface buoyancy fields, each storm features strong dynamic lifting of surface air with vertical vorticity. This lifting is due to a steady low-level mesocyclone, which is linked to the ingestion of predominately streamwise environmental vorticity. In contrast, each nontornadic VORTEX2 simulation features a supercell with a disorganized low-level mesocyclone, due to crosswise vorticity in the lowest few hundred meters in the nontornadic environment. This generally leads to insufficient dynamic lifting and stretching to accomplish tornadogenesis. Even so, 40% of the nontornadic VORTEX2 ensemble members become weakly tornadic. This implies that chaotic within-storm details can still play a role and, occasionally, lead to marginally tornadic vortices in suboptimal storms.
Since 2003 the National Center for Atmospheric Research (NCAR) has been running various experimental convection-allowing configurations of the Weather Research and Forecasting Model (WRF) for domains covering a large portion of the central United States during the warm season (April-July). In this study, the skill of 3-hourly accumulated precipitation forecasts from a large sample of these convection-allowing simulations conducted during 2004-05 and 2007-08 is compared to that from operational North American Mesoscale (NAM) model forecasts using a neighborhood-based equitable threat score (ETS). Separate analyses were conducted for simulations run before and after the implementation in 2007 of positive-definite (PD) moisture transport for the NCAR-WRF simulations. The neighborhood-based ETS (denoted hETSir) relaxes the criteria for ''hits'' (i.e., correct forecasts) by considering grid points within a specified radius r. It is shown that hETSir is more useful than the traditional ETS because hETSir can be used to diagnose differences in precipitation forecast skill between different models as a function of spatial scale, whereas the traditional ETS only considers the spatial scale of the verification grid. It was found that differences in hETSir between NCAR-WRF and NAM generally increased with increasing r, with NCAR-WRF having higher scores. Examining time series of hETSir for r 5 100 and r 5 0 km (which simply reduces to the ''traditional'' ETS), statistically significant differences between NCAR-WRF and NAM were found at many forecast lead times for hETSi100 but only a few times for hETSi0. Larger and more statistically significant differences occurred with the 2007-08 cases relative to the 2004-05 cases. Because of differences in model configurations and dominant large-scale weather regimes, a more controlled experiment would have been needed to diagnose the reason for the larger differences that occurred with the 2007-08 cases. Finally, a compositing technique was used to diagnose the differences in the spatial distribution of the forecasts. This technique implied westward displacement errors for NAM model forecasts in both sets of cases and in NCAR-WRF model forecasts for the 2007-08 cases. Generally, the results are encouraging because they imply that advantages in convectionallowing relative to convection-parameterizing simulations noted in recent studies are reflected in an objective neighborhood-based metric. In this study, the skill of 3-hourly accumulated precipitation forecasts from a large sample of these convection-allowing simulations conducted during 2004-05 and 2007-08 is compared to that from operational North American Mesoscale (NAM) model forecasts using a neighborhood-based equitable threat score (ETS). Separate analyses were conducted for simulations run before and after the implementation in 2007 of positive-definite (PD) moisture transport for the NCAR-WRF simulations. The neighborhood-based ETS (denoted hETSi r ) relaxes the criteria for ''hits'' (i.e., correct forecasts) by co...
The possibility that Arctic sea ice loss weakens mid-latitude westerlies, promoting more severe cold winters, has sparked more than a decade of scientific debate, with apparent support from observations but inconclusive modelling evidence. Here we show that sixteen models contributing to the Polar Amplification Model Intercomparison Project simulate a weakening of mid-latitude westerlies in response to projected Arctic sea ice loss. We develop an emergent constraint based on eddy feedback, which is 1.2 to 3 times too weak in the models, suggesting that the real-world weakening lies towards the higher end of the model simulations. Still, the modelled response to Arctic sea ice loss is weak: the North Atlantic Oscillation response is similar in magnitude and offsets the projected response to increased greenhouse gases, but would only account for around 10% of variations in individual years. We further find that relationships between Arctic sea ice and atmospheric circulation have weakened recently in observations and are no longer inconsistent with those in models.
An experiment is described that is designed to examine the contributions of model, initial condition (IC), and lateral boundary condition (LBC) errors to the spread and skill of precipitation forecasts from two regional eight-member 15-km grid-spacing Weather Research and Forecasting (WRF) ensembles covering a 1575 km X 1800 km domain. It is widely recognized that a skillful ensemble [i.e., an ensemble with a probability distribution function (PDF) that generates forecast probabilities with high resolution and reliability] should account for both error sources. Previous work suggests that model errors make a larger contribution than IC and LBC errors to forecast uncertainty in the short range before synoptic-scale error growth becomes nonlinear. However, in a regional model with unperturbed LBCs, the infiltration of the lateral boundaries will negate increasing spread. To obtain a better understanding of the contributions to the forecast errors in precipitation and to examine the window of forecast lead time before unperturbed ICs and LBCs begin to cause degradation in ensemble forecast skill, the "perfect model" assumption is made in an ensemble that uses perturbed ICs and LBCs (PILB ensemble), and the "perfect analysis" assumption is made in another ensemble that uses mixed physics-dynamic cores (MP ensemble), thus isolating the error contributions. For the domain and time period used in this study, unperturbed ICs and LBCs in the MP ensemble begin to negate increasing spread around forecast hour 24, and ensemble forecast skill as measured by relative operating characteristic curves (ROC scores) becomes lower in the MP ensemble than in the PILB ensemble, with statistical significance beginning after forecast hour 69. However, degradation in forecast skill in the MP ensemble relative to the PILB ensemble is not observed in an analysis of deterministic forecasts calculated from each ensemble using the probability matching method. Both ensembles were found to lack statistical consistency (i.e., to be underdispersive), with the PILB ensemble (MP ensemble) exhibiting more (less) statistical consistency with respect to forecast lead time. Spread ratios in the PILB ensemble are greater than those in the MP ensemble at all forecast lead times and thresholds examined; however, ensemble variance in the MP ensemble is greater than that in the PILB ensemble during the first 24 h of the forecast. This discrepancy in spread measures likely results from greater bias in the MP ensemble leading to an increase in ensemble variance and decrease in spread ratio relative to the PILB ensemble.Keywords boundary conditions, boundary value problems, degradation, distribution functions, error analysis, errors, probability, probability distributions, random processes, research, risk assessment, statistical methods, sulfate minterals, weather forecasting, deterministic forecasting, ensemble forecasting ABSTRACT An experiment is described that is designed to examine the contributions of model, initial condition (IC), and lateral b...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.