Wildfires are increasing in scale, frequency and longevity, and are affecting new locations as environmental conditions change. This paper presents a dataset collected during a community evacuation drill performed in Roxborough Park, Colorado (USA) in 2019. This is a wildland–urban interface community including approximately 900 homes. Data concerning several aspects of community response were collected through observations and surveys: initial population location, pre-evacuation times, route use, and arrival times at the evacuation assembly point. Data were used as inputs to benchmark two evacuation models that adopt different modelling approaches. The WUI-NITY platform and the Evacuation Management System model were applied across a range of scenarios where assumptions regarding pre-evacuation delays and the routes used were varied according to original data collection methods (and interpretation of the data generated). Results are mostly driven by the assumptions adopted for pre-evacuation time inputs. This is expected in communities with a low number of vehicles present on the road and relatively limited traffic congestion. The analysis enabled the sensitivity of the modelling approaches to different datasets to be explored, given the different modelling approaches adopted. The performance of the models were sensitive to the data employed (derived from either observations or self-reporting) and the evacuation phases addressed in them. This indicates the importance of monitoring the impact of including data in a model rather than simply on the data itself, as data affects models in different ways given the modelling methods employed. The dataset is released in open access and is deemed to be useful for future wildfire evacuation modelling calibration and validation efforts.