High-resolution population distribution data are critical for successfully addressing important issues ranging from socio-environmental research to public health to homeland security, since scientific analyses, operational activities, and policy decisions are significantly influenced by the number of impacted people. Dasymetric modeling has been a well-recognized approach for spatial decomposition of census data to increase the spatial resolution of population distribution. However, enhancing the temporal resolution of population distribution poses a greater challenge. In this paper, we discuss the development of LandScan USA, a multi-dimensional dasymetric modeling approach, which has allowed the creation of a very high-resolution population distribution data both over space and time. At a spatial resolution of 3 arc seconds (*90 m), the initial LandScan USA database contains both a nighttime residential as well as a baseline daytime population distribution that incorporates movement of workers and students. Challenging research issues of disparate and misaligned spatial data and modeling to develop a database at a national scale, as well as model verification and validation approaches are illustrated and discussed. Initial analyses indicate a high degree of locational accuracy for LandScan USA distribution model and data. High-resolution population data such as LandScan USA, which describes both distribution and dynamics of human population, clearly has the potential to profoundly impact multiple domain applications of national and global priority.
The objective of this research is to compute population estimates by age and sex for small areas whose boundaries are different from those for which the population counts were made. In our approach, population surfaces and age‐sex proportion surfaces are separately estimated. Age‐sex population estimates for small areas and their confidence intervals are then computed using a binomial model with the two surfaces as inputs. The approach was implemented for Iowa using a 90 m resolution population grid (LandScan USA) and U.S. Census 2000 population. Three spatial interpolation methods, the areal weighting (AW) method, the ordinary kriging (OK) method, and a modification of the pycnophylactic method, were used on Census Tract populations to estimate the age‐sex proportion surfaces. To verify the model, age‐sex population estimates were computed for paired Block Groups that straddled Census Tracts and therefore were spatially misaligned with them. The pycnophylactic method and the OK method were more accurate than the AW method. The approach is general and can be used to estimate subgroup‐count types of variables from information in existing administrative areas for custom‐defined areas used as the spatial basis of support in other applications.
Geospatial data sciences have emerged as critical requirements for high-priority application solutions in diverse areas, including, but not limited to, the mitigation of natural and man-made disasters.
Geospatial technologies and digital data have developed and disseminated rapidly in conjunction with increasing computing efficiency and Internet availability. The ability to store and transmit large datasets has encouraged the development of national infrastructure datasets in geospatial formats. National datasets are used by numerous agencies for analysis and modeling purposes because these datasets are standardized and considered to be of acceptable accuracy for national scale applications. At Oak Ridge National Laboratory a population model has been developed that incorporates national schools data as one of the model inputs. This paper evaluates spatial and attribute inaccuracies present within two national school datasets, Tele Atlas North America and National Center of Education Statistics (NCES).Schools are an important component of the population model, because they are spatially dense clusters of vulnerable populations. It is therefore essential to validate the quality of school input data. Schools were also chosen since a validated schools dataset was produced in geospatial format for Philadelphia County; thereby enabling a comparison between a local dataset and the national datasets.Analyses found the national datasets are not standardized and incomplete, containing 76 to 90 percent of existing schools. The temporal accuracy of updating annual enrollment values resulted in 89 percent inaccuracy for 2003. Spatial rectification was required for 87 percent of NCES points, of which 58 percent of the errors were attributed to the geocoding process. Lastly, it was found that by combining the two national datasets, the resultant dataset provided a more useful and accurate solution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.