2017
DOI: 10.1111/tgis.12286
|View full text |Cite
|
Sign up to set email alerts
|

Terra Populus’ architecture for integrated big geospatial services

Abstract: Big geospatial data is an emerging sub-area of geographic information science, big data, and cyberinfrastructure. Big geospatial data poses two unique challenges to these and other cognate disciplines. First, raster and vector data structures and analyses have developed on largely separate paths for the last twenty years and this creates an impediment to researchers utilizing big data platforms that do not promote the integration for these classes. Second, big spatial data repositories have yet to be integrate… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 13 publications
(22 citation statements)
references
References 32 publications
(35 reference statements)
0
22
0
Order By: Relevance
“…The geospatial big data distributed cluster solution is built using the open-source projects Greenplum and PostGIS [47]. Greenplum's architecture ( Figure 2) uses massively parallel processing (MPP).…”
Section: Data Clusteringmentioning
confidence: 99%
“…The geospatial big data distributed cluster solution is built using the open-source projects Greenplum and PostGIS [47]. Greenplum's architecture ( Figure 2) uses massively parallel processing (MPP).…”
Section: Data Clusteringmentioning
confidence: 99%
“…PostGIS allows for the storage and analysis of both vector and raster data types. Haynes et al (2017) discusses some of the obstacles encountered with high-performance computing using both spatial data types.…”
Section: Ipums-terramentioning
confidence: 99%
“…Raster summarizations were initially degrading the performance of our system as they are computationally expensive. Haynes (2017) discusses why query performance times of raster analyses with PostgreSQL vary greatly and we have implemented a data caching system to reduce additional calculations. The caching key is generated from the following variables: geographic level, temporal time point or range, and the raster variable and raster operation (e.g., minimum value, maximum value).…”
Section: Ipums-terra Solutions For Big Data Integrationmentioning
confidence: 99%
“…This paper studies the zonal statistics problem which combines raster data, e.g., temperature, with vector data, e.g., city boundaries, to compute aggregate values for each polygon, e.g., average temperature in each city. This problem has several applications including the study by ecologists on the effect of vegetation and temperature on human settlement [3,4], analyzing terabytes of socio-economic and environmental data [5,6], and studying of land use and land cover classification [7]. It can also be used for areal interpolation [8] and to assess the risk of wildfires [9].…”
Section: Introductionmentioning
confidence: 99%
“…Traditional methods to process the zonal statistics problem focused on either vectorizing the raster dataset [16] or rasterizing the vector data [5]. The first approach converts each pixel to a point and then runs a spatial join with polygons using a point-in-polygon predicate [16].…”
Section: Introductionmentioning
confidence: 99%