Estimating Socioeconomic Status via Temporal-Spatial Mobility Analysis - A Case Study of Smart Card Data

Ding, Shichang; Huang, Hong; Zhao, Tao; Fu, Xiaoming

doi:10.1109/icccn.2019.8847051

Cited by 19 publications

(11 citation statements)

References 33 publications

(58 reference statements)


“…To reduce the manpower needed for fine-grained income surveys and to speed up fine-grained income data collection, researchers have used house price as a proxy for income. Previous studies have identified a positive correlation between house price and income [23][24][25][26][27][28][29], whilst house price data are easily accessible and downloadable online in the developed world. However, estimation models that depend on house price as the input and income as the output have yielded a low estimation accuracy.…”

mentioning

confidence: 99%

Siamese-Like Convolutional Neural Network for Fine-Grained Income Estimation of Developed Economies

Bai

Lam

2020

IEEE Access


Estimating the per-capita income and the household income at a fine-grained geographical scale is critical but challenging, even across the developed economies. In this paper, a novel Siamese-like Convolutional Neural Network, integrating Ridge Regression and Gaussian Process Regression, has been developed for fine-grained estimation of income across different parts of New York City. Our model (the GP-Mixed-Siamese-like-Double-Ridge model) makes good use of the pairwise comparison of location-based house price information, daytime satellite image, street view and spatial location information as the inputs. Taking the per-capita income and the median household income in New York City as the ground truths, our model outperforms (R² = 0.72-0.86 for five-fold validation) other state-of-the-art income estimation models and achieves good performance in cross-district and cross-scale validation. We also find that models which partially share our model architecture, including the Spatial-Information-GP and the Mixed-Siamese-like model, perform well under certain spatial granularity and data availability. Since such models rely on less data input types and simpler architectures, they can be used to save resources on data collection and model training. Hence, using our model for fine-grained income estimation does not mean excluding these models that share similar architectures. Our fine-grained income estimation model can allow the per-capita and the household income data generated in fine-grained resolution to couple with other types of data, such as the air pollution or the epidemic data, of the same scale, to ensure that any location-specific socio-economic-related study and evidence-based decision-making at the fine-grained resolution can be conducted. Future research will focus on extending our model for fine-grained income estimation in developing metropolises, and for developing other socio-economic indicators.

INDEX TERMS: Daytime satellite image, developed metropolis, fine-grained resolution, GP-Mixed-Siamese-like-Double-Ridge model, house price, household income, per-capita income, Siamese-like Convolutional Neural Network, street view

I. INTRODUCTION

Measuring income¹ distribution at a high spatial resolution is critical but challenging, even for developed economies [1-3]. Accurate income data are mainly obtained from field surveys, which can be highly capital intensive [2]. Over the past few decades, attempts have been made to overcome data scarcity and to estimate fine-grained income distribution across developing or non-urban areas [4][5][6][7]. Few studies have attempted to make good use of proxy data and deep learning ...

¹ According to the definition of American Community Survey, "Total income" refers to the sum of incomes reported separately for wage or salary income; net self-employment income; interest, dividends, or net rental or royalty income, or income from estates and trusts; Social Security or Railroad Retirement Income; Supplemental Security Income (SSI); public assistance or welfare payments; retireme...


“…Lately, Zhang and Cheng (2018) explored inferring demographics by leveraging a variety of spatial and temporal features extracted from the raw transaction records. Ding, Huang, Zhao and Fu (2019) developed a deep learning model to estimate socioeconomic status using temporal-sequential features and general statistical features generated from SC data. However, the success of these works heavily relied on elaborated feature engineering.…”

Section: Demographic Inference Using Geo-tagged Data

mentioning

confidence: 99%

You are how you travel: A multi-task learning framework for Geodemographic inference using transit smart card data

Zhang

Aslam

Lai

et al. 2020

Computers, Environment and Urban Systems


“…Another important user-generated data type is mobile phone data. However, most of the existing studies only focus on group-level SES inference (at least until the acceptance of our work [25] in 2019). Soto et al explore how to use information derived from the aggregated use of cell phone records to identify the socioeconomic levels of a population [87].…”

Section: SES Estimation Based On Cell Phone Data

mentioning

confidence: 99%

“…And in the end, researchers utilize a commonly-used machine learning algorithm, the ridge regression model, to predict the logged income of Facebook users.

tweets: SES
[73] tweets: income
[58] tweets: income
[93] tweets: education, income
[4] tweets: occupation, income
[40] tweets: income
[94] tweets: education, income
[95] tweets: education, income
[14] tweets: family income
[61] Facebook Likes: income
[13] mobile phone metadata: personal income
[87] mobile phone records: SES
[29] mobile phone call detail records: income
[12] mobile phone metadata: income
[90] mobile phone metadata: income
[8] cookie: income, education level
[68] retail transaction records: income, education level
[96] retail transaction records: income, education level
[25] smart card transportation records: SES
[74] WiFi log: education, income…”

Section: SES Inference Based On Social Media Data

mentioning

confidence: 99%

“…We also collected the volunteers' checkin data on a famous online social network platform called QQ. Inspired by [25], we combine the most visited check-in location during the night and collected home location to calculate the latitude and longitude of a person's home. Among 32,443 volunteers, 4,509 of them reported at least one socioeconomic attribute and agreed to share their home location for research purposes.…”

Section: Ground Truth Dataset

mentioning

confidence: 99%


User Attribute Inference via Mining User-Generated Data

Shichang¹

Self Cite
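The "Ground Truth Dataset" statement quoted above describes combining a volunteer's most visited night-time check-in location with a collected home location. The excerpt does not say how the two are combined, so the following is only a minimal sketch under stated assumptions: the night window (22:00 to 06:00), the grid precision, the function names, and the midpoint rule are all hypothetical and are not taken from the cited work.

```python
from collections import defaultdict
from datetime import datetime


def is_night(ts, night_start=22, night_end=6):
    """Return True if the hour falls in the assumed night window (22:00-06:00)."""
    return ts.hour >= night_start or ts.hour < night_end


def most_visited_night_location(checkins, precision=3):
    """Mean (lat, lon) of the most frequent night-time grid cell, or None.

    checkins: iterable of (datetime, lat, lon) tuples.
    precision: decimal places used to bucket coordinates into cells (assumed).
    """
    cells = defaultdict(list)
    for ts, lat, lon in checkins:
        if is_night(ts):
            key = (round(lat, precision), round(lon, precision))
            cells[key].append((lat, lon))
    if not cells:
        return None
    points = max(cells.values(), key=len)  # cell with the most night check-ins
    n = len(points)
    return (sum(p[0] for p in points) / n, sum(p[1] for p in points) / n)


def estimate_home(checkins, reported_home=None):
    """Combine the night-time estimate with a reported home location.

    Assumption: when both are available, take the plain midpoint. The quoted
    statement does not specify the combination rule.
    """
    night = most_visited_night_location(checkins)
    if night is None:
        return reported_home
    if reported_home is None:
        return night
    return ((night[0] + reported_home[0]) / 2,
            (night[1] + reported_home[1]) / 2)
```

A call such as `estimate_home(checkins, reported_home=(30.0032, 114.0032))` returns a single `(lat, lon)` pair. Rounding coordinates into cells is one simple way to treat nearby check-ins as the same place; a clustering step could serve the same purpose.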
