Abstract:Although many studies have investigated the non-linear relationship between the built environment and rail patronage, it remains unclear whether this influence is equally applicable to primary and secondary school students due to their physiological characteristics and cognitive limitations. This study applies the GBDT model to Wuhan student metro swipe data in order to investigate the relative importance and non-linear association of the built environment on the school-commuting metro ridership. The results s… Show more
“…Its goal is to simulate real values through the minimum loss function, iterate multiple times, and reduce the prediction error. Compared to ordinary regression models, GBDT has two advantages [18]: (1) it does not demand normality of data and can accommodate variables with missing values; (2) it solves the problem of multicollinearity within the data serves to avert the intercorrelation among the built environment indicators.…”
“…From the perspective of the urban built environment, with the massive generation of urban crowdsourcing data, research based on the built environment within the life circle has gradually become enriched. Its content has expanded from traditional planning data such as land use type [17,18] to the functional properties represented by points of interest (POI) [19,20], traffic network data [21,22], pedestrian vitality data based on location-based services (LBSs) [23,24], blue-green space areas [25], etc. With the increased popularity of various social media application, "checking in" at scenic spots and places of consumption has become a trend among young people, which has yielded a new type of open source data with a strong tendency toward consumption behavior, such as from Weibo, Xiaohongshu, Meituan, etc.…”
Section: Introductionmentioning
confidence: 99%
“…The rapid development of artificial intelligence (AI) has led to the application of neural networks (NNs) in the field of spatial information mining, which has led to a new method suitable for analyzing, mining, and extracting crowdsourcing data, i.e., machine learning (ML). Machine learning has matured in geoscience research, and the gradient boosting decision tree (GBDT) algorithm has shown excellent applicability for the information mining of urban crowdsourcing data and nonlinear relationships in the living environment [18,50]. On the one hand, its parameter variability can overcome the difficulty in fitting the hedonic model under multimodality [22,51].…”
Determining the optimal planning scale for urban life circles and analyzing the associated built environment factors are crucial for comprehending and regulating residential differentiation. This study aims to bridge the current research void concerning the nonlinear hierarchical relationships between the built environment and residential differentiation under the multiscale effect. Specifically, six indicators were derived from urban crowdsourcing data: diversity of built environment function (DBEF1), density of built environment function (DBEF2), blue–green environment (BGE), traffic accessibility (TA), population vitality (PV), and shopping vitality (SV). Then, a gradient boosting decision tree (GBDT) was applied to derive the analysis of these indicators. Finally, the interpretability of machine learning was leveraged to quantify the relative importance and nonlinear relationships between built environment indicators and housing prices. The results indicate a hierarchical structure and inflection point effect of the built environment on residential premiums. Notably, the impact trend of the built environment on housing prices within a 15 min life circle remains stable. The effect of crowd behavior, as depicted by PV and SV, on housing prices emerges as the most significant factor. Furthermore, this study also categorizes housing into common and high-end residences, thereby unveiling that distinct residential neighborhoods exhibit varying degrees of dependence on the built environment. The built environment exerts a scale effect on the formation of residential differentiation, with housing prices exhibiting increased sensitivity to the built environment at a smaller life circle scale. Conversely, the effect of the built environment on housing prices is amplified at a larger life circle scale. Under the dual influence of the scale and hierarchical effect, this framework can dynamically adapt to the uncertainty of changes in life circle planning policies and residential markets. This provides strong theoretical support for exploring the optimal life circle scale, alleviating residential differentiation, and promoting group fairness.
“…Its goal is to simulate real values through the minimum loss function, iterate multiple times, and reduce the prediction error. Compared to ordinary regression models, GBDT has two advantages [18]: (1) it does not demand normality of data and can accommodate variables with missing values; (2) it solves the problem of multicollinearity within the data serves to avert the intercorrelation among the built environment indicators.…”
“…From the perspective of the urban built environment, with the massive generation of urban crowdsourcing data, research based on the built environment within the life circle has gradually become enriched. Its content has expanded from traditional planning data such as land use type [17,18] to the functional properties represented by points of interest (POI) [19,20], traffic network data [21,22], pedestrian vitality data based on location-based services (LBSs) [23,24], blue-green space areas [25], etc. With the increased popularity of various social media application, "checking in" at scenic spots and places of consumption has become a trend among young people, which has yielded a new type of open source data with a strong tendency toward consumption behavior, such as from Weibo, Xiaohongshu, Meituan, etc.…”
Section: Introductionmentioning
confidence: 99%
“…The rapid development of artificial intelligence (AI) has led to the application of neural networks (NNs) in the field of spatial information mining, which has led to a new method suitable for analyzing, mining, and extracting crowdsourcing data, i.e., machine learning (ML). Machine learning has matured in geoscience research, and the gradient boosting decision tree (GBDT) algorithm has shown excellent applicability for the information mining of urban crowdsourcing data and nonlinear relationships in the living environment [18,50]. On the one hand, its parameter variability can overcome the difficulty in fitting the hedonic model under multimodality [22,51].…”
Determining the optimal planning scale for urban life circles and analyzing the associated built environment factors are crucial for comprehending and regulating residential differentiation. This study aims to bridge the current research void concerning the nonlinear hierarchical relationships between the built environment and residential differentiation under the multiscale effect. Specifically, six indicators were derived from urban crowdsourcing data: diversity of built environment function (DBEF1), density of built environment function (DBEF2), blue–green environment (BGE), traffic accessibility (TA), population vitality (PV), and shopping vitality (SV). Then, a gradient boosting decision tree (GBDT) was applied to derive the analysis of these indicators. Finally, the interpretability of machine learning was leveraged to quantify the relative importance and nonlinear relationships between built environment indicators and housing prices. The results indicate a hierarchical structure and inflection point effect of the built environment on residential premiums. Notably, the impact trend of the built environment on housing prices within a 15 min life circle remains stable. The effect of crowd behavior, as depicted by PV and SV, on housing prices emerges as the most significant factor. Furthermore, this study also categorizes housing into common and high-end residences, thereby unveiling that distinct residential neighborhoods exhibit varying degrees of dependence on the built environment. The built environment exerts a scale effect on the formation of residential differentiation, with housing prices exhibiting increased sensitivity to the built environment at a smaller life circle scale. Conversely, the effect of the built environment on housing prices is amplified at a larger life circle scale. Under the dual influence of the scale and hierarchical effect, this framework can dynamically adapt to the uncertainty of changes in life circle planning policies and residential markets. This provides strong theoretical support for exploring the optimal life circle scale, alleviating residential differentiation, and promoting group fairness.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.