Abstract:Abstract:Missing data reconstruction is a critical step in the analysis and mining of spatio-temporal data; however, few studies comprehensively consider missing data patterns, sample selection and spatio-temporal relationships. As a result, traditional methods often fail to obtain satisfactory accuracy or address high levels of complexity. To combat these problems, this study developed an effective two-step method for spatio-temporal missing data reconstruction (ST-2SMR). This approach includes a coarse-grain… Show more
“…The Pearson correlation coefficient [30] was calculated to quantitatively measure the spatiotemporal correlation between healthcare worker infections and patient infections, i.e., the difference in space and time between healthcare worker infections and patient infections. As shown in Figure 5 , the spatial correlation was obtained by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$t_{3}^{h}$ \end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$t_{3}^{p}$ \end{document} , and the temporal correlation was obtained by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$s_{3}^{h}$ \end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$s_{3}^{p}$ \end{document} [31] . The temporal and spatial correlation coefficients of the two series were calculated using Equations (1) and (2) , respectively: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}\begin{align*} r_{temporal} = \frac {{{\mathrm{Cov(}}s_{i}^{p} {\mathrm{,}}s_{i}^{h} {\mathrm{)}}}}{{\sqrt {\rm {D}(s_{i}^{p}) } \sqrt {\rm {D}(s_{i}^{h})} }} \tag{1}\\ r_{spatial} = \frac {{{\mathrm{Cov(}}t_{i}^{p} {\mathrm{,}}t_{i}^{h} {\mathrm{)}}}}{{\sqrt {\rm {D}(t_{i}^{p})} \sqrt {\rm {D}(t_{i}^{h})} }}\tag{2}\end{align*} \end{document} where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$\rm {Cov}(X,Y)$ \end{document} is the covariance of random variables \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$X$ \end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$Y$ \end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgre...…”
Section: Methodsmentioning
confidence: 99%
“…The Pearson correlation coefficient [30] was calculated to quantitatively measure the spatiotemporal correlation between healthcare worker infections and patient infections, i.e., the difference in space and time between healthcare worker infections and patient infections. As shown in Figure 5, the spatial correlation was obtained by t h 3 and t p 3 , and the temporal correlation was obtained by s h 3 and s p 3 [31]. The temporal and spatial correlation coefficients of the two series were calculated using Equations (1) and (2), respectively:…”
Studying the spatiotemporal differences in coronavirus disease (COVID-19) between social groups such as healthcare workers (HCWs) and patients can aid in formulating epidemic containment policies. Most previous studies of the spatiotemporal characteristics of COVID-19 were conducted in a single group and did not explore the differences between groups. To fill this research gap, this study assessed the spatiotemporal characteristics and differences among patients and HCWs infection in Wuhan, Hubei (excluding Wuhan), and China (excluding Hubei). The temporal difference was greater in Wuhan than in the rest of Hubei, and was greater in Hubei (excluding Wuhan) than in the rest of China. The incidence was high in healthcare workers in the early stages of the epidemic. Therefore, it is important to strengthen the protective measures for healthcare workers in the early stage of the epidemic. The spatial difference was less in Wuhan than in the rest of Hubei, and less in Hubei (excluding Wuhan) than in the rest of China. The spatial distribution of healthcare worker infections can be used to infer the spatial distribution of the epidemic in the early stage and to formulate control measures accordingly.
“…The Pearson correlation coefficient [30] was calculated to quantitatively measure the spatiotemporal correlation between healthcare worker infections and patient infections, i.e., the difference in space and time between healthcare worker infections and patient infections. As shown in Figure 5 , the spatial correlation was obtained by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$t_{3}^{h}$ \end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$t_{3}^{p}$ \end{document} , and the temporal correlation was obtained by \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$s_{3}^{h}$ \end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$s_{3}^{p}$ \end{document} [31] . The temporal and spatial correlation coefficients of the two series were calculated using Equations (1) and (2) , respectively: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}\begin{align*} r_{temporal} = \frac {{{\mathrm{Cov(}}s_{i}^{p} {\mathrm{,}}s_{i}^{h} {\mathrm{)}}}}{{\sqrt {\rm {D}(s_{i}^{p}) } \sqrt {\rm {D}(s_{i}^{h})} }} \tag{1}\\ r_{spatial} = \frac {{{\mathrm{Cov(}}t_{i}^{p} {\mathrm{,}}t_{i}^{h} {\mathrm{)}}}}{{\sqrt {\rm {D}(t_{i}^{p})} \sqrt {\rm {D}(t_{i}^{h})} }}\tag{2}\end{align*} \end{document} where \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$\rm {Cov}(X,Y)$ \end{document} is the covariance of random variables \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$X$ \end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgreek} \usepackage{mathrsfs} \setlength{\oddsidemargin}{-69pt} \begin{document} }{}$Y$ \end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{upgre...…”
Section: Methodsmentioning
confidence: 99%
“…The Pearson correlation coefficient [30] was calculated to quantitatively measure the spatiotemporal correlation between healthcare worker infections and patient infections, i.e., the difference in space and time between healthcare worker infections and patient infections. As shown in Figure 5, the spatial correlation was obtained by t h 3 and t p 3 , and the temporal correlation was obtained by s h 3 and s p 3 [31]. The temporal and spatial correlation coefficients of the two series were calculated using Equations (1) and (2), respectively:…”
Studying the spatiotemporal differences in coronavirus disease (COVID-19) between social groups such as healthcare workers (HCWs) and patients can aid in formulating epidemic containment policies. Most previous studies of the spatiotemporal characteristics of COVID-19 were conducted in a single group and did not explore the differences between groups. To fill this research gap, this study assessed the spatiotemporal characteristics and differences among patients and HCWs infection in Wuhan, Hubei (excluding Wuhan), and China (excluding Hubei). The temporal difference was greater in Wuhan than in the rest of Hubei, and was greater in Hubei (excluding Wuhan) than in the rest of China. The incidence was high in healthcare workers in the early stages of the epidemic. Therefore, it is important to strengthen the protective measures for healthcare workers in the early stage of the epidemic. The spatial difference was less in Wuhan than in the rest of Hubei, and less in Hubei (excluding Wuhan) than in the rest of China. The spatial distribution of healthcare worker infections can be used to infer the spatial distribution of the epidemic in the early stage and to formulate control measures accordingly.
“…Due to equipment failure and other factors, some data were missing from the original traffic-speed data sets. Considering the spatiotemporal characteristics of traffic data, we used the existing spatiotemporal interpolation algorithm to fill in the missing data in order to better reconstruct the original traffic conditions [42]. The basic principle is to consider the missing data pattern in the interpolation process and use coarse-grained interpolation to obtain partial reconstruction results in order to eliminate the effect of missing continuous block data on the subsequent interpolation process.…”
Short-term traffic forecasting plays an important part in intelligent transportation systems. Spatiotemporal k-nearest neighbor models (ST-KNNs) have been widely adopted for short-term traffic forecasting in which spatiotemporal matrices are constructed to describe traffic conditions. The performance of the models is closely related to the spatial dependencies, the temporal dependencies, and the interaction of spatiotemporal dependencies. However, these models use distance functions and correlation coefficients to identify spatial neighbors and measure the temporal interaction by only considering the temporal closeness of traffic, which result in existing ST-KNNs that cannot fully reflect the essential features of road traffic. This study proposes an improved spatiotemporal k-nearest neighbor model for short-term traffic forecasting by utilizing a multi-view learning algorithm named MVL-STKNN that fully considers the spatiotemporal dependencies of traffic data. First, the spatial neighbors for each road segment are automatically determined using cross-correlation under different temporal dependencies. Three spatiotemporal views are built on the constructed spatiotemporal closeness, periodic, and trend matrices to represent spatially heterogeneous traffic states. Second, a spatiotemporal weighting matrix is introduced into the ST-KNN model to recognize similar traffic patterns in the three spatiotemporal views. Finally, the results of traffic pattern recognition under these three spatiotemporal views are aggregated by using a neural network algorithm to describe the interaction of spatiotemporal dependencies. Extensive experiments were conducted using real vehicular-speed datasets collected on city roads and expressways. In comparison with baseline methods, the results show that the MVL-STKNN model greatly improves short-term traffic forecasting by lowering the mean absolute percentage error between 28.24% and 46.86% for the city road dataset and, between 53.80% and 90.29%, for the expressway dataset. The results suggest that multi-view learning merits further attention for traffic-related data mining under such a dynamic and data-intensive environment, which owes to its comprehensive consideration of spatial correlation and heterogeneity as well as temporal fluctuation and regularity in road traffic.
“…e spatial resolution of cell-id trajectory data depends on the service radius of each BTS, which varies in different areas, e.g., of hundred meters in metropolitan cities, and several kilometers in rural areas [5]. Meanwhile, the records of a cell-id trajectory are collected in a relatively long-time interval, which inevitably results in the problem of trajectory discontinuity or sparsity [6][7][8][9][10]. erefore, due to the inconformity between the spatiotemporal sparsity of the cell-id trajectory data and the requirement on fine-scale footprints, the refinement of cell-id trajectories becomes a research hotspot due to its extensive application prospects.…”
Mobile phone data have become a critical data source for transportation research. While a cell-id trajectory was routinely reorganized by International Mobile Subscriber Identity (IMSI), it potentially allows to analyze transportation behaviors and social interaction of total population, with a full temporal coverage at low cost. However, cell-id trajectory is often sparse due to low reporting frequency and uncertainness of mobile holders’ position. So, the cell-id trajectory refinement has been recognized as challenging work to further facilitate trajectory data mining. This paper presents a comprehensive approach to identify cell-id trajectories of public service vehicles (PSVs) from large volume of trajectories and further refines these cell-id trajectories by a heuristic global optimization approach. The modified longest common subsequence (LCSS) method is used to match a cell-id trajectory and a public transportation route (PTR) and correspondingly calculates their similarities for determining whether the trajectory is PSV mode or not. Taking full advantages of the nature of a PSV tends to move on the PTR in uniform motion to meet a prescript visit to stops, a heuristic global optimization approach is deployed to build a spatiotemporal model of a PSV motion, which estimates new locations of cell-id trajectories on the PTR. The approach was finally tested using Beijing cellular network signaling datasets. The precision of PSV trajectory detection is 90%, and the recall is 88%. Evaluated by our GNSS-logged trajectories, the mean absolute error (MAE) of refined PSV trajectories is 144.5 m and the standard deviation (St. Dev) is 81.8 m. It shows a significant improvement in comparison of traditional interpolation methods.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.