The wide adoption of location-enabled devices, together with the acceptance of services that leverage (personal) data as payment, allows scientists to push through some of the previous barriers imposed by data insufficiency and privacy skepticism. The research problems whose study require hard-to-obtain data (e.g., transportation mode detection, service contextualization, etc.) have now become more accessible to scientists because of the availability of data collecting outlets. One such problem is the detection of a user's transportation mode. Different fields have approached the problem of transportation mode detection with different aims: Location Based Services is a field that focuses on understanding the transportation mode in real-time, Transportation Science is a field that focuses on measuring the daily travel patterns of individuals or groups of individuals, and Human Geography is a field that focuses on enriching a trajectory by adding domain-specific semantics. While different fields providing solutions to the same problem could be viewed as a positive outcome, it is difficult to compare these solutions because the reported performance indicators depend on the type of approach and its aim (e.g., the real-time availability of Location Based Services requires the performance to be computed on each classified location). The contributions of this paper are three fold. First, the paper reviews the critical aspects that are desired by each field of research when providing solutions to the transportation mode detection problem. Second, it proposes three dimensions that separate three branches of science based on their main interest. Finally, it identifies important gaps in research and future directions,i.e., proposing: widely accepted error measures meaningful for all disciplines, methods robust to new datasets and a benchmark dataset for performance validation.
The efficient analysis of spatio-temporal data, generated by moving objects, is an essential requirement for intelligent location-based services. Spatiotemporal rules can be found by constructing spatio-temporal baskets, from which traditional association rule mining methods can discover spatio-temporal rules. When the items in the baskets are spatio-temporal identifiers and are derived from trajectories of moving objects, the discovered rules represent frequently travelled routes. For some applications, e.g., an intelligent ridesharing application, these frequent routes are only interesting if they are long and sharable, i.e., can potentially be shared by several users. This paper presents a database projection based method for efficiently extracting such long, sharable frequent routes. The method prunes the search space by making use of the minimum length and sharable requirements and avoids the generation of the exponential number of sub-routes of long routes. Considering alternative modelling options for trajectories, leads to the development of two effective variants of the method. SQL-based implementations are described, and extensive experiments on both real life-and large-scale synthetic data show the effectiveness of the method and its variants.
Rooted in the phylosophy of point-and segment-based approaches for transportation mode segmentation of trajectories, the measures that researchers have adoped to evaluate the quality of the results (1) are incomparable across approaches, hence slowing the progress in the field, and (2) do not provide insight about the quality of the continuous transportation mode segmentation. To address these problems, this paper proposes new error measures that can be applied to measure how well a continuous transportation mode segmentation model performs. The error measures introduced are based on aligning multiple inferred continuous intervals to ground truth intervals, and measure the cardinality of the alignment and the spatial and temporal discrepancy between the corresponding aligned segments. The utility of this new way of computing errors is shown by evaluating the segmentation of three generic transportation mode segmentation approaches (implicit, explicit-holistic and explicit-consensus-based transport mode segmentation), which can be implemented in a thick client architecture. Empirical evaluations on a large real-word dataset reveal the superiority of explicit-consensus-based transport mode segmentation, which can be attributed to the explicit modeling of segments and transitions that allows for a meaningful decomposition of the complex learning task.
Despite the availability of mobile positioning technologies and scientists' interests in tracking, modelling and predicting the movements of individuals and populations, these technologies are seldom efficiently used. The continuous changes in mobile positioning and other sensor technologies overburden scientists who are interested in data collection with the task of developing, implementing and testing tracking algorithms and their efficiency in terms of battery consumption. To this extent, this article proposes an adaptive, battery conscious tracking algorithm that collects trajectory data fused with accelerometer data and presents Mobility Collector, which is a prototype platform that, using the tracking algorithm, can produce highly configurable, off-the-shelf, multi-user tracking systems suitable for research purposes. The applicability of the tracking system is tested within the transport science domain by collecting labelled movement traces and related motion data, i.e. accelerometer data and derived information (number of steps and other useful movement features based on temporal aggregates of the raw readings) to develop and evaluate a method that automatically classifies the transportation mode of users with a 90.8% prediction accuracy. QC 20150312
For additional information, see the DB TECH REPORTS homepage: www.cs.aau.dk/DBTR . Any software made available via DB TECH REPORTS is provided "as is" and without any express or implied warranties, including, without limitation, the implied warranty of merchantability and fitness for a particular purpose.The DB TECH REPORTS icon is made from two letters in an early version of the Rune alphabet, which was used by the Vikings, among others. Runes have angular shapes and lack horizontal lines because the primary storage medium was wood, although they may also be found on jewelry, tools, and weapons. Runes were perceived as having magic, hidden powers. The first letter in the logo is "Dagaz," the rune for day or daylight and the phonetic equivalent of "d." Its meanings include happiness, activity, and satisfaction. The second letter is "Berkano," which is associated with the birch tree. Its divinatory meanings include health, new beginnings, growth, plenty, and clearance. It is associated with Idun, goddess of Spring, and with fertility. It is the phonetic equivalent of "b." AbstractThe popularity of embedded positioning technologies in mobile devices and the development of mobile communication technology have paved the way for powerful location-based services (LBSs). To make LBSs useful and user-friendly, heavy use is made of context information, including patterns in user location data which are extracted by data mining methods. However, there is a potential conflict of interest: the data mining methods want as precise data as possible, while the users want to protect their privacy by not disclosing their exact movements. This paper aims to resolve this conflict by proposing a general framework that allows user location data to be anonymized, thus preserving privacy, while still allowing interesting patterns to be discovered. The framework allows users to specify individual desired levels of privacy that the data collection and mining system will then meet. Privacy-preserving methods are proposed for two core data mining tasks, namely finding dense spatiotemporal regions and finding frequent routes. An extensive set of experiments evaluate the methods, comparing them to their non-privacy-preserving equivalents. The experiments show that the framework still allows most patterns to be found, even when privacy is preserved.
Natural hazards result in devastating losses in human life, environmental assets and personal, and regional and national economies. The availability of different big data such as satellite imageries, Global Positioning System (GPS) traces, mobile Call Detail Records (CDRs), social media posts, etc., in conjunction with advances in data analytic techniques (e.g., data mining and big data processing, machine learning and deep learning) can facilitate the extraction of geospatial information that is critical for rapid and effective disaster response. However, disaster response systems development usually requires the integration of data from different sources (streaming data sources and data sources at rest) with different characteristics and types, which consequently have different processing needs. Deciding which processing framework to use for a specific big data to perform a given task is usually a challenge for researchers from the disaster management field. Therefore, this paper contributes in four aspects. Firstly, potential big data sources are described and characterized. Secondly, the big data processing frameworks are characterized and grouped based on the sources of data they handle. Then, a short description of each big data processing framework is provided and a comparison of processing frameworks in each group is carried out considering the main aspects such as computing cluster architecture, data flow, data processing model, fault-tolerance, scalability, latency, back-pressure mechanism, programming languages, and support for machine learning libraries, which are related to specific processing needs. Finally, a link between big data and processing frameworks is established, based on the processing provisioning for essential tasks in the response phase of disaster management.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.