The collection of moving object data is becoming more and more common, and therefore there is an increasing need for the efficient analysis and knowledge extraction of these data in different application domains. Trajectory data are normally available as sample points, and do not carry semantic information, which is of fundamental importance for the comprehension of these data. Therefore, the analysis of trajectory data becomes expensive from a computational point of view and complex from a user's perspective. Enriching trajectories with semantic geographical information may simplify queries, analysis, and mining of moving object data. In this paper we propose a data preprocessing model to add semantic information to trajectories in order to facilitate trajectory data analysis in different application domains. The model is generic enough to represent the important parts of trajectories that are relevant to the application, not being restricted to one specific application. We present an algorithm to compute the important parts and show that the query complexity for the semantic analysis of trajectories will be significantly reduced with the proposed model.
Several works have been proposed in the last few years for raw trajectory data analysis, and some attempts have been made to define trajectories from a more semantic point of view. Semantic trajectory data analysis has received significant attention recently, but the formal definition of semantic trajectory, the set of aspects that should be considered to semantically enrich trajectories and a conceptual data model integrating these aspects from a broad sense is still missing. This article presents a semantic trajectory conceptual data model named CONSTAnT, which defines the most important aspects of semantic trajectories. We believe that this model will be the foundation for the design of semantic trajectory databases, where several aspects that make a trajectory "semantic" are taken into account. The proposed model includes the concepts of semantic subtrajectory, semantic points, geographical places, events, goals, environment and behavior, to create a general concept of semantic trajectory. The proposed model is the result of several years of work by the authors in an effort to add more semantics to raw trajectory data for real applications. Two application examples and different queries show the flexibility of the model for different domains.In order to clarify the proposed conceptual model for semantic trajectories, this section defines some basic concepts and presents the closest related works.
CONSTAnT -A Conceptual Data Model for Semantic Trajectories 67
Abstract-Existing works for semantic trajectory data analysis have focused on the intersection of trajectories with application important geographic information and the use of the speed to find interesting places. In this paper we present a novel approach to find interesting places in trajectories, considering the variation of the direction as the main aspect. The proposed approach has been validated with real trajectory data associated to oceanic fishing vessels, with the objective to automatically find the real places where vessels develop fishing activities. Results have demonstrated that the method is very appropriate for applications in which the direction variation plays the essential role.
Most existing approaches aiming at measuring trajectory similarity are focused on two‐dimensional sequences of points, called raw trajectories. However, recent proposals have used background geographic information and social media data to enrich these trajectories with a semantic dimension, giving rise to the concept of semantic trajectories. Only a few works have proposed similarity measures for semantic trajectories or multidimensional sequences, having limitations such as predefined weight of the dimensions, sensitivity to noise, tolerance for gaps with different sizes, and the prevalence of the worst dimension similarity. In this article we propose MSM, a novel similarity measure for multidimensional sequences that overcomes the aforementioned limitations by considering and weighting the similarity in all dimensions. MSM is evaluated through an extensive experimental study that, based on a seed trajectory, creates sets of semantic trajectories with controlled transformations to introduce different kinds and levels of dissimilarity. For each set, we compute the similarity between the seed and the transformed trajectories, using different measures. The results showed that MSM was more robust and efficient than related approaches in the domain of semantic trajectories.
For many years trajectory data have been treated as sequences of space-time points or stops and moves. However, with the explosion of the Internet of Things and the flood of big data generated on the Internet, such as weather chan-
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.