Spatio-temporal c1ustering is a process of grouping objects based on their spatial and temporal similarity. It is relatively new subfield of data mining which gained high popularity especially in geographie information scienees due to the pervasiveness of all kinds of location-based or environmental devices that record position, time or/and environmental properties of an object or set of objects in real-time. As a consequence, different types and large amounts of spatio-temporal data became available that introduce new challenges to data analysis and require novel approaches to knowledge discovery. In this chapter we concentrate on the spatio-temporal c1ustering in geographic space. First, we provide a c1assification of different types of spatio-temporal data. Then, we focus on one type of spatio-temporal c1ustering -trajectory c1ustering, provide an overview of the state-of-the-art approach es and methods of spatio-temporal c1ustering and finally present several scenarios in different application domains such as movement, ceIIular networks and environmental studies.
This is the unspecified version of the paper.This version of the publication may differ from the final published version. Permanent repository link AbstractMovement data link together space, time, and objects positioned in space and time. They hold valuable and multifaceted information about moving objects, properties of space and time as well as events and processes occurring in space and time. We present a conceptual framework that describes in a systematic and comprehensive way the possible types of information that can be extracted from movement data and on this basis defines the respective types of analytical tasks. Tasks are distinguished according to the type of information they target and according to the level of analysis, which may be elementary (i.e. addressing specific elements of a set) or synoptic (i.e. addressing a set or subsets). We also present a taxonomy of generic analytic techniques, in which the types of tasks are linked to the corresponding classes of techniques that can support fulfilling them. We include techniques from several research fields: visualization and visual analytics, geographic information science, database technology, and data mining.We expect the taxonomy to be valuable for analysts and researchers. Analysts will receive guidance in choosing suitable analytic techniques for their data and tasks.Researchers will learn what approaches exist in different fields and compare or relate them to the approaches they are going to undertake.3
Photo-sharing websites such as Flickr and Panoramio contain millions of geotagged images contributed by people from all over the world. Characteristics of these data pose new challenges in the domain of spatio-temporal analysis. In this paper, we define several different tasks related to analysis of attractive places, points of interest and comparison of behavioral patterns of different user communities on geotagged photo data. We perform analysis and comparison of temporal events, rankings of sightseeing places in a city, and study mobility of people using geotagged photos. We take a systematic approach to accomplish these tasks by applying scalable computational techniques, using statistical and data mining algorithms, combined with interactive geo-visualization. We provide exploratory visual analysis environment, which allows the analyst to detect spatial and temporal patterns and extract additional knowledge from large geotagged photo collections. We demonstrate our approach by applying the methods to several regions in the world.
This article presents a geovisual analytics approach to discovering people's preferences for landmarks and movement patterns from photos posted on the Flickr website. The approach combines an exploratory spatio-temporal analysis of geographic coordinates and dates representing locations and time of taking photos with basic thematic information available through the Google Maps Web mapping service, and interpretation of the analyzed area. The article describes data aggregation and filtering techniques to reduce the size of the dataset and focuses on information addressing research questions. The results of analysis for the Seattle metropolitan area help to distinguish between sites that are occasionally popular among the photographers and can be considered as potential attractions from sites that are regularly visited and already known as city landmarks. The analysis of photographers' movements across the metropolitan area shows that most photographers' itineraries are short and highly localized.
Abstract-Many applications that employ data mining techniques involve mining data that include private and sensitive information about the subjects. One way to enable effective data mining while preserving privacy is to anonymize the dataset that include private information about subjects before being released for data mining. One way to anonymize dataet is to manipulate its content so that the records adhere to k-anonymity. Two common manipulation techniques used to achieve kanonymity of a dataset are generalization and suppression. Generalization refers to replacing a value with a less specific but semantically consistent value, while suppression refers to not releasing a value at all. Generalization is more commonly applied in this domain since suppression may dramatically reduce the quality of the data mining results if not properly used. However, generalization presents a major drawback as it requires a manually generated domain hierarchy taxonomy for every quasiidentifier in the dataset on which k-anonymity has to be performed. In this paper we propose a new method for achieving kanonymity named K-anonymity of Classification Trees Using Suppression (kACTUS). In kACTUS efficient multi-dimensional suppression is performed, i.e., values are suppressed only on certain records depending on other attribute values, without the need for manually-produced domain hierarchy trees. Thus, in kACTUS we identify attributes that have less influence on the classification of the data records and we suppress them if needed in order to comly with k-anonymity. The kACTUS method was evaluated on ten separate datasets to evaluate its accuracy as compared to other k-anonymity generalization and suppressionbased methods. Encouraging results suggest that kACTUS' predictive performance is better than that of existing k-anonymity algorithms. Specifically, on average the accuracies of TDS, TDR and kADET are lower than kACTUS in 3.5%, 3.3% and 1.9% respectively despite their usage of manually defined domain trees. The accuracy gap is increased to 5.3%, 4.3% and 3.1% respectively when no domain trees are used.
The vastly increasing number of online hotel room bookings are not only intensifying the competition in the travel industry as a whole, but also prompt travel intermediates (i.e. e-companies that aggregate information about different travel products from different travel suppliers) into a fierce competition for the best prices of travel products, i.e. hotel rooms. An important factor that affects revenues is the ability to conclude profitable deals with different travel suppliers. However, the profitability of a contract not only depends on the communication skills of a contract manager. It significantly depends on the objective information obtained about a specific travel supplier and his/her products. While the contract manager usually has a broad knowledge of the travel business in general, collecting and processing specific information about travel suppliers is usually a time and cost expensive task. Our goal is to develop a tool that assists the travel intermediate to acquire the missing strategic information about individual hotels in order to leverage profitable deals. We present a GIS-based decision-support system that can both, estimate objective hotel room rates using essential hotel and locational characteristics and predict temporal room rate prices. Information about objective hotel room rates allow for an objective comparison and provide the basis for a realistic computation of the contract's profitability. The temporal prediction of room rates can be used for monitoring past hotel room rates and for adjusting the price of the future contract. This paper makes three major contributions. First, we present a GIS-based decision support system, the first of its kinds, for hotel brokers. Second, the DSS can be applied to virtually any part of the world, which makes it a very attractive business tool in real-life situations. Third, it integrates a widely used data mining framework that provides access to dozens of ready to run algorithms to be used by a domain expert and it offers the possibility of adding new algorithms once they are developed. The system has been designed and evaluated in close cooperation with a company that develops travel technology solutions, in particular inventory management and pricing solutions for many well-known websites and travel agencies around the world. This company has also provided us with real, large datasets to evaluate the system. We demonstrate the functionality of the DSS using the hotel data in the area of Barcelona, Spain. The results indicate the potential usefulness of the proposed system.
In this paper we present a novel approach for analyzing the trajectories of moving objects and of people in particular. The minded data from these sequences can provide valuable information for understanding the surrounding locations, discovering attractive place or mining frequent sequences of visited places. Based on geotagged photos, our framework mines semantically annotated sequences. Our framework is capable of mining semantically annotated sequences of any length to discover patterns that are not necessarily immediate antecedents. The approach consists of four main steps. In the first step, every photo location is semantically annotated by assigning it to a known nearby point of interest. In the second step, a density-based clustering algorithm is applied to all unassigned photos, creating regions of unknown points of interest. In the third step, a travel sequence of every individual is built. In the final step, travel sequence patterns are mined using the semantics that were obtained from the first two steps. Case studies of Guimarães, Portugal (where the conference takes place) and Berlin, Germany demonstrate the capabilities of the proposed framework.
Space- and time-referenced data published on the Web by general people can be viewed in a dual way: as independent spatiotemporal events and as trajectories of people in the geographical space. These two views suppose different approaches to the analysis, which can yield different kinds of valuable knowledge about places and about people. We define possible types of analysis tasks related to the two views of the data and present several analysis methods appropriate for these tasks. The methods are suited to large amounts of the data
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.