Population density and distribution of services represents the growth and demographic shift of the cities. For urban planners, population density and check-in behavior in space and time are vital factors for planning and development of sustainable cities. Location-based social network (LBSN) data seems to be a complement to many traditional methods (i.e., survey, census) and is used to study check-in behavior, human mobility, activity analysis, and social issues within a city. This check-in phenomenon of sharing location, activities, and time by users has encouraged this research on gender difference and frequency of using LBSN. Therefore, in this study, we investigate the check-in behavior of Chinese microblog Sina Weibo (referred as "Weibo") in 10 districts of Shanghai, China, for which we observe the gender difference and their frequency of use over a period. The mentioned districts were spatially analyzed for check-in spots by kernel density estimation (KDE) using ArcGIS. Furthermore, our results reveal that female users have a high rate of social media use, and significant difference is observed in check-in behavior during weekdays and weekends in the studied districts of Shanghai. Increase in check-ins is observed during the night as compared to the morning. From the results, it can be assumed that LBSN data can be helpful to observe gender difference.
Social media data analytics is the art of extracting valuable hidden insights from vast amounts of semi-structured and unstructured social media data to enable informed and insightful decision-making. Analysis of social media data has been applied for discovering patterns that may support urban planning decisions in smart cities. In this paper, Weibo social media data are used to analyze social-geographic human mobility in the CBD area of Shanghai to track citizen’s behavior. Our main motivation is to test the validity of geo-located Weibo data as a source for discovering human mobility and activity patterns. In addition, our goal is to identify important locations in people’s lives with the support of location-based services. The algorithms used are described and the results produced are presented using adequate visualization techniques to illustrate the detected human mobility patterns obtained by the large-scale social media data in order to support smart city planning decisions. The outcome of this research is helpful not only for city planners, but also for business developers who hope to extend their services to citizens.
Extracting features from crowd flow analysis has become an important research challenge due to its social cost and the impact of inadequate planning of high-quality services and security monitoring on the lives of citizens. This paper descriptively reviews and compares existing crowd analysis approaches based on different data sources. This survey provides the fundamentals of crowd analysis and considers three main approaches: crowd video analysis, crowd spatio-temporal analysis, and crowd social media analysis. The key research contributions in each approach are presented, and the most significant techniques and algorithms used to improve the precision of results that could be integrated into solutions to enhance the quality of services in a smart city are analyzed.
Most commercial data mining products provide a large number of models and tools for performing various data mining tasks, but few provide intelligent assistance for addressing many important decisions that must be considered during the mining process. In this paper, we propose the realization of a hybrid data mining assistant, based on the CBR paradigm and the use of an ontology, in order to empower the user during the various phases of the data mining process.
We present an approach for tackling the Sentiment Analysis problem in SemEval 2015. The approach is based on the use of a cooccurrence graph to represent existing relationships among terms in a document with the aim of using centrality measures to extract the most representative words that express the sentiment. These words are then used in a supervised learning algorithm as features to obtain the polarity of unknown documents. The best results obtained for the different datasets are: 77.76% for positive, 100% for negative and 68.04% for neutral, showing that the proposed graph-based representation could be a way of extracting terms that are relevant to detect a sentiment.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.