2013 IEEE 4th International Conference on Cognitive Infocommunications (CogInfoCom) 2013
DOI: 10.1109/coginfocom.2013.6719259
|View full text |Cite
|
Sign up to set email alerts
|

A multi-terabyte relational database for geo-tagged social network data

Abstract: Despite their relatively low sampling factor, the freely available, randomly sampled status streams of Twitter are very useful sources of geographically embedded social network data. To statistically analyze the information Twitter provides via these streams, we have collected a year's worth of data and built a multi-terabyte relational database from it. The database is designed for fast data loading and to support a wide range of studies focusing on the statistics and geographic features of social networks, a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
39
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 22 publications
(39 citation statements)
references
References 16 publications
0
39
0
Order By: Relevance
“…Tweets were marked as coming from the USA if their longitude fell between -130 and -70 degrees and their latitude between 24 and 52 degrees. The messages and their metadata were organised into a large relational database that enabled fast and efficient querying (Dobos, 2013) at the Department of Physics of Complex Systems of Eotvos Lorand University, Budapest.…”
Section: Methodsmentioning
confidence: 99%
“…Tweets were marked as coming from the USA if their longitude fell between -130 and -70 degrees and their latitude between 24 and 52 degrees. The messages and their metadata were organised into a large relational database that enabled fast and efficient querying (Dobos, 2013) at the Department of Physics of Complex Systems of Eotvos Lorand University, Budapest.…”
Section: Methodsmentioning
confidence: 99%
“…Other results propose Optics [46] and grid-based [45] clustering to compile a structure of locations for GPS trajectory mining. Similar to our result, in [15], GADM is used over the same Twitter data set, but only for visualization purposes.…”
Section: Related Workmentioning
confidence: 65%
“…We use a four-month collection of 400 million geo-tagged Twitter messages detailed in [15]. We mention that the metadata of tweets may contain not only GPS coordinates but also a place attribute that can contain the name and type of the place.…”
Section: Data Setmentioning
confidence: 99%
“…The other system [5] uses MSSQL for data storage. They use the advantages of MSQL to build indices for localization.…”
Section: Related Workmentioning
confidence: 99%