2011
DOI: 10.1007/978-3-642-23091-2_15
|View full text |Cite
|
Sign up to set email alerts
|

Towards an On-Line Analysis of Tweets Processing

Abstract: Abstract. Tweets exchanged over the Internet represent an important source of information, even if their characteristics make them difficult to analyze (a maximum of 140 characters, etc.). In this paper, we define a data warehouse model to analyze large volumes of tweets by proposing measures relevant in the context of knowledge discovery. The use of data warehouses as a tool for the storage and analysis of textual documents is not new but current measures are not well-suited to the specificities of the manipu… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
30
0
1

Year Published

2013
2013
2024
2024

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 30 publications
(31 citation statements)
references
References 10 publications
0
30
0
1
Order By: Relevance
“…Nevertheless, we identified few studies that have focused on the use of multidimensional tweets (Content and metadata). Among these works, the one of Bringay et al (2011) defined a multidimensional star model for analyzing a large number of tweets. However, the proposed model was dedicated to a particular trend.…”
Section: Related Workmentioning
confidence: 99%
“…Nevertheless, we identified few studies that have focused on the use of multidimensional tweets (Content and metadata). Among these works, the one of Bringay et al (2011) defined a multidimensional star model for analyzing a large number of tweets. However, the proposed model was dedicated to a particular trend.…”
Section: Related Workmentioning
confidence: 99%
“…Among these works, the one of [4] defined a multidimensional star model for analyzing a large number of tweets. However the proposed model was dedicated to a particular trend.…”
Section: Related Workmentioning
confidence: 99%
“…For instance, the characteristics of the words in tweets are not necessarily the same in a State and in a City. In [7], in a very different context, we proposed a new measure called T F -IDF adaptative . This measure has been defined in order not to focus on the number of documents but rather to the number of documents for a specific class and take into account the level in the hierarchy.…”
Section: Some Proposed Measuresmentioning
confidence: 99%
“…Since its introduction in 2006, the Twitter website 6 has become so popular that it is currently ranked as the 10 th most visited site over the world 7 . In January 2012, Twitter has been visited 2.5 billion times and in October 2011, more than 250 million tweets are posted every day with a user base of about 300 million people.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation