The investigation of online social networks has emerged as a new research direction. Twitter is one of the best-known social networks. It is easier to analyze than other social networks because it allows a certain percentage of its messages to be collected and downloaded without restriction. Existing research covers topics such as news and event detection, human behavior, and opinion analysis and mining. The online messages are available only through a continuous stream. Storing the messages from this stream effectively and efficiently is a serious challenge for software system design and architecture. About 10 GB of data are generated in this way every day, and storing this volume of data is not an easy task. In this paper we present a technique and an architecture for collecting and storing Twitter messages, together with a prototype through which the data can be accessed for further analysis. Our system makes use of specific techniques and methods of the Oracle environment. Our software architecture contrasts with previous solutions, in which the systems use the MSSQL or MySQL DBMS. We demonstrate that Oracle's indexing and job scheduler provide advantages for retrieving and handling large amounts of data.