A Survey on Real-Time Big Data Analytics: Applications and Tools

Yadranjiaghdam, Babak; Pool, Nathan; Tabrizi, Nasseh

doi:10.1109/csci.2016.0083

Cited by 30 publications

(16 citation statements)

References 32 publications


“…Security is another challenge for the stream data ingestion process. It arises from the rapid growth of the internet and of web-based systems, which face malicious and suspicious files that threaten their security, so the ingestion process should provide security, auditing, and provenance. The analytical value of stream data depends on the accuracy and completeness of the data, so achieving good and accurate stream data ingestion is a complicated and challenging task that requires good planning and expertise (Yadranjiaghdam, B., Yasrobi, S., & Tabrizi, N., 2017) (Pal, G., Li, G., & Atkinson, K., 2018) (Gurcan, F., & Berigel, M., 2018).

3.4.1 Apache Flume

Apache Flume is a distributed, reliable, available, and efficient service for collecting, aggregating, and importing huge amounts of streaming data, and for ingesting it in a form that is easy for processing tools to consume. It strongly supports fault tolerance with consistency mechanisms, and its data model is used particularly for online analytic applications. It plays the most important role in data ingestion for real-time data analytics, which is responsible for data refining and data visualization (Yadranjiaghdam, B., Pool, N., & Tabrizi, N., 2016). The data flow in Flume resembles a pipeline that ingests data from the source to the destination. According to Figure 5, which shows the Flume architecture, data is transferred from source to destination by a Flume agent, a JVM process that hosts the components through which data flows from the source to the next hop; it consists of a source, a channel, and a sink.…”

Section: Stream Data Ingestion (mentioning)

confidence: 99%

Big Data Ingestion and Preparation Tools

Alwidian,

Rahman,

Gnaim

et al. 2020

MAS


Developing Big Data applications has become very important in the last few years; many organizations and industries are aware that data analysis is becoming an important factor in being more competitive and discovering new trends and insights. The data ingestion and preparation step is the starting point for developing any Big Data project. This paper reviews some of the most widely used Big Data ingestion and preparation tools and discusses the main features, advantages, and usage of each tool. Its purpose is to help users select the right ingestion and preparation tool according to their needs and their applications' requirements.
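The Flume agent described in the citation statement for this entry (a single process hosting a source, a channel, and a sink) can be illustrated with a minimal in-memory sketch. This is not Flume's API: the class names `Channel` and `Agent`, the `capacity` parameter, and the drain-when-full policy are assumptions made for illustration only.

```python
from collections import deque


class Channel:
    """Bounded in-memory buffer between source and sink (hypothetical, not Flume's API)."""

    def __init__(self, capacity=100):
        self.capacity = capacity
        self._events = deque()

    def put(self, event):
        if len(self._events) >= self.capacity:
            raise OverflowError("channel is full")
        self._events.append(event)

    def take(self):
        # Return the oldest buffered event, or None when the channel is empty.
        return self._events.popleft() if self._events else None

    def __len__(self):
        return len(self._events)


class Agent:
    """Moves events from an iterable source through a channel to a sink callable."""

    def __init__(self, source, channel, sink):
        self.source = source
        self.channel = channel
        self.sink = sink

    def _drain(self):
        # Deliver every buffered event to the sink, preserving arrival order.
        delivered = 0
        while len(self.channel) > 0:
            self.sink(self.channel.take())
            delivered += 1
        return delivered

    def run(self):
        delivered = 0
        for event in self.source:
            # Assumed policy: when the buffer is full, flush it before accepting more.
            if len(self.channel) >= self.channel.capacity:
                delivered += self._drain()
            self.channel.put(event)
        delivered += self._drain()
        return delivered


# Usage: a list stands in for the data source, list.append for the destination.
destination = []
agent = Agent(source=iter(["a", "b", "c"]), channel=Channel(capacity=2), sink=destination.append)
total = agent.run()
```

The channel decouples the two ends: the source only needs to know how to put events, and the sink only sees events in arrival order. A real agent would run source and sink concurrently and persist the channel; this sketch does neither.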


“…Most healthcare analytics solutions have focused mainly on Hadoop [20], which can process large volumes of data from diverse sources in batch-oriented computing. Hadoop is limited for real-time computing, whereas Spark is faster than Hadoop and performs better, especially on problems involving iterative machine learning [21].…”

Section: Related Work (mentioning)

confidence: 99%

A new Internet of Things architecture for real-time prediction of various diseases using machine learning on big data environment

Ed-daoudy

Maalmi

2019

J Big Data


“…MapReduce has quickly become popular and widely adopted. Various fields of research use MapReduce to enhance performance, for example survey research in health care [14,29], government [6], sentiment analysis [19], set operations [15], and real-time data analytics [37]. Other studies focus on the state of the art of MapReduce and its applications [20,24].…”

Section: Apriori Algorithms: Background and Remarks (mentioning)

confidence: 99%

On using MapReduce to scale algorithms for Big Data analytics: a case study

2019


Scale adds cost. It also adds complexity and can make even the simplest computation infeasible. Many data analytics algorithms were originally designed for in-memory data. When faced with huge volumes of data, these algorithms fail to scale because of the limited processing capacity, storage capacity, and operations of a single machine. Thus, to improve scalability and efficiency, parallel and distributed algorithms are developed to
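To make the MapReduce pattern mentioned in this entry concrete, here is a single-process sketch of its map, shuffle, and reduce phases applied to itemset support counting, the core step of Apriori-style algorithms. It is not the citing paper's implementation: the function names, the `min_support` parameter, and the sample transactions are hypothetical, and a real deployment would distribute the map and reduce calls across machines.

```python
from collections import defaultdict
from itertools import combinations


def map_phase(transaction, k):
    """Emit (itemset, 1) for every k-item subset of one transaction."""
    for itemset in combinations(sorted(set(transaction)), k):
        yield itemset, 1


def shuffle(pairs):
    """Group emitted values by key, as the framework would between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Sum the partial counts for one itemset."""
    return key, sum(values)


def count_itemsets(transactions, k, min_support):
    """Return the k-itemsets that occur in at least min_support transactions."""
    pairs = (pair for t in transactions for pair in map_phase(t, k))
    counts = dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())
    return {itemset: n for itemset, n in counts.items() if n >= min_support}


# Hypothetical sample data for illustration.
transactions = [
    ["milk", "bread"],
    ["bread", "eggs"],
    ["milk", "bread", "eggs"],
]
frequent_pairs = count_itemsets(transactions, k=2, min_support=2)
```

Because each transaction is mapped independently and each key is reduced independently, both phases can run in parallel, which is what lets the approach scale past a single machine's memory.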
