2015
DOI: 10.1007/s00778-015-0393-2
|View full text |Cite
|
Sign up to set email alerts
|

epiC: an extensible and scalable system for processing Big Data

Abstract: The Big Data problem is characterized by the so called 3V features: Volume -a huge amount of data, Velocity -a high data ingestion rate, and Variety -a mix of structured data, semi-structured data, and unstructured data. The state-of-the-art solutions to the Big Data problem are largely based on the MapReduce framework (aka its open source implementation Hadoop). Although Hadoop handles the data volume challenge successfully, it does not deal with the data variety well since the programming interfaces and its … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 25 publications
(21 citation statements)
references
References 27 publications
0
21
0
Order By: Relevance
“…Our proposed recovery scheme is an important component of our epiCG project: a scalable graph engine on top of epiC [12]. In this paper, we use Giraph [1] as the underlying graph engine for the experiments as epiCG was under development when our recovery method was being designed.…”
Section: Introductionmentioning
confidence: 99%
“…Our proposed recovery scheme is an important component of our epiCG project: a scalable graph engine on top of epiC [12]. In this paper, we use Giraph [1] as the underlying graph engine for the experiments as epiCG was under development when our recovery method was being designed.…”
Section: Introductionmentioning
confidence: 99%
“…After gathering the newest user features and system status, the recommendation module implements a series of machine learning methods to recommend a list of dating partners to each user. In this experiment, we adopt epiC [6], an extensible and scalable system as the integrated storage/computing platform(our recommendation system is also available on other big data platforms). It is designed to handle multistructured data under a united interface by decoupling the concurrent programming model and data processing model, which processes the data-intensive computing automatically in parallel efficiently.…”
Section: System Architecturementioning
confidence: 99%
“…Addressing this problem, Dryad [10] enables the user to handle a DAG of tasks, and Pregel [13] deals with vertex-centric programming for large graphs. Recently, epiC [11] provides a unified framework for a variety of data types, combining the merits of all afore-mentioned systems. We refer the reader to a comprehensive survey [12] for other related systems.…”
Section: Related Workmentioning
confidence: 99%