Internet-scale support for map-reduce processing

Costa, Fernando Albuquerque; Veiga, Luís; Ferreira, Paulo J.

doi:10.1186/1869-0238-4-18

Cited by 17 publications

(9 citation statements)

References 18 publications

“…Most of the existing volunteer computing systems, as well as the architectures described in Section IV, consider a centralized system, with communication going through a single server or coordinator that fulfills the role of job scheduler and handles the task distribution and result validation. As a result, these systems either do not implement any result aggregation and validation mechanisms or create a considerable overhead on the server and are thus limited to embarrassingly parallel applications [64]. Recent solutions tolerate clients' failures by assigning the same task to multiple devices and by replicating the intermediate and final output of the computations across different devices [64], [65].…”

Section: Results Validation and Aggregation

mentioning

confidence: 99%

Volunteer Computing on Mobile Devices

Tapparello, Funai, Hijazi et al. 2016

Mobile Computing and Wireless Networks

Different forms of parallel computing have been proposed to address the high computational requirements of many applications. Building on advances in parallel computing, volunteer computing has been shown to be an efficient way to exploit the computational resources of underutilized devices that are available around the world. The idea of including mobile devices, such as smartphones and tablets, in existing volunteer computing systems has recently been investigated. In this chapter, we present the current state of the art in the mobile volunteer computing research field, where personal mobile devices are the elements that perform the computation. Starting from the motivations and challenges behind the adoption of personal mobile devices as computational resources, we then provide a literature review of the different architectures that have been proposed to support parallel computing on mobile devices. Finally, we present some open issues that need to be investigated in order to extend user participation and improve the overall system performance for mobile volunteer computing.
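The citation statement above mentions tolerating client failures by assigning the same task to multiple devices. A minimal sketch of one common way to validate such replicated results, majority voting with a quorum, is given below. The function name `validate_by_majority`, the `quorum` parameter, and the sample outputs are hypothetical and are not taken from the cited systems.

```python
from collections import Counter
from typing import Hashable, Optional, Sequence


def validate_by_majority(results: Sequence[Hashable], quorum: int) -> Optional[Hashable]:
    """Return the result reported by at least `quorum` replicas, else None.

    `results` holds the outputs returned by the devices that ran the same task.
    """
    if not results:
        return None
    # most_common(1) yields the single most frequent result and its vote count.
    value, votes = Counter(results).most_common(1)[0]
    return value if votes >= quorum else None


# Three hypothetical devices ran the same map task; one returned a faulty result.
replica_outputs = ["a:3,b:1", "a:3,b:1", "a:9,b:1"]
accepted = validate_by_majority(replica_outputs, quorum=2)
```

When no result reaches the quorum, the function returns `None`, which a scheduler could treat as a signal to reissue the task to further devices.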

“…Nevertheless, as MapReduce's popularity increased, some platforms decided to use available resources over the Internet to run MapReduce jobs. Therefore, solutions such as SCOLARS [5], MOON [11], Tang [15] and Marozzo [7] already support MapReduce applications.…”

Section: Related Work

mentioning

confidence: 99%

“…With respect to the few solutions [11,15,7,5] that support MapReduce, we are able to point out some issues (more details on Section 4): data distribution could be improved, intermediate data availability is overlooked, and lack of support for multiple cycle MapReduce applications.…”

mentioning

confidence: 98%

Scadamar

Bruno, Ferreira 2014

Proceedings of the 2nd International Workshop on CrossCloud Systems

Self Cite

Recent developments of popular programming models, namely MapReduce, have raised the interest of running MapReduce applications over the large scale Internet. However, current data distribution techniques used in Internet wide computing platforms to distribute the high volumes of information, which are needed to run MapReduce jobs, are naive, and therefore need to be re-thought. Thus, we present a computing platform called SCADA-MAR that runs MapReduce jobs over the Internet and provides two new main contributions: i) improves data distribution by using the BitTorrent protocol to distribute all data, and ii) improves intermediate data availability by replicating tasks or data through nodes in order to avoid losing intermediate data and consequently preventing big delays on the MapReduce overall execution time. Along with the design of our solution, we present an extensive set of performance results which confirm the usefulness of the above mentioned contributions, improved data distribution and availability, thus making our platform a feasible approach to run MapReduce jobs.

“…Another similar work is VMR [5], a volunteer computing system able to run MapReduce applications on top of volunteer resources, spread throughout the Internet. VMR leverages users' bandwidth through the use of inter-client communication, and uses a lightweight task validation mechanism.…”

Section: MapReduce on Non-dedicated Computing Resources

mentioning

confidence: 99%

“…MapReduce is an emerging programming model for large-scale data processing [6]. Recently, there are some MapReduce implementations that are designed for large-scale parallel data processing specialized on desktop grid or volunteer resources in Intranet or Internet, such as MOON [11], P2P-MapReduce [13], VMR [5], HybridMR [18], etc. In our previous work, we also implemented a MapReduce system called BitDew-MapReduce, specifically for desktop grid environment [19].…”

Section: Introduction

mentioning

confidence: 99%

Availability and Network-Aware MapReduce Task Scheduling over the Internet

Tang, Xie, He et al. 2015

Algorithms and Architectures for Parallel Processing

MapReduce offers an ease-of-use programming paradigm for processing large datasets. In our previous work, we have designed a MapReduce framework called BitDew-MapReduce for desktop grid and volunteer computing environment, that allows nonexpert users to run data-intensive MapReduce jobs on top of volunteer resources over the Internet. However, network distance and resource availability have great impact on MapReduce applications running over the Internet. To address this, an availability and network-aware MapReduce framework over the Internet is proposed. Simulation results show that the MapReduce job response time could be decreased by 27.15%, thanks to Naive Bayes Classifier-based availability prediction and landmark-based network estimation.
