Proceedings of the 2018 International Conference on Management of Data 2018
DOI: 10.1145/3183713.3196891
|View full text |Cite
|
Sign up to set email alerts
|

Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications

Abstract: MapReduce is a popular programming paradigm for developing large-scale, data-intensive computation. Many frameworks that implement this paradigm have recently been developed. To leverage these frameworks, however, developers must become familiar with their APIs and rewrite existing code. We present Casper, a new tool that automatically translates sequential Java programs into the MapReduce paradigm. Casper identifies potential code fragments to rewrite and translates them in two steps: (1) Casper uses program … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
26
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 30 publications
(26 citation statements)
references
References 37 publications
0
26
0
Order By: Relevance
“…To the best of our knowledge Parsynt is the only fully automatic tool that can synthesize divide-and-conquer programs of the class described in this paper from a reference implementation. A number of tools, including BIG [23], and Casper [1], synthesize various types of MapReduce [7] programs. The MapReduce model is too restrictive for splitting or partitioning divides, and all the tools mentioned fail to synthesize a solution for POP example from Section 2 or LIS example from Section 6.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…To the best of our knowledge Parsynt is the only fully automatic tool that can synthesize divide-and-conquer programs of the class described in this paper from a reference implementation. A number of tools, including BIG [23], and Casper [1], synthesize various types of MapReduce [7] programs. The MapReduce model is too restrictive for splitting or partitioning divides, and all the tools mentioned fail to synthesize a solution for POP example from Section 2 or LIS example from Section 6.…”
Section: Resultsmentioning
confidence: 99%
“…We made the simplifying assumption that ⊙ = •, but in general, it is unknown. Therefore, instead of knowing that POP( ) • POP([ 1 , 2 ]) is the join expression, we have 1 Appendix C.1 spells out the rewriting steps for the interested reader. to characterize the shape of valid join expressions.…”
Section: Deductive Recursion Synthesismentioning
confidence: 99%
See 1 more Smart Citation
“…ALICE extracts an average of 670K logic facts from each repository. The second data set is from the evaluation dataset of Casper [20], an automated code optimization technique. This dataset consists of groups of similar code fragments that follow the same data access patterns (e.g., a sequential loop over lists) and can be systematically optimized by Casper.…”
Section: Simulation Experimentsmentioning
confidence: 99%
“…We evaluate ALICE using two benchmarks from prior work [9,20]. These benchmarks consist of 20 groups of similar code fragments in large-scale projects such as Eclipse JDT.…”
Section: Introductionmentioning
confidence: 99%