Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications

Ahmad, Maaz Bin; Cheung, Alvin

doi:10.1145/3183713.3196891

Cited by 30 publications

(26 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…To the best of our knowledge Parsynt is the only fully automatic tool that can synthesize divide-and-conquer programs of the class described in this paper from a reference implementation. A number of tools, including BIG [23], and Casper [1], synthesize various types of MapReduce [7] programs. The MapReduce model is too restrictive for splitting or partitioning divides, and all the tools mentioned fail to synthesize a solution for POP example from Section 2 or LIS example from Section 6.…”

Section: Resultsmentioning

confidence: 99%

“…We made the simplifying assumption that ⊙ = •, but in general, it is unknown. Therefore, instead of knowing that POP( ) • POP([ 1 , 2 ]) is the join expression, we have 1 Appendix C.1 spells out the rewriting steps for the interested reader. to characterize the shape of valid join expressions.…”

Section: Deductive Recursion Synthesismentioning

confidence: 99%

“…first homomorphism theorem). The literature on divide-and-conquer synthesis can be divided into two categories based on the class of input computations targeted: (1) those with list homomorphisms as input, with the aim of synthesizing efficient map-reduce [7] programs [1,14,21,23], (2) those that go beyond list homomorphisms [8,9,11,15,19,22], and target code with more dependencies. In category (2), the techniques in [8,9,11] synthesize list homomorphisms through some variation of lifting, the approach in [22] uses symbolic execution at runtime and to identify and defer dependencies, and Bellmania [15] targets input programs in the style of dynamic programming and orchestrates an efficient execution schedule to accommodate the dependencies.…”

Section: Related Workmentioning

confidence: 99%

See 2 more Smart Citations

Phased synthesis of divide and conquer programs

Farzan

Nicolet

2021

Proceedings of the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation

View full text Add to dashboard Cite

We propose a fully automated method that takes as input an iterative or recursive reference implementation and produces divide-and-conquer implementations that are functionally equivalent to the input. Three interdependent components have to be synthesized: a function that divides the original problem instance, a function that solves each sub-instance, and a function that combines the results of sub-computations. We propose a methodology that splits the synthesis problem into three successive phases, each with a substantially reduced state space compared to the original monolithic task, and therefore substantially more tractable. Our methodology is implemented as an addition to the existing synthesis tool Parsynt, and we demonstrate the efficacy of it by synthesizing highly nontrivial divide-and-conquer implementations of a set of benchmarks fully automatically. CCS Concepts: • Theory of computation → Program reasoning; Divide and conquer; Parallel computing models; • Software and its engineering → Automatic programming.

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Deductive Recursion Synthesismentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Phased synthesis of divide and conquer programs

Farzan

Nicolet

2021

Proceedings of the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation

View full text Add to dashboard Cite

show abstract

“…ALICE extracts an average of 670K logic facts from each repository. The second data set is from the evaluation dataset of Casper [20], an automated code optimization technique. This dataset consists of groups of similar code fragments that follow the same data access patterns (e.g., a sequential loop over lists) and can be systematically optimized by Casper.…”

Section: Simulation Experimentsmentioning

confidence: 99%

“…We evaluate ALICE using two benchmarks from prior work [9,20]. These benchmarks consist of 20 groups of similar code fragments in large-scale projects such as Eclipse JDT.…”

Section: Introductionmentioning

confidence: 99%

Active Inductive Logic Programming for Code Search

Sivaraman

Zhang

Broeck

et al. 2019

2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE)

View full text Add to dashboard Cite

Modern search techniques either cannot efficiently incorporate human feedback to refine search results or cannot express structural or semantic properties of desired code. The key insight of our interactive code search technique ALICE is that user feedback can be actively incorporated to allow users to easily express and refine search queries. We design a query language to model the structure and semantics of code as logic facts. Given a code example with user annotations, ALICE automatically extracts a logic query from code features that are tagged as important. Users can refine the search query by labeling one or more examples as desired (positive) or irrelevant (negative). ALICE then infers a new logic query that separates positive examples from negative examples via active inductive logic programming. Our comprehensive simulation experiment shows that ALICE removes a large number of false positives quickly by actively incorporating user feedback. Its search algorithm is also robust to user labeling mistakes. Our choice of leveraging both positive and negative examples and using nested program structure as an inductive bias is effective in refining search queries. Compared with an existing interactive code search technique, ALICE does not require a user to manually construct a search pattern and yet achieves comparable precision and recall with much fewer search iterations. A case study with real developers shows that ALICE is easy to use and helps express complex code patterns.

show abstract

Counterexample-Guided Partial Bounding for Recursive Function Synthesis

Nicolet

Farzan

2021

Computer Aided Verification

View full text Add to dashboard Cite

Quantifier bounding is a standard approach in inductive program synthesis in dealing with unbounded domains. In this paper, we propose one such bounding method for the synthesis of recursive functions over recursive input data types. The synthesis problem is specified by an input reference (recursive) function and a recursion skeleton. The goal is to synthesize a recursive function equivalent to the input function whose recursion strategy is specified by the recursion skeleton. In this context, we illustrate that it is possible to selectively bound a subset of the (recursively typed) parameters, each by a suitable bound. The choices are guided by counterexamples. The evaluation of our strategy on a broad set of benchmarks shows that it succeeds in efficiently synthesizing non-trivial recursive functions where standard across-the-board bounding would fail.

show abstract

Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications

Cited by 30 publications

References 37 publications

Phased synthesis of divide and conquer programs

Phased synthesis of divide and conquer programs

Active Inductive Logic Programming for Code Search

Counterexample-Guided Partial Bounding for Recursive Function Synthesis

Contact Info

Product

Resources

About