Proceedings of the Ninth International Workshop on Dynamic Analysis 2012
DOI: 10.1145/2338966.2336798

Evaluating program analysis and testing tools with the RUGRAT random benchmark application generator

Abstract: Benchmarks are heavily used in different areas of computer science to evaluate algorithms and tools. In program analysis and testing, open-source and commercial programs are routinely used as benchmarks to evaluate different aspects of algorithms and tools. Unfortunately, many of these programs are written by programmers who introduce different biases, not to mention that it is very difficult to find programs that can serve as benchmarks with high reproducibility of results. We propose a novel approach for gene…
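The abstract describes generating random benchmark programs rather than relying on human-written ones. The core idea behind such generators can be sketched as stochastic expansion of a grammar: a minimal, hypothetical illustration below (the grammar rules and function names are this sketch's own assumptions, not RUGRAT's actual implementation) generates tiny Java-like statements reproducibly from a fixed seed.

```python
import random

# Hypothetical toy grammar: nonterminals map to lists of alternatives,
# each alternative is a sequence of nonterminals and terminal strings.
# (Illustrative only -- not RUGRAT's real grammar.)
RULES = {
    "<stmt>": [["<decl>"], ["<assign>"]],
    "<decl>": [["int ", "<id>", " = ", "<expr>", ";"]],
    "<assign>": [["<id>", " = ", "<expr>", ";"]],
    "<expr>": [["<id>"], ["<num>"], ["<expr>", " + ", "<expr>"]],
    "<id>": [["x"], ["y"], ["z"]],
    "<num>": [["0"], ["1"], ["42"]],
}

def generate(symbol, depth=0, max_depth=6):
    """Recursively expand a grammar symbol into program text."""
    if symbol not in RULES:
        return symbol  # terminal string
    alts = RULES[symbol]
    if depth >= max_depth:
        # Past the depth budget, prefer non-recursive alternatives
        # so that expansion is guaranteed to terminate.
        non_recursive = [a for a in alts if symbol not in a]
        alts = non_recursive or alts
    chosen = random.choice(alts)
    return "".join(generate(s, depth + 1, max_depth) for s in chosen)

random.seed(7)  # a fixed seed makes the generated "benchmark" reproducible
print(generate("<stmt>"))
```

Seeding the random number generator is what gives the reproducibility the abstract emphasizes: the same seed and grammar always yield the same program, so tool evaluations can be repeated exactly.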

Cited by 7 publications (3 citation statements).
References: 33 publications.
“…Threats to external validity limit the extent to which the results will generalize to other kinds of concurrent programs. Whereas there are large benchmark suites and benchmark generators for sequential programs, there is no large and widely accepted bug benchmark suites for concurrent programs, and thus this threat is common to all prior work in this area. To mitigate this problem, the benchmark suite includes most of the benchmarks used in other work and furthermore, includes both Java and C++ programs.…”
Section: Empirical Studies
confidence: 99%
“…A practical challenge for these kinds of generators is to construct realistic programs. However, an empirical study indicates that it is statistically impossible for a program analysis technique to differentiate a program written by a human from one that the tool generates [20]. The study compared real and generated programs with 78 existing software metrics.…”
Section: Objects Of Analysis
confidence: 99%
“…This article is an extended version of our previous work presented at the 10th International Workshop on Dynamic Analysis (WODA 2012).…”
confidence: 99%