1994
DOI: 10.1145/190787.190807

The effect of compiler-flag tuning on SPEC benchmark performance

Abstract: The SPEC CINT92 and CFP92 benchmark suites are application-based system benchmarks primarily intended for workstation-class system performance measurements. The SPEC CPU benchmark results are widely disseminated by system vendors and as such have become the de facto standard for comparing system performance. Recently, many observers have expressed concerns about the suitability of published SPEC benchmark results in representing application performance on typical systems. The most outspoken concern is that the…

Cited by 5 publications (3 citation statements)
References 6 publications (4 reference statements)
“…This makes the search for adequate flags a trial-and-error process. Moreover, as Chan et al pointed out in 1994 [10], it is difficult to ensure that none of the flags chosen has been added to the compiler just to optimize some SPEC program, a situation not allowed by any SPEC benchmark release. Besides this, the base speed metric forces to use the same flags to compile the entire benchmark.…”
Section: Compiling and Running Issues (mentioning)
confidence: 99%
“…This soon prompted the question whether such excessive optimizations are representative for real programming, or whether it would be better to "burn all flags" [13]. The issue was later discussed in academic publications [1,8] and in the trade press. After a considerable debate, the final compromise for SPEC, established in January 1994, was that two metrics were established, "baseline" and "peak": Every SPEC CPU benchmark measurement has to measure performance with a restricted "baseline" set of flags (metric name: SPECint_base2000 or similar), and optionally with a more extended "peak" set of flags.…”
Section: Evolution of the SPEC CPU Benchmarks Issues (mentioning)
confidence: 99%
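
To make the base/peak distinction quoted above concrete, the following is a minimal, hypothetical sketch; it is not SPEC's actual build harness, and the benchmark names, file paths, and flag choices are placeholders. The idea it illustrates is the one described in the citation statements: a base measurement must reuse one flag list for every benchmark in the suite, while a peak measurement may tune flags per benchmark.

    # Illustrative sketch only (not SPEC tooling): contrast a "base" build,
    # which reuses a single flag set for all benchmarks, with a "peak" build,
    # which may tune flags individually. Names, paths, and flags are placeholders.
    import subprocess

    BENCHMARKS = ["164.gzip", "176.gcc", "181.mcf"]   # hypothetical subset

    BASE_FLAGS = ["-O2"]                               # one flag set for all
    PEAK_FLAGS = {                                     # tuned per benchmark
        "164.gzip": ["-O3", "-funroll-loops"],
        "176.gcc":  ["-O2", "-fomit-frame-pointer"],
        "181.mcf":  ["-O3"],
    }

    def build(benchmark, flags):
        # A single source file stands in for each benchmark's real build.
        cmd = ["gcc", *flags, f"{benchmark}/src.c", "-o", f"{benchmark}/run"]
        subprocess.run(cmd, check=True)

    def build_base():
        for b in BENCHMARKS:
            build(b, BASE_FLAGS)       # identical flags everywhere -> base metric

    def build_peak():
        for b in BENCHMARKS:
            build(b, PEAK_FLAGS[b])    # per-benchmark tuning -> peak metric

The base rule exists precisely because, as the citing authors note, per-benchmark flag hunting is otherwise a trial-and-error process that is hard to police.
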
“…This makes the search for adequate flags a trial-and-error process. Moreover, as Chan et al pointed out in 1994 [32], it is difficult to ensure that none of the flags chosen is offered by the compiler just to optimize some SPEC program, a situation not allowed by any released SPEC benchmark. Besides this, the base speed metric forces to use the same flags to compile the entire benchmark.…”
Section: Compiler Flags (mentioning)
confidence: 99%