1993
DOI: 10.1109/71.205651
|View full text |Cite
|
Sign up to set email alerts
|

Mtool: an integrated system for performance debugging shared memory multiprocessor applications

Abstract: This paper describes Mtool, a software tool for analyzing performance losses in shared memory parallel programs. Mtool augments a program with low overhead instrumentation which perturbs the program's execution as little as possible while generating enough information to isolate memory and synchronization bottlenecks. After running the instrumented version of the parallel program, the programmer can use Mtool's windowbased user interface to view compute time, memory, and synchronization bottlenecks at increasi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

1997
1997
2014
2014

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 57 publications
(19 citation statements)
references
References 14 publications
(7 reference statements)
0
19
0
Order By: Relevance
“…Most existing cache and data locality analyzers measure the locality or the cache misses and indicate at which locations in the source code, or for which data structures, most cache misses occur [2,4,5,8,12,13,20,21,23,28,29,32]. While this information is helpful in identifying the main bottlenecks in the program, it can still be difficult to deduce a suitable program transformation from it.…”
Section: Related Workmentioning
confidence: 99%
“…Most existing cache and data locality analyzers measure the locality or the cache misses and indicate at which locations in the source code, or for which data structures, most cache misses occur [2,4,5,8,12,13,20,21,23,28,29,32]. While this information is helpful in identifying the main bottlenecks in the program, it can still be difficult to deduce a suitable program transformation from it.…”
Section: Related Workmentioning
confidence: 99%
“…Although the arithmetic complexity of these two code segments is equivalent, the second version produces approximately a 9% speedup over the first version on a 300Mhz Sun Ultra10 workstation 1 .…”
Section: The Basics Of Cache Memorymentioning
confidence: 99%
“…In this case, programmers can benefit from a profile that focuses specifically on a program's cache behavior [8,1,11,10], identifying problematic code sections and data structures. A simple cache profile that annotates source lines with the number of cache misses it incurs is certainly beneficial.…”
Section: A Cache Profilementioning
confidence: 99%
“…Mtool [4] provides information about the amount of performance lost due to the memory hierarchy, but only relates this information back to program source lines, not data structures. A system with more similarity to the techniques in this paper is MemSpy [7].…”
mentioning
confidence: 99%