2005
DOI: 10.1007/11602569_31
|View full text |Cite
|
Sign up to set email alerts
|

The Impact of Noise on the Scaling of Collectives: A Theoretical Approach

Abstract: Abstract. The performance of parallel applications running on large clusters is known to degrade due to the interference of kernel and daemon activities on individual nodes, often referred to as noise. In this paper, we focus on an important class of parallel applications, which repeatedly perform computation, followed by a collective operation such as a barrier. We model this theoretically and demonstrate, in a rigorous way, the effect of noise on the scalability of such applications. We study three natural a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
41
0

Year Published

2005
2005
2019
2019

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 36 publications
(44 citation statements)
references
References 8 publications
(10 reference statements)
1
41
0
Order By: Relevance
“…It is increasingly becoming evident that operating system interference in the form of daemon activity and interrupts contribute significantly to performance degradation of parallel applications in large clusters. An earlier theoretical study has evaluated the impact of system noise on application performance for different noise distributions [1]. Our work complements the theoretical analysis by presenting an empirical study of noise in production clusters.…”
mentioning
confidence: 82%
See 3 more Smart Citations
“…It is increasingly becoming evident that operating system interference in the form of daemon activity and interrupts contribute significantly to performance degradation of parallel applications in large clusters. An earlier theoretical study has evaluated the impact of system noise on application performance for different noise distributions [1]. Our work complements the theoretical analysis by presenting an empirical study of noise in production clusters.…”
mentioning
confidence: 82%
“…A formal approach to study the impact of noise in these large systems was initiated by Agarwal et al [1]. The parallel application studied was a typical class of kernel that appears in most scientific applications.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…However, results can increase the application running time dramatically, as shown in [16]. Theoretical [17] and practical analyses [18,16] show that operating system noise and resulting process skew is definitively influencing the performance of parallel applications using blocking collective operations. Non-blocking collective operations avoid explicit synchronization unless it is necessary (if the programmer wants to wait for the operation to finish).…”
Section: Possible Performance Benefitsmentioning
confidence: 99%