Distributed-thread scheduling methods for reducing page-thrashing

Sudo, Yasuhiro; Suzuki, Seigo; Shibayama, Shigeki

doi:10.1109/hpdc.1997.626444

Cited by 7 publications

(5 citation statements)

References 25 publications

(13 reference statements)

“…Two threads that frequently access the same shared pages can be presumed to share data. We define a density function as the access rate of thread i to page p. The correlation of two threads over page p can be computed as the product of the density function of the two threads for page p. The overall correlation of the two threads, then, is the sum of the correlations for each page in the system [18]. Unfortunately, page-based DSMs have no efficient way of deriving density functions because they can not track individual accesses.…”

Section: Cost Evaluations (mentioning)

confidence: 99%

Thread migration and communication minimization in DSM systems

Thitikamol

Keleher

1999

Proc. IEEE

Networks of workstations are characterized by dynamic resource capacities. Such environments can only be efficiently exploited by applications that are dynamically re-configurable. This paper explores mechanisms and policies that enable online reconfiguration of shared-memory applications through thread migration. We describe the design and preliminary performance of a DSM system that performs online re-mappings of threads to nodes based on sharing behavior. Our system obtains complete sharing information through a novel correlation-tracking phase that avoids the thread thrashing that characterizes previous approaches. This information is used to evaluate the communication required by a given thread mapping, and to predict the resulting performance.
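The correlation measure quoted in the citation statement above, and the idea of scoring a thread mapping by the communication it implies, can be illustrated with a minimal sketch. It is not the implementation of either cited system. The names `correlation`, `cut_cost`, `density`, and the sample access rates are hypothetical, and treating mapping cost as the summed correlation of thread pairs placed on different nodes is an assumption made here for illustration.

```python
from itertools import combinations

def correlation(density, i, j):
    """Overall correlation of threads i and j: the sum, over pages both
    threads touch, of the product of their per-page access rates."""
    shared_pages = density[i].keys() & density[j].keys()
    return sum(density[i][p] * density[j][p] for p in shared_pages)

def cut_cost(density, mapping):
    """Assumed cost of a mapping: total correlation between thread pairs
    that the mapping places on different nodes."""
    return sum(
        correlation(density, i, j)
        for i, j in combinations(sorted(density), 2)
        if mapping[i] != mapping[j]
    )

# Hypothetical density functions: thread -> {page: access rate}.
density = {
    "t0": {1: 0.5, 2: 0.5},
    "t1": {1: 0.25, 2: 0.75},
    "t2": {3: 1.0},
    "t3": {2: 0.5, 3: 0.5},
}

# Two candidate thread-to-node mappings.
colocated = {"t0": 0, "t1": 0, "t2": 1, "t3": 1}
scattered = {"t0": 0, "t1": 1, "t2": 0, "t3": 1}

print(correlation(density, "t0", "t1"))  # 0.5
print(cut_cost(density, colocated))      # 0.625
print(cut_cost(density, scattered))      # 1.25
```

Under these assumed rates, the mapping that keeps the highly correlated pairs together has the lower cost, which is the kind of comparison a migration policy would need to make.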

“…Thread migration has also been studied in the Millipede [22] and PARSEC [18] DSMs. Both systems implement thread migration in the context of sequential consistency rather than a relaxed consistency model.…”

Section: Related Work (mentioning)

confidence: 99%

Thread migration and communication minimization in DSM systems

Thitikamol

Keleher

1999

Proc. IEEE

“…Two threads that frequently access the same shared pages can be presumed to share data. We define a density function as the access rate of thread i to page p. The correlation of two threads over page p can be computed as the product of the density function of the two threads for page p. The overall correlation of the two threads, then, is the sum of the correlations for each page in the system [18]. Unfortunately, page-based DSMs have no efficient way of deriving density functions because they can not track individual accesses.…”

Section: Cost Evaluations (mentioning)

confidence: 99%

Thread Migration and Communication Minimization in DSM Systems

Thitikamol

Keleher

1998

“…Such a profile is however difficult to obtain without high overheads. Passive correlation tracking (used in [10,11]) that relies on remote page faults to activate access logging can only capture partial sharing behavior because access to a validated page by other local threads is missed logging. Active correlation tracking [12] was proposed to track the sharing information.…”

Section: Introduction (mentioning)

confidence: 99%

Adaptive sampling-based profiling techniques for optimizing the distributed JVM runtime

Lam

Luo

Wang

2010

2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Extending the standard Java virtual machine (JVM) for cluster-awareness is a transparent approach to scaling out multithreaded Java applications. While this clustering solution is gaining momentum in recent years, efficient runtime support for fine-grained object sharing over the distributed JVM remains a challenge. The system efficiency is strongly connected to the global object sharing profile that determines the overall communication cost. Once the sharing or correlation between threads is known, access locality can be optimized by collocating highly correlated threads via dynamic thread migrations. Although correlation tracking techniques have been studied in some page-based software DSM systems, they would entail prohibitively high overheads and low accuracy when ported to fine-grained object-based sharing systems. In this paper, we propose a lightweight sampling-based profiling technique for tracking inter-thread sharing. To preserve locality across migrations, we also propose a thread stack sampling mechanism for profiling the set of objects which are tightly coupled with a migrant thread. Sampling rates in both techniques can vary adaptively to strike a balance between preciseness and overhead. Such adaptive techniques are particularly useful for applications whose sharing patterns could change dynamically. The profiling results can be exploited for effective thread-to-core placement and dynamic load balancing in systems ranging from a multicore machine up to a cloud computing environment. We present the design and preliminary performance result of the JESSICA2 distributed JVM with the profiling implemented. Experimental results show that the profiling is able to obtain over 95% accurate global sharing profiles at a cost of only a few percents of execution time increase for fine- to medium-grained applications.
