INTRODUCTION

Recently, the Message Passing Interface (MPI) [16] has been proposed as an industrial standard for writing "portable" message-passing parallel programs. The MPI standardization effort involved about 60 people from 40 organizations, including universities, national laboratories, and most MPP vendors. Version 1 of MPI was released in May 1994. MPI adopts most, if not all, common practices from existing communication libraries. One of the key components of MPI is the collective communication subset, which allows users to conveniently call library routines for various "global" communication operations, such as broadcast, scatter, and gather. All MPI collective communication routines are implicitly defined with respect to a process group [3], which specifies an ordered set of processes within which the collective communication is performed. For example, a multicast is specified as a broadcast to a particular process group. The performance of a parallel program depends on an efficient implementation of collective as well as point-to-point communication.

In existing parallel programming environments for Local Area Networks (LANs), such as PVM, EXPRESS, and IBM's MPL [2,12,20], collective communication routines are implemented on top of point-to-point communication. As a result, these environments suffer from poor collective communication performance. For example, a broadcast implemented over TCP, or over point-to-point UDP, on a LAN is clearly inefficient because it does not exploit the fact that most LANs are based on a broadcast medium.

In this paper, we present an efficient design and implementation of the Collective Communication Library in MPI (MPI-CCL) that is optimized for clusters of workstations. In particular, we demonstrate the implementation on a traditional 10-Mbit Ethernet-based LAN.
We note here that the ideas presented in this paper extend readily to any Network of Workstations (NOW) [21] that provides an unreliable broadcast transport protocol (for example, an ATM network whose switches have the broadcast capability offered by many vendors today).

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING 40, 19-34 (1997)

Our system is integrated with the operating system via an efficient kernel extension mechanism that we developed. The kernel extension significantly improves the performance of our implementation because it handles part of the communication overhead without involving user space. We have implemented our system on a collection of IBM RS/6000 workstations connected via a 10-Mbit Ethernet LAN. Our performance measurements are taken from typical scientific programs run in parallel by means of MPI. The hypothesis behind our design is that the system's performance is bounded by interactions between the kernel and user space rather than by the bandwidth delivered by the LAN Data-Link Layer. Our results indicate that the performance of our MPI Broadcast (on top of Ethernet) is about twice as fast as a recently published software implementation of broadcas...