High-performance supercomputers generally comprise millions of CPUs in which interconnection networks play an important role to achieve high performance. New design paradigms of dynamic on-chip interconnection network involve a) topology b) synthesis, modeling and evaluation c) quality of service, fault tolerance and reliability d) routing procedures. To construct a dynamic highly fault tolerant interconnection networks requires more disjoint paths from each source-destination node pair at each stage and dynamic rerouting capability to use the various available paths effectively. Fast routing and rerouting strategy is needed to provide reliable performance on switch/link failures. This paper proposes two new architecture designs of fault tolerant interconnection networks named as reliable interconnection networks (RIN-1 and RIN-2). The proposed layouts are multipath multistage interconnection networks providing four disjoint paths for all the source-destination node pairs with dynamic rerouting capability. The designs can withstand switch failures in all the stages (including input and output stages) and provide more reliability. Reliability analysis of various MIN architectures is evaluated. On comparing the results with some existing MINs it is evident that the proposed designs provides higher reliability values and fault tolerance.
Multistage interconnection networks (MINs) are widely used for reliable data communication in tightly coupled large-scale multiprocessor systems. Reliability evaluation of interconnection networks is still a challenge owing to high complexity. Need of reliability evaluation for MINs is quite evident as these measures provide user-oriented performance. Terminal pair reliability (TPR) is the most commonly used reliability performance index of MINs. This paper provides a global view of different reliability measures and approaches for evaluation of these measures. Based on the critical literature review, shortcomings are identified and analyzed. Then the multi-variable inversion algorithm is applied to evaluate the reliability of one of the most common MINs, namely, Omega network, in a compact form. Terminal, broadcast, and network reliability for the Omega, Omega with an additional stage (Omega+), and Omega with two additional stages (Omega+2) systems are analyzed and compared. Then we extend our work to trace the minimal path sets of various MINs, and terminal pair reliabilities are evaluated and compared.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.