Abstract:The paper considers performance and reliability of fault-tolerant software running on a hardware system that consists of multiple processing units. The software consists of functionally equivalent but independently developed versions that start execution simultaneously. The computational complexity and reliability of different versions are different. The system completes the task execution when the outputs of a pre-specified number of versions coincide. The processing units are characterized by different avail… Show more
“…This way, we obtain all the system solutions in a decreasing lexicographic order. For instance, assuming d 1 þ d 2 ¼ 15 with 0 r d 1 r 10 and 0 r d 2 r 12, the first solution is (10,5) and the other ones are subsequently (9,6), (8,7), (7,8), (6,9), (5,10), (4,11), and (3,12). The most benefit of using this approach to solve the system is that we obtain all the system solutions in a decreasing lexicographic order.…”
Section: Definitionmentioning
confidence: 94%
“…Fig. 2 shows an almost large benchmark ARPANET example [31,32] with M ¼ (13,12,14,8,12,8,11,12,9,7,14,12,8,13,12,6,13,15,14,10,12,6,7,10,11,14,13,12,14,9,15,15), L¼ (3,8,7,7,4,7,6,2,7,6,3,5,7,8,3,6,9,8,7,…”
Section: 3mentioning
confidence: 99%
“…Nowadays, network reliability theory has extensively been applied to a variety of real-world systems such as transportation [2], mobile ad hoc wireless [3,4], power transmission and distribution [5,6], grid computing [7], manufacturing [8][9][10], and computer and communication [9,11]. Moreover, in some optimization problems such as maximizing system reliability [6,8] or optimal design of a network subject to reliability constraint [12], there is an increasingly significant need for efficiently computing or estimating the system reliability.…”
“…This way, we obtain all the system solutions in a decreasing lexicographic order. For instance, assuming d 1 þ d 2 ¼ 15 with 0 r d 1 r 10 and 0 r d 2 r 12, the first solution is (10,5) and the other ones are subsequently (9,6), (8,7), (7,8), (6,9), (5,10), (4,11), and (3,12). The most benefit of using this approach to solve the system is that we obtain all the system solutions in a decreasing lexicographic order.…”
Section: Definitionmentioning
confidence: 94%
“…Fig. 2 shows an almost large benchmark ARPANET example [31,32] with M ¼ (13,12,14,8,12,8,11,12,9,7,14,12,8,13,12,6,13,15,14,10,12,6,7,10,11,14,13,12,14,9,15,15), L¼ (3,8,7,7,4,7,6,2,7,6,3,5,7,8,3,6,9,8,7,…”
Section: 3mentioning
confidence: 99%
“…Nowadays, network reliability theory has extensively been applied to a variety of real-world systems such as transportation [2], mobile ad hoc wireless [3,4], power transmission and distribution [5,6], grid computing [7], manufacturing [8][9][10], and computer and communication [9,11]. Moreover, in some optimization problems such as maximizing system reliability [6,8] or optimal design of a network subject to reliability constraint [12], there is an increasingly significant need for efficiently computing or estimating the system reliability.…”
“…Levitin et al [7] estimated the task execution time and reliability of a multi-processing-unit hardware system. However, the two systems were not organized as a master-slave infrastructure, and the redundant schemes they utilized were not cold-standby schemes.…”
MapReduce as a master-slave infrastructure consists of two master-side severs and a large number of slave-side working nodes. In this paper, we derive a job completion reliability (JCR for short) model from a single-job perspective for a general MapReduce infrastructure in which no redundancy scheme is adopted on the master side, and a coldstandby scheme is employed on the slave side. Without loss of generality, the JCR model is derived based on a Poisson distribution. In addition, we calculate the corresponding job energy consumption (JEC for short). Through the simulation and analytical results, MapReduce managers and service providers can comprehend how this infrastructure behaves and how to improve the infrastructure so as to achieve a more reliable and energy-efficient MapReduce environment.
“…As network reliability plays an important role in ensuring the normal output of data, many researches focus on reliability assurance or reliability improvement. For a given network, in order to guarantee the network reliability, lots of studies focus on the network backup paths or the spare paths [19]- [24], the network tolerance [25], [26], the network resilience [27], [28], etc.…”
As the improvement of the network reliability can be achieved by the components assignments, the optimal transmission line assignment with maximal reliabilities and minimal cost (OTLAMRMC) problem under the transmission time constraints is investigated. The OTLAMRMC problem contains two sub-problems: the reliabilities and assignment cost evaluation under the transmission time constraint (RACETTC) and the multi-objective transmission line assignment optimization problem. First, the RACETTC algorithm is proposed to evaluate reliabilities and assignment cost under the transmission time constraints for a certain transmission line configuration. Then, the Non-dominated Sorting Genetic Algorithm III (NSGA-III) is adopted to search the optimal transmission line assignment based on the results obtained by the RACETTC algorithm. Therefore, combining the RACETTC algorithm and the NSGA-III together, the RACETTC-NSGA-III algorithm is proposed to solve the OTLAMRMC problem. Finally, example simulations are given to illustrate the proposed algorithm. The example results show that the RACETTC-NSGA-III algorithm can provide efficient solutions in a reasonable time.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.