Lou Degenaro scite author profile

Lou Degenaro

5Publications

33Citation Statements Received

25Citation Statements Given

How they've been cited

How they cite others

Affiliations

IBM Research - Thomas J. Watson Research Center, IBM (United States)

Publications

Order By: Most citations

Towards Autonomic Fault Recovery in System-S

Jacques-Silva

Challenger

Degenaro

et al. 2007

View full text Add to dashboard Cite

System-S is a stream processing infrastructure which enables program fragments to be distributed and connected to form complex applications. There may be potentially tens of thousands of interdependent and heterogeneous program fragments running across thousands of nodes. While the scale and interconnection imply the need for automation to manage the program fragments, the need is intensified because the applications operate on live streaming data and thus need to be highly available. System-S has been designed with components that autonomically manage the program fragments, but the system components themselves are also susceptible to failures which can jeopardize the system and its applications.The work we present addresses the self healing nature of these management components in System-S. In particular, we show how one key component of System-S, the job management orchestrator, can be abruptly terminated and then recover without interrupting any of the running program fragments by reconciling with other autonomous system components. We also describe techniques that we have developed to validate that the system is able to autonomically respond to a wide variety of error conditions including the abrupt termination and recovery of key system components. Finally, we show the performance of the job management orchestrator recovery for a variety of workloads.

show abstract

Extending business objects with business rules

Rouvellou

Degenaro

Rasmus³

et al.

View full text Add to dashboard Cite

Combining Different Business Rules Technologies: A Rationalization

et al. 2001

View full text Add to dashboard Cite

Fusion: a system for business users to manage program variability

Weber

Chan

Degenaro

et al. 2005

IIEEE Trans. Software Eng.

View full text Add to dashboard Cite

Self healing in System-S

et al. 2008

View full text Add to dashboard Cite

Faults in a cluster are inevitable. The larger the cluster, the more likely the occurrence of some failure in hardware, in software, or by human error. System-S software must detect and self-repair failures while carrying out its prime directive-enabling stream processing program fragments to be distributed and connected to form complex applications. Depending on the type of failure, System-S may be able to continue with little or no disruption to potentially tens of thousands of interdependent and heterogeneous program fragments running across thousands of nodes.We extend the work we previously presented on the self healing nature of the job manager component in System-S by presenting how it can handle failures of other system components, applications and network infrastructure. We also evaluate the recoverability of the job management orchestrator component of System-S, considering crash failures with and without error propagation.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Lou Degenaro

Towards Autonomic Fault Recovery in System-S

Extending business objects with business rules

Combining Different Business Rules Technologies: A Rationalization

Fusion: a system for business users to manage program variability

Self healing in System-S

Contact Info

Product

Resources

About