Dariusz Król scite author profile

Computational science is well established as the third pillar of scientific discovery and is on par with experimentation and theory. However, as we move closer toward the ability to execute exascale calculations and process the ensuing extreme-scale amounts of data produced by both experiments and computations alike, the complexity of managing the compute and data analysis tasks has grown beyond the capabilities of domain scientists. Thus, workflow management systems are absolutely necessary to ensure current and future scientific discoveries. A key research question for these workflow management systems concerns the performance optimization of complex calculation and data analysis tasks. The central contribution of this article is a description of the PANORAMA approach for modeling and diagnosing the run-time performance of complex scientific workflows. This approach integrates extreme-scale systems testbed experimentation, structured analytical modeling, and parallel systems simulation into a comprehensive workflow framework called Pegasus for understanding and improving the overall performance of complex scientific workflows.

show abstract

Practical Resource Monitoring for Robust High Throughput Computing

Juve

Tovar

Silva

et al. 2015

View full text Add to dashboard Cite

Robust high throughput computing requires effective monitoring and enforcement of a variety of resources including CPU cores, memory, disk, and network traffic. Without effective monitoring and enforcement, it is easy to overload machines, causing failures and slowdowns, or underutilize machines, which results in wasted opportunities. This paper explores how to describe, measure, and enforce resources used by computational tasks. We focus on tasks running in distributed execution systems, in which a task requests the resources it needs, and the execution system ensures the availability of such resources. This presents two non-trivial problems: how to measure the resources consumed by a task, and how to monitor and report resource exhaustion in a robust and timely manner. For both of these tasks, operating systems have a variety of mechanisms with different degrees of availability, accuracy, overhead, and intrusiveness. We describe various forms of monitoring and the available mechanisms in contemporary operating systems. We then present two specific monitoring tools that choose different tradeoffs in overhead and accuracy, and evaluate them on a selection of benchmarks.

show abstract

Virtual Laboratory for Development and Execution of Biomedical Collaborative Applications

Bubak

Gubała²,

Malawski

et al. 2008

View full text Add to dashboard Cite

A Cloud-Based Data Farming Platform for Molecular Dynamics Simulations

Król

Orzechowski

Kitowski

et al. 2014

View full text Add to dashboard Cite

Self-scalable services in service oriented software for cost-effective data farming

Król

Kitowski

2016

Future Generation Computer Systems

View full text Add to dashboard Cite

Toward an End-to-End Framework for Modeling, Monitoring and Anomaly Detection for Scientific Workflows

Mandal

Ruth

Baldin

et al. 2016

View full text Add to dashboard Cite

Uniform and Efficient Access to Data in Organizationally Distributed Environments

Dutka

Słota

Wrzeszcz

et al. 2014

View full text Add to dashboard Cite

12 3 4

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Dariusz Król

Anomaly detection for scientific workflow applications on networked clouds

PANORAMA: An approach to performance modeling and diagnosis of extreme-scale workflows

Practical Resource Monitoring for Robust High Throughput Computing

Virtual Laboratory for Development and Execution of Biomedical Collaborative Applications

A Cloud-Based Data Farming Platform for Molecular Dynamics Simulations

Self-scalable services in service oriented software for cost-effective data farming

Toward an End-to-End Framework for Modeling, Monitoring and Anomaly Detection for Scientific Workflows

Uniform and Efficient Access to Data in Organizationally Distributed Environments

Contact Info

Product

Resources

About