On Building Parallel &amp; Grid Applications: Component Technology and Distributed Services

“…In this architecture, resources are encapsulated to be Grid services [2,4] with standard interfaces and behaviors. XCAT [5,6] and ICENI [7] attempt to build an application component framework on top of OGSA for distributed computation, and support grid applications that require the collaboration of different Grid services. Neither OGSA nor XCAT takes account of complementary or competitive resources in resource scheduling.…”

Section: Related Work and Conclusionmentioning

confidence: 99%

Coordinating Distributed Resources for Complex Scientific Computation

Ding

2004

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract. There exist a large number of computational resources in the domain of scientific and engineering computation. They are distributed, heterogeneous and often too restricted in computability for oneself to satisfy the requirement of modern scientific problems. To address this challenge, this paper proposes a component-based architecture for managing and accessing legacy applications on the computational grid. It automatically schedules legacies with domain expertise, and coordinates them to serve large-scale scientific computation. A prototype has been implemented to evaluate the architecture.

show abstract

“…While the Web service approach of MCS provides interoperability, it also hurts the performance when compared to use of a standard database for storage and retrieval. Scientific applications such as Mesoscale meteorology [2] depend on the orchestration of several workflows, defined in XML format. The international HapMap project aims to develop a haplotype of the human genome.…”

Section: Introductionmentioning

confidence: 99%

Parallel and distributed approach for processing large-scale XML datasets

Fadika

Head

Govindaraju

2009

2009 10th IEEE/ACM International Conference on Grid Computing

View full text Add to dashboard Cite

Abstract-An emerging trend is the use of XML as the data format for many distributed scientific applications, with the size of these documents ranging from tens of megabytes to hundreds of megabytes. Our earlier benchmarking results revealed that most of the widely available XML processing toolkits do not scale well for large sized XML data. A significant transformation is necessary in the design of XML processing for scientific applications so that the overall application turn-around time is not negatively affected. We present both a parallel and distributed approach to analyze how the scalability and performance requirements of large-scale XML-based data processing can be achieved. We have adapted the Hadoop implementation to determine the threshold data sizes and computation work required per node, for a distributed solution to be effective. We also present an analysis of parallelism using our PIXIMAL toolkit for processing large-scale XML datasets that utilizes the capabilities for parallelism that are available in the emerging multi-core architectures. Multi-core processors are expected to be widely available in research clusters and scientific desktops, and it is critical to harness the opportunities for parallelism in the middleware, instead of passing on the task to application programmers. Our parallelization approach for a multi-core node is to employ a DFA-based parser that recognizes a useful subset of the XML specification, and convert the DFA into an NFA that can be applied to an arbitrary subset of the input. Speculative NFAs are scheduled on available cores in a node to effectively utilize the processing capabilities and achieve overall performance gains. We evaluate the efficacy of this approach in terms of potential speedup that can be achieved for representative XML data sets.

show abstract

On Building Parallel & Grid Applications: Component Technology and Distributed Services

Cited by 25 publications

References 20 publications

JGRIM: An approach for easy gridification of applications

JGRIM: An approach for easy gridification of applications

Coordinating Distributed Resources for Complex Scientific Computation

Parallel and distributed approach for processing large-scale XML datasets

Contact Info

Product

Resources

About