2008 IEEE Fourth International Conference on eScience 2008
DOI: 10.1109/escience.2008.77
|View full text |Cite
|
Sign up to set email alerts
|

Parallel Processing of Large-Scale XML-Based Application Documents on Multi-core Architectures with PiXiMaL

Abstract: Very large scientific datasets are becoming increasingly available in XML formats. Our earlier benchmarking results show that parsing XML is a time consuming process when compared with binary formats optimized for large scale documents. This performance bottleneck will get exacerbated as size of XML data increases in e-science applications. Our focus in this paper is on addressing this performance bottleneck. In recent times, the microprocessor industry has made rapid strides towards Chip Multi Processors (CMP… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2009
2009
2018
2018

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(9 citation statements)
references
References 15 publications
0
9
0
Order By: Relevance
“…Recent benchmarking works in [32,33] demonstrate that most existing implementations of WS do not scale well when the size of the SOAP/XML document being processed is increased. The authors in [32,33]argue that most existing software toolkitsare typically designed to process small-sized XML datasets, and thus are not suited for large-scale comptuging applications, e.g., [25,62].Hence, recent studies have attempted to alleviate the limitations of XML software performance bottlenecks by applying nontraditional parallel processor architectures, e.g., [8,23,30,36,55,78]. On one hand, general-purpose (scalar) processorsare characterized by the sequential nature of instruction execution, where instructions are selected based on their sequential memory addresses, conditions being evaluated one at a time.…”
Section: Parallelization and Hardware Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…Recent benchmarking works in [32,33] demonstrate that most existing implementations of WS do not scale well when the size of the SOAP/XML document being processed is increased. The authors in [32,33]argue that most existing software toolkitsare typically designed to process small-sized XML datasets, and thus are not suited for large-scale comptuging applications, e.g., [25,62].Hence, recent studies have attempted to alleviate the limitations of XML software performance bottlenecks by applying nontraditional parallel processor architectures, e.g., [8,23,30,36,55,78]. On one hand, general-purpose (scalar) processorsare characterized by the sequential nature of instruction execution, where instructions are selected based on their sequential memory addresses, conditions being evaluated one at a time.…”
Section: Parallelization and Hardware Approachesmentioning
confidence: 99%
“…Handling XMLstreams entirely in software (for instance, by mapping processing pipeline stages to software threads) prevents the execution speed to be improved beyond a best processing rate of tens of clock cycles per character, and that best case performance can result in rates on the order of hundreds of clock cycles per character for many practical XML applications [78]. As a result, recent studies have addressed these performance bottlenecks by investigating non-traditional processors, namely parallel processing architectures and ‚XML ma-chines‛, e.g., [8,23,30].…”
Section: Introductionmentioning
confidence: 99%
“…In our previous work in this area, we focused on statescalability for the parser and the memory requirements for arrays of primitives when multiple threads operate concurrently to read large input files [18]. One related project by Pan et.…”
Section: Related Work In Xml Processingmentioning
confidence: 99%
“…A thorough description and analysis of the effective memory bandwidth of the PIXIMAL approach is presented in another venue [9]. In this section, we present a summary of research findings on these two topics.…”
Section: Memory Bandwidth and State-scalabilitymentioning
confidence: 99%
“…While this technique works well for serial processing, it is not tailored for processing on multi-core nodes, especially for very large document sizes. In our previous work in this area, we focused just on the memory bandwidth in multi-core architectures when multiple threads operate concurrently to read large input files [9].…”
Section: Related Workmentioning
confidence: 99%