Adaptive Configuration of In Situ Lossy Compression for Cosmology Simulations via Fine-Grained Rate-Quality Modeling

Jin, Sian; Pulido, Jesus; Grosset, Pascal; Tian, Jiannan; Tao, Dingwen; Ahrens, James

doi:10.1145/3431379.3460653

Cited by 12 publications

(6 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…We explain the reasons for the differences in the impact of different fault types on the post-analysis results. As aforementioned, the halo-finder algorithm searches for the halos from all the simulated data, with the following two criteria: (1) the mass of an object(s) must be greater than a threshold (e.g., 81.66 times the average mass of the whole dataset) to become a halo cell candidate [34], [35], and (2) there must be enough halo cell candidates in a certain area to form a halo. Below, for each fault type, we explain in details how each fault type potentially affects the halo-finder procedure.…”

Section: B Results For Faults Affecting Application Datamentioning

confidence: 99%

“…We also note that the baryon density field in Nyx can be easily compressed (i.e., compression ratio ranging from tens to hundreds) [34], [35], thus the importance of metadata would be greatly raised due to its increasing portion in the whole file. And since some metadata fields are related to each other, certain faults in the metadata can be detected and corrected as aforementioned; in other words, as the metadata of HDF5 file format itself has a certain degree of redundancy (correlation), we do not choose to replicate the metadata.…”

Section: ) Correction Methodologymentioning

confidence: 99%

See 1 more Smart Citation

Characterizing Impacts of Storage Faults on HPC Applications: A Methodology and Insights

Fang

Wang

Jin

et al. 2021

2021 IEEE International Conference on Cluster Computing (CLUSTER)

Self Cite

View full text Add to dashboard Cite

In recent years, the increasing complexity in scientific simulations and emerging demands for training heavy artificial intelligence models require massive and fast data accesses, which urges high-performance computing (HPC) platforms to equip with more advanced storage infrastructures such as solidstate disks (SSDs). While SSDs offer high-performance I/O, the reliability challenges faced by the HPC applications under the SSD-related failures remains unclear, in particular for failures resulting in data corruptions. The goal of this paper is to understand the impact of SSD-related faults on the behaviors of complex HPC applications. To this end, we propose FFIS, a FUSE-based fault injection framework that systematically introduces storage faults into the application layer to model the errors originated from SSDs. FFIS is able to plant different I/O related faults into the data returned from underlying file systems, which enables the investigation on the error resilience characteristics of the scientific file format. We demonstrate the use of FFIS with three representative real HPC applications, showing how each application reacts to the data corruptions, and provide insights on the error resilience of the widely adopted HDF5 file format for the HPC applications.

show abstract

Section: B Results For Faults Affecting Application Datamentioning

confidence: 99%

Section: ) Correction Methodologymentioning

confidence: 99%

Characterizing Impacts of Storage Faults on HPC Applications: A Methodology and Insights

Fang

Wang

Jin

et al. 2021

2021 IEEE International Conference on Cluster Computing (CLUSTER)

Self Cite

View full text Add to dashboard Cite

show abstract

“…Metric 4: Similar to prior work [14], [15], [42], [32], [20], [43], [35], we plot the rate-distortion curve to compare the distortion quality with the same bit-rate, for a fair comparison between different compression approaches, taking into account diverse compression algorithms.…”

Section: Evaluation Metricsmentioning

confidence: 99%

TAC+: Drastically Optimizing Error-Bounded Lossy Compression for 3D AMR Simulations

Daoce¹,

Pulido²,

Grosset³

et al. 2023

Preprint

View full text Add to dashboard Cite

Today's scientific simulations require a significant reduction of data volume because of extremely large amounts of data they produce and the limited I/O bandwidth and storage space. Error-bounded lossy compression has been considered one of the most effective solutions to the above problem. However, little work has been done to improve error-bounded lossy compression for Adaptive Mesh Refinement (AMR) simulation data. Unlike the previous work that only leverages 1D compression, in this work, we propose an approach (TAC) to leverage high-dimensional SZ compression for each refinement level of AMR data. To remove the data redundancy across different levels, we propose several pre-process strategies and adaptively use them based on the data characteristics. We further optimize TAC to TAC+ by improving the lossless encoding stage of SZ compression to efficiently handle many small AMR data blocks after the pre-processing. Experiments on 8 AMR datasets from a real-world large-scale AMR simulation demonstrate that TAC+ can improve the compression ratio by up to 4.9× under the same data distortion, compared to the state-of-the-art method. In addition, we leverage the flexibility of our approach to tune the error bound for each level, which achieves much lower data distortion on two application-specific metrics.! 1. The patch-based AMR data redundantly saves the data block to be refined at the next finer level in the current coarse level (will be introduced in detail in Section 2.3).

show abstract

“…3) Test Datasets: We conduct our evaluation and comparison based on eight typical 1D∼4D real-world HPC simulation datasets, including six from Scientific Data Reduction Benchmarks [34]: 1D HACC cosmology simulation [12], 2D LAMMPS (part of the EXAALT ECP project) molecular dynamics simulation [24], 3D CESM-ATM climate simulation [6], 3D Nyx cosmology simulation [31], 4D Hurricane ISABEL simulation [16], and 4D QMCPack quantum simulation [32]. They have been widely used in much prior work [37,26,27,47,46,38,40,39,20,4] and are good representatives of production-level simulation datasets. Additionally, we also evaluate two datasets that highlight our decoders' potential to be used as in-memory compressors as discussed in §I, including 3D RTM simulation data for petroleum exploration [17] and 1D GAMESS data for quantum chemistry simulation [10].…”

Section: Performance Evaluationmentioning

confidence: 99%

Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs

Rivera¹,

Di²,

Tian³

et al. 2022

Preprint

Self Cite

View full text Add to dashboard Cite

More and more HPC applications require fast and effective compression techniques to handle large volumes of data in storage and transmission. Not only do these applications need to compress the data effectively during simulation, but they also need to perform decompression efficiently for post hoc analysis. SZ is an error-bounded lossy compressor for scientific data, and cuSZ is a version of SZ designed to take advantage of the GPU's power. At present, cuSZ's compression performance has been optimized significantly while its decompression still suffers considerably lower performance because of its sophisticated lossless compression step-a customized Huffman decoding. In this work, we aim to significantly improve the Huffman decoding performance for cuSZ, thus improving the overall decompression performance in turn. To this end, we first investigate two state-ofthe-art GPU Huffman decoders in depth. Then, we propose a deep architectural optimization for both algorithms. Specifically, we take full advantage of CUDA GPU architectures by using shared memory on decoding/writing phases, online tuning the amount of shared memory to use, improving memory access patterns, and reducing warp divergence. Finally, we evaluate our optimized decoders on an Nvidia V100 GPU using eight representative scientific datasets. Our new decoding solution obtains an average speedup of 3.64× over cuSZ's Huffman decoder and improves its overall decompression performance by 2.43× on average.

show abstract

Adaptive Configuration of In Situ Lossy Compression for Cosmology Simulations via Fine-Grained Rate-Quality Modeling

Cited by 12 publications

References 30 publications

Characterizing Impacts of Storage Faults on HPC Applications: A Methodology and Insights

Characterizing Impacts of Storage Faults on HPC Applications: A Methodology and Insights

TAC+: Drastically Optimizing Error-Bounded Lossy Compression for 3D AMR Simulations

Optimizing Huffman Decoding for Error-Bounded Lossy Compression on GPUs

Contact Info

Product

Resources

About