Balazs Gerofi scite author profile

Balazs Gerofi

5 Publications

69 Citation Statements Received

85 Citation Statements Given

How they've been cited

129

How they cite others

Affiliations

Intel (United States), RIKEN Center for Computational Science, The University of Tokyo

Publications


“Big Data Assimilation” Toward Post-Petascale Severe Weather Prediction: An Overview and Progress

et al. 2016


Following the invention of the telegraph, electronic computer, and remote sensing, "big data" is bringing another revolution to weather prediction. As sensor and computer technologies advance, orders of magnitude bigger data are produced by new sensors and high-precision computer simulation or "big simulation." Data assimilation (DA) is a key to numerical weather prediction (NWP) by integrating the real-world sensor data into simulation. However, the current DA and NWP systems are not designed to handle the "big data" from next-generation sensors and big simulation. Therefore, we propose "big data assimilation" (BDA) innovation to fully utilize the big data. Since October 2013, Japan's BDA project has been exploring revolutionary NWP at 100-m mesh refreshed every 30 s, orders of magnitude finer and faster than the current typical NWP systems, by taking advantage of the fortunate combination of next-generation technologies: the 10-petaflops K computer, phased array weather radar, and geostationary satellite Himawari-8. So far, a BDA prototype system has been developed and tested with real-world retrospective local rainstorm cases. This paper summarizes the activities and progress of the BDA project, and concludes with perspectives toward the post-petascale supercomputing era.


Partially Separated Page Tables for Efficient Operating System Assisted Hierarchical Memory Management on Heterogeneous Architectures

Gerofi

Shimada

Hori

et al. 2013


Utilizing Memory Content Similarity for Improving the Performance of Replicated Virtual Machines

Gerofi

Vass

Ishikawa

2011


Checkpoint-recovery based Virtual Machine (VM) replication is an emerging approach towards accommodating VM installations with high availability. However, it comes at the price of significant performance degradation of the application executed in the VM, due to the large amount of state that needs to be synchronized between the primary and the backup machines. It is therefore critical to find new ways of attaining good performance while maintaining fault tolerant execution. In this paper, we present a novel approach to improve the performance of services deployed over replicated virtual machines by exploiting data similarity within the VM's memory image to reduce the network traffic during synchronization. For identifying similar memory areas, we propose a bit density based hash function, upon which we build a content addressable hash table. We present a quantitative analysis of the degree of similarity we found in various workloads, and introduce a lightweight compression method which, compared to existing replication techniques, reduces network traffic by up to 80% and yields a performance improvement of over 90% for certain latency sensitive applications.

I. INTRODUCTION

With the recent increase in cloud computing's prevalence, the number of online services deployed over virtualized infrastructures has experienced tremendous growth. At the same time, however, the latest hardware trend of growing component numbers in current computing systems renders hardware failures commonplace rather than exceptional [1]. Replication at the Virtual Machine Monitor (VMM) layer is an attractive technique to ensure fault tolerance in such environments, primarily because it provides seamless failover for the entire software stack executed inside the Virtual Machine (VM), regardless of the application or the underlying operating system.
One particular approach, checkpoint-recovery based VM replication, has gained a lot of attention recently [2], [3], [4], [5]. Checkpoint-recovery based replication of virtual machines is attained by capturing the entire execution state of the running VM at relatively high frequency in order to propagate changes to the backup machine almost instantly. Essentially, it keeps the backup machine nearly up-to-date with the latest execution state of the primary machine so that the backup can take over the execution in case the primary fails [2]. Between checkpoints the VM executes in log-dirty mode, i.e., write accessed pages are recorded so that when the snapshot is taken only pages that were modified in the most
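The abstract above describes identifying similar memory areas with a bit density based hash function and a content addressable hash table. The sketch below illustrates that general idea only. It is not the paper's implementation: the sub-block count, the quantization step, the XOR delta, and all names (`bit_density_key`, `SimilarityTable`, `xor_delta`) are assumptions made for illustration.

```python
from collections import defaultdict

PAGE_SIZE = 4096  # bytes; assumed page size
BLOCKS = 8        # assumed number of sub-blocks per page
QUANT = 64        # assumed quantization step for per-block bit counts


def bit_density_key(page: bytes) -> tuple:
    """Coarse signature: quantized count of set bits in each sub-block."""
    step = len(page) // BLOCKS
    key = []
    for i in range(BLOCKS):
        block = page[i * step:(i + 1) * step]
        ones = sum(bin(b).count("1") for b in block)
        key.append(ones // QUANT)
    return tuple(key)


class SimilarityTable:
    """Content-addressable table mapping a signature to pages seen so far."""

    def __init__(self):
        self._table = defaultdict(list)

    def add(self, page: bytes) -> None:
        self._table[bit_density_key(page)].append(page)

    def find_similar(self, page: bytes):
        """Return a stored page with the same signature, or None."""
        candidates = self._table.get(bit_density_key(page))
        return candidates[0] if candidates else None


def xor_delta(page: bytes, reference: bytes) -> bytes:
    """Bytewise XOR; mostly zero bytes when the two pages are similar."""
    return bytes(a ^ b for a, b in zip(page, reference))


# Usage: a dirty page that differs from a known page in a single bit.
base = bytes([0x0F]) * PAGE_SIZE
dirty = bytearray(base)
dirty[100] = 0x1F
dirty = bytes(dirty)

table = SimilarityTable()
table.add(base)
reference = table.find_similar(dirty)
delta = xor_delta(dirty, reference)
```

In this toy setup the delta is almost entirely zero bytes, so it would compress far better than the raw page. Pages whose bit counts fall on either side of a quantization boundary get different signatures even when they are similar, which is one limitation of so coarse a key.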


High-speed classification of coherent X-ray diffraction patterns on the K computer for high-resolution single biomolecule imaging

Tokuhisa

Arai

Joti

et al. 2013

J Synchrotron Radiat


Single-particle coherent X-ray diffraction imaging using an X-ray free-electron laser has the potential to reveal the three-dimensional structure of a biological supra-molecule at sub-nanometer resolution. In order to realise this method, it is necessary to analyze as many as 1 × 10^6 noisy X-ray diffraction patterns, each for an unknown random target orientation. To cope with the severe quantum noise, patterns need to be classified according to their similarities, and similar patterns averaged, to improve the signal-to-noise ratio. A high-speed scalable scheme has been developed to carry out classification on the K computer, a 10 PFLOPS supercomputer at RIKEN Advanced Institute for Computational Science. It is designed to work in real time with the experimental diffraction pattern collection at the X-ray free-electron laser facility SACLA, so that the classification results can be fed back to optimize experimental parameters during the experiment. The present status of our effort to develop the system, and the result of applying it to a set of simulated diffraction patterns, are reported. About 1 × 10^6 diffraction patterns were successfully classified by running 255 separate 1 h jobs in 385-node mode.
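As a rough illustration of why averaging similar patterns improves the signal-to-noise ratio, the sketch below averages repeated noisy copies of one synthetic pattern. It assumes the patterns have already been grouped into a single class and uses additive Gaussian noise as a stand-in for the quantum noise mentioned in the abstract. The signal, the noise level, and all function names are hypothetical.

```python
import random


def noisy_pattern(signal, sigma, rng):
    """Return the signal with independent Gaussian noise added per pixel."""
    return [s + rng.gauss(0.0, sigma) for s in signal]


def average_patterns(patterns):
    """Pixel-wise mean over a list of equally sized patterns."""
    n = len(patterns)
    return [sum(column) / n for column in zip(*patterns)]


def rms_error(pattern, signal):
    """Root-mean-square deviation of a pattern from the true signal."""
    total = sum((p - s) ** 2 for p, s in zip(pattern, signal))
    return (total / len(signal)) ** 0.5


rng = random.Random(0)                           # fixed seed for repeatability
signal = [float(i % 10) for i in range(200)]     # synthetic 200-pixel pattern
patterns = [noisy_pattern(signal, 5.0, rng) for _ in range(100)]

single_err = rms_error(patterns[0], signal)
avg_err = rms_error(average_patterns(patterns), signal)
```

For independent noise, the error of the mean of N patterns shrinks roughly as 1/sqrt(N), so `avg_err` here should come out at around a tenth of `single_err`.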


Interface for heterogeneous kernels: A framework to enable hybrid OS designs targeting high performance computing on manycore architectures

Shimosawa

Gerofi

Takagi

et al. 2014

