The NVIDIA Volta GPU microarchitecture introduces a specialized unit, called Tensor Core that performs one matrix-multiplyand-accumulate on 4×4 matrices per clock cycle. The NVIDIA Tesla V100 accelerator, featuring the Volta microarchitecture, provides 640 Tensor Cores with a theoretical peak performance of 125 Tflops/s in mixed precision. In this paper, we investigate current approaches to program NVIDIA Tensor Cores, their performances and the precision loss due to computation in mixed precision.Currently, NVIDIA provides three different ways of programming matrix-multiply-and-accumulate on Tensor Cores: the CUDA Warp Matrix Multiply Accumulate (WMMA) API, CUTLASS, a templated library based on WMMA, and cuBLAS GEMM. After experimenting with different approaches, we found that NVIDIA Tensor Cores can deliver up to 83 Tflops/s in mixed precision on a Tesla V100 GPU, seven and three times the performance in single and half precision respectively. A WMMA implementation of batched GEMM reaches a performance of 4 Tflops/s. While precision loss due to matrix multiplication with half precision input might be critical in many HPC applications, it can be considerably reduced at the cost of increased computation. Our results indicate that HPC applications using matrix multiplications can strongly benefit from using of NVIDIA Tensor Cores.
The primary target of the Magnetospheric MultiScale (MMS) mission is the electron-scale di usion layer around reconnection sites. Here we study where these regions are found in full three-dimensional simulations. In two dimensions the sites of electron di usion, defined as the regions where magnetic topology changes and electrons move with respect to the magnetic field lines, are located near the reconnection site. But in three dimensions we find that the reconnection exhaust far from the primary reconnection site also becomes host to secondary reconnection sites. Four diagnostics are used to demonstrate the point: the direct observation of topology impossible without secondary reconnection, the direct measurement of topological field line breakage, the measurement of electron jets emerging from secondary reconnection regions, and the violation of the frozen-in condition. We conclude that secondary reconnection occurs in a large part of the exhaust, providing many more chances for MMS to find itself in the right region to hit its target.T he primary focus of the planned Magnetospheric MultiScale (MMS) Mission 1 is the identification and in situ study of electron-scale regions where magnetic reconnection develops (http://mms.gsfc.nasa.gov/science.html). Magnetic reconnection 2 is believed to be the engine of many space and astrophysical processes where magnetic energy is stored over relatively long times to be released suddenly in bursts of kinetic energy. The mission planning and the thinking of the community has been in large part informed by the two-dimensional (2D) picture that has emerged from decades of simulations based on the classic field reversal configuration, where two regions of oppositely directed magnetic fields reconnect at one central location called the x-point. There magnetic field lines break and form new connections.This paradigm has been tremendously successful and from 2D simulations has found confirmation in many laboratory experiments 3 and in situ space observations 4 . When going from two to three dimensions, the model still remains valid if one can assume the presence of a region where the variations along the out-of-plane directions are small. In this case, extended reconnection regions develop, retaining a 2D-like configuration over significant widths in the out-of-plane direction 5,6 . But 3D effects can drastically alter the structure of a reconnection site 7 , with the possibility of inherently 3D configurations even in fast kinetic reconnection 8 .We think it is imperative to ask the question as to how these new 3D discoveries impact the execution of the MMS mission. The mission was designed with the two-nested-box vision 9 : around the reconnection site the ions become decoupled from magnetic field lines in a larger region, whereas the electrons continue to move along with the field lines until a smaller inner region is reached. This scheme is etched into the minds of every researcher working in reconnection and appears in the place of honour in the science section of the MMS web...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.