Abstract:SUMMARYThis paper presents a fast-multipole surface-chargesimulation method for calculating three-dimensional Laplacian fields in voxel models. This method treats a surface of a voxel that has different inside and outside conductivities as a surface element of the indirect BEM (boundary element method). The main features of the proposed method are as follows. (1) An O(D 2 ) performance in the memory capacity and operation cost is provided by applying the diagonal form fast multipole method (FMM), when the numb… Show more
“…By contrast, Equation (16) has O.p 2 / complexity because it requires no summation and it depends on , Â, and . Therefore, after M m n is translated into W j k , the W j k is repeatedly available for a lot of translations using Equation (16). Therefore, the most significant part of the M2L operation is reduced to O.p 2 /.…”
Section: Resultsmentioning
confidence: 99%
“…After selecting a translation direction from˙x,˙y, and˙´, the following operations are sequentially performed: rotate all related M toward the selected direction using Equation (19); translate all M to exponential expansion coefficients W using Equations (14) and (15); translate W to W 0 that corresponds to L using Equation (16); translate W 0 to L with Equations (17) and (18); reversely rotate all related L using Equation (20). D-M2L is completed after performing all of these processes in all directions.…”
Section: Cpu Codesmentioning
confidence: 99%
“…The parameter k max used in D-M2L [9] was set to 18. Parameters M.k/, k D 1 to k max , were set to 6,8,12,16,20,26,30,34,38,44, 48, 52, 56, 60, 60, 52, 4, and 2. Odd numbers were avoided for M.k/ to improve the calculation efficiency [16].…”
Section: Cpu Codesmentioning
confidence: 99%
“…Therefore, the CUDA kernels of these processes can share the template in Table II. However, this required a modification in the template for diagonal form translation based on Equation (16). Lines 05 and 06 were exchanged irregularly, that is, 'For loop 2' specified source boxes, whereas CUDA threads handled 568 pairs of target and source coefficients that correspond to the one-to-one mapping in the diagonal-form translation.…”
Section: Graphics Processing Unit Codesmentioning
confidence: 99%
“…The BEM is geared to electrostatic field analysis in voxel models, and it considers square walls on cubic voxels as boundary surface elements [16]. Using the BEM, three-dimensional fields were analyzed in human voxel models derived from anatomical images.…”
SUMMARYAn indirect boundary element method that is geared to electrostatic field analysis in voxel models is accelerated by graphics processing units (GPUs). The method considers square walls on cubic voxels as boundary surface elements and uses the fast multipole method (FMM) to analyze large-scale models. On the basis of two conventional CPU codes, three GPU codes are programmed in search of higher computing performance. These GPU codes are designed as follows: In GPU code 1, direct and far fields in the FMM are simultaneously calculated on the GPU and the CPU, respectively; in GPU code 2, both fields are calculated on the GPU with a rotation-coaxial translation-rotation decomposition algorithm; and in GPU code 3, both fields are calculated on the GPU with a diagonal translation scheme. The electric fields in human models are generated by applying a 50-Hz magnetic field or by injecting direct-current (DC) current through two electrodes and they were calculated successfully using a personal computer with three GPUs and six CPU cores. An analysis with 3.9 million surface elements took 89.4 s to solve its governing linear system with double-precision floating-point arithmetic. GPU codes 1, 2, and 3 demonstrated the least memory usage, the greatest speed-up ratio, and the fastest calculation time, respectively. These results show an example of the trade-off relationships of computation performances on a heterogeneous CPU-GPU system.
“…By contrast, Equation (16) has O.p 2 / complexity because it requires no summation and it depends on , Â, and . Therefore, after M m n is translated into W j k , the W j k is repeatedly available for a lot of translations using Equation (16). Therefore, the most significant part of the M2L operation is reduced to O.p 2 /.…”
Section: Resultsmentioning
confidence: 99%
“…After selecting a translation direction from˙x,˙y, and˙´, the following operations are sequentially performed: rotate all related M toward the selected direction using Equation (19); translate all M to exponential expansion coefficients W using Equations (14) and (15); translate W to W 0 that corresponds to L using Equation (16); translate W 0 to L with Equations (17) and (18); reversely rotate all related L using Equation (20). D-M2L is completed after performing all of these processes in all directions.…”
Section: Cpu Codesmentioning
confidence: 99%
“…The parameter k max used in D-M2L [9] was set to 18. Parameters M.k/, k D 1 to k max , were set to 6,8,12,16,20,26,30,34,38,44, 48, 52, 56, 60, 60, 52, 4, and 2. Odd numbers were avoided for M.k/ to improve the calculation efficiency [16].…”
Section: Cpu Codesmentioning
confidence: 99%
“…Therefore, the CUDA kernels of these processes can share the template in Table II. However, this required a modification in the template for diagonal form translation based on Equation (16). Lines 05 and 06 were exchanged irregularly, that is, 'For loop 2' specified source boxes, whereas CUDA threads handled 568 pairs of target and source coefficients that correspond to the one-to-one mapping in the diagonal-form translation.…”
Section: Graphics Processing Unit Codesmentioning
confidence: 99%
“…The BEM is geared to electrostatic field analysis in voxel models, and it considers square walls on cubic voxels as boundary surface elements [16]. Using the BEM, three-dimensional fields were analyzed in human voxel models derived from anatomical images.…”
SUMMARYAn indirect boundary element method that is geared to electrostatic field analysis in voxel models is accelerated by graphics processing units (GPUs). The method considers square walls on cubic voxels as boundary surface elements and uses the fast multipole method (FMM) to analyze large-scale models. On the basis of two conventional CPU codes, three GPU codes are programmed in search of higher computing performance. These GPU codes are designed as follows: In GPU code 1, direct and far fields in the FMM are simultaneously calculated on the GPU and the CPU, respectively; in GPU code 2, both fields are calculated on the GPU with a rotation-coaxial translation-rotation decomposition algorithm; and in GPU code 3, both fields are calculated on the GPU with a diagonal translation scheme. The electric fields in human models are generated by applying a 50-Hz magnetic field or by injecting direct-current (DC) current through two electrodes and they were calculated successfully using a personal computer with three GPUs and six CPU cores. An analysis with 3.9 million surface elements took 89.4 s to solve its governing linear system with double-precision floating-point arithmetic. GPU codes 1, 2, and 3 demonstrated the least memory usage, the greatest speed-up ratio, and the fastest calculation time, respectively. These results show an example of the trade-off relationships of computation performances on a heterogeneous CPU-GPU system.
SUMMARYThis paper describes an application of the equivalent multipole moment method (EMMM) with polar translations to calculation of magnetic fields induced by a current dipole placed in a human head model. Although the EMMM is a conventional Laplacian field solver based on spherical harmonic functions, the polar translations enable it to treat eccentric and exclusive spheres in arbitrary arrangements. The head model is composed of seven spheres corresponding to skin, two eyeballs, skull, cerebral spinal fluid, gray matter, and white matter. The validity of the calculated magnetic fields and the magnetic flux linkages with a loop coil located near the model is successfully confirmed by the reciprocity theorem derived by Eaton.
SUMMARYThis paper describes a benchmark model proposed for the clarification of the characteristic of various methods for modeling the laminated iron core. In order to obtain a reference solution of the benchmark model, a large-scale nonlinear magnetostatic field analysis with a mesh fine enough to represent the microscopic structure of the laminated iron core is carried out by using the hybrid finite element-boundary element (FE-BE) method combined with the fast multipole method (FMM) based on diagonal forms for translation operators. The computational costs and accuracy of two kinds of homogenization methods are discussed, comparing them with the reference solution. As a consequence, it is verified that the homogenization methods can analyze magnetic fields in laminated iron core within acceptable computational costs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.