2012 IEEE 14th International Conference on High Performance Computing and Communication &Amp; 2012 IEEE 9th International Confe 2012
DOI: 10.1109/hpcc.2012.119
|View full text |Cite
|
Sign up to set email alerts
|

Implementation and Analysis of AES Encryption on GPU

Abstract: GPU is continuing its trend of vastly outperforming CPU while becoming more general purpose. In order to improve the efficiency of AES algorithm, this paper proposed a CUDA implementation of Electronic Codebook (ECB) mode encoding process and Cipher Feedback (CBC) mode decoding process on GPU. In our implementation, the frequently accessed T-boxes were allocated on on-chip shared memory and the granularity that one thread handles a 16 Bytes AES block was adopted. Finally, we achieved the highest performance of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
27
0
1

Year Published

2013
2013
2019
2019

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 80 publications
(28 citation statements)
references
References 9 publications
0
27
0
1
Order By: Relevance
“…No related work for CIPHERUNICORN-A, Hierocrypt-3, SC2000, nor CLEFIA using GPGPU is described in the literature without our previous work. [19] CUDA AES GTX285 37.8 Nishikawa et al [19] CUDA Camellia GTX285 38.0 Osvik et al [25] CUDA AES GTX295 * 3 30.9 * 4 Li et al [23] CUDA AES C2050 60 Gilger et al [26] CUDA AES GTX295 * 3 29.6 Gilger et al [26] OpenCL AES GTX295 * 3 25.4 Gilger et al [26] CUDA Camellia GTX295 * 3 26.3 Gilger et al [26] OpenCL Camellia GTX295 * 3 26.3 * 3 They use only 1 of 2 chips of GPU (240 cores) * 4 Including data transfer time Qinjian Li et al [23] implemented AES on Nvidia C2050 using CUDA and achieved 60 Gb/sec throughput by reordering plaintext data. A quantitative cost of reordering was not shown.…”
Section: Performance Resultsmentioning
confidence: 99%
“…No related work for CIPHERUNICORN-A, Hierocrypt-3, SC2000, nor CLEFIA using GPGPU is described in the literature without our previous work. [19] CUDA AES GTX285 37.8 Nishikawa et al [19] CUDA Camellia GTX285 38.0 Osvik et al [25] CUDA AES GTX295 * 3 30.9 * 4 Li et al [23] CUDA AES C2050 60 Gilger et al [26] CUDA AES GTX295 * 3 29.6 Gilger et al [26] OpenCL AES GTX295 * 3 25.4 Gilger et al [26] CUDA Camellia GTX295 * 3 26.3 Gilger et al [26] OpenCL Camellia GTX295 * 3 26.3 * 3 They use only 1 of 2 chips of GPU (240 cores) * 4 Including data transfer time Qinjian Li et al [23] implemented AES on Nvidia C2050 using CUDA and achieved 60 Gb/sec throughput by reordering plaintext data. A quantitative cost of reordering was not shown.…”
Section: Performance Resultsmentioning
confidence: 99%
“…Li et al also implemented AES-128 with TBox on Nvidia Tesla C2050 using CUDA, giving throughput of 60.0 Gbps ignoring data transfer [6]. However, they reordered the plaintext in host memory before loading it onto device memory to ingenerate coalesced access into global memory.…”
Section: Throughput Of Block Ciphers On Gpusmentioning
confidence: 99%
“…Instead, some works have been done for the evaluation of throughputs on GPUs with Nvidia Fermi and AMD VLIW architectures [6]- [8], [15].…”
Section: Throughput Of Block Ciphers On Gpusmentioning
confidence: 99%
See 1 more Smart Citation
“…Iwai et al achieved approximately a throughput of 35Gbps (Gigabits per second) on a NVIDIA Geforce GTX285 [12]. Li et al [17] achieved the highest performance, around 60Gbps throughput on a NVIDIA Tesla C2050 GPU, which runs up to 50 times faster than an Intel Core i7-920. More recent work accelerated asymmetric ciphers by exploiting the power of GPUs [31].…”
mentioning
confidence: 99%