MaungMaung AprilPyone scite author profile

In this paper, we propose a novel defensive transformation that enables us to maintain a high classification accuracy under the use of both clean images and adversarial examples for adversarially robust defense. The proposed transformation is a block-wise preprocessing technique with a secret key to input images. We developed three algorithms to realize the proposed transformation: Pixel Shuffling, Bit Flipping, and FFX Encryption. Experiments were carried out on the CIFAR-10 and ImageNet datasets by using both black-box and white-box attacks with various metrics including adaptive ones. The results show that the proposed defense achieves high accuracy close to that of using clean images even under adaptive attacks for the first time. In the best-case scenario, a model trained by using images transformed by FFX Encryption (block size of 4) yielded an accuracy of 92.30 % on clean images and 91.48 % under PGD attack with a noise distance of 8/255, which is close to the non-robust accuracy (95.45 %) for the CIFAR-10 dataset, and it yielded an accuracy of 72.18 % on clean images and 71.43 % under the same attack, which is also close to the standard accuracy (73.70 %) for the ImageNet dataset. Overall, all three proposed algorithms are demonstrated to outperform state-ofthe-art defenses including adversarial training whether or not a model is under attack.

show abstract

Piracy-Resistant DNN Watermarking by Block-Wise Image Transformation with Secret Key

AprilPyone¹,

Kiya²

2021

Preprint

View full text Add to dashboard Cite

In this paper, we propose a novel DNN watermarking method that utilizes a learnable image transformation method with a secret key. The proposed method embeds a watermark pattern in a model by using learnable transformed images and allows us to remotely verify the ownership of the model. As a result, it is piracy-resistant, so the original watermark cannot be overwritten by a pirated watermark, and adding a new watermark decreases the model accuracy unlike most of the existing DNN watermarking methods. In addition, it does not require a special pre-defined training set or trigger set. We empirically evaluated the proposed method on the CIFAR-10 dataset. The results show that it was resilient against fine-tuning and pruning attacks while maintaining a high watermark-detection accuracy.

show abstract

Transfer Learning-Based Model Protection With Secret Key

AprilPyone¹,

Kiya²

2021

Preprint

View full text Add to dashboard Cite

Adversarial Test on Learnable Image Encryption

AprilPyone¹,

Sirichotedumrong²,

Kiya³

2019

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.