Robust Distillation via Untargeted and Targeted Intermediate Adversarial Samples

Dong, Junhao; Koniusz, Piotr; Chen, Junxi; Wang, Z. Jane; Ong, Yew Soon

Robust Distillation via Untargeted and Targeted Intermediate Adversarial Samples

Date

2024

Authors

Dong, Junhao

Koniusz, Piotr

Chen, Junxi

Wang, Z. Jane

Ong, Yew Soon

Publisher

IEEE Computer Society

Abstract

Adversarially robust knowledge distillation aims to com-press large-scale models into lightweight models while preserving adversarial robustness and natural performance on a given dataset. Existing methods typically align probability distributions of natural and adversarial samples between teacher and student models, but they overlook intermediate adversarial samples along the 'adversarial path' formed by the multi-step gradient ascent of a sample towards the decision boundary. Such paths capture rich information about the decision boundary. In this paper, we propose a novel adversarially robust knowledge distillation approach by incorporating such adversarial paths into the alignment process. Recognizing the diverse impacts of intermediate adversarial samples (ranging from benign to noisy), we propose an adaptive weighting strategy to selectively em-phasize informative adversarial samples, thus ensuring efficient utilization of lightweight model capacity. Moreover, we propose a dual-branch mechanism exploiting two following insights: (i) complementary dynamics of adversar-ial paths obtained by targeted and untargeted adversarial learning, and (ii) inherent differences between the gradient ascent path from class ci towards the nearest class bound-ary and the gradient descent path from a specific class cj towards the decision region of ci(i≠ j). Comprehensive experiments demonstrate the effectiveness of our method on lightweight models under various settings.

Keywords

Adversarial learning, Adversarially robust knowledge distillation, Intermediate adversarial sample

URI

http://www.scopus.com/inward/record.url?scp=85206358670&partnerID=8YFLogxK
https://hdl.handle.net/1885/733752768

Collections

ANU Research Publications

Type

Conference paper

Book Title

Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024

Entity type

Publication

DOI

10.1109/CVPR52733.2024.02686

Full item page

Cultural advice

Robust Distillation via Untargeted and Targeted Intermediate Adversarial Samples

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Access Statement

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Citation

URI

Collections

Source

Type

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until