RADAR: Run-time Adversarial Weight Attack Detection and Accuracy Recovery

Jingtao Li; Adnan Siraj Rakin; Zhezhi He; Deliang Fan; Chaitali Chakrabarti

doi:10.23919/DATE51398.2021.9474113

RADAR: Run-time Adversarial Weight Attack Detection and Accuracy Recovery

Jingtao Li, Adnan Siraj Rakin, Zhezhi He, Deliang Fan, Chaitali Chakrabarti

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

13 Scopus citations

Abstract

Adversarial attacks on Neural Network weights, such as the progressive bit-flip attack (PBFA), can cause a catastrophic degradation in accuracy by flipping a very small number of bits. Furthermore, PBFA can be conducted at run time on the weights stored in DRAM main memory. In this work, we propose RADAR, a Run-time adversarial weight Attack Detection and Accuracy Recovery scheme to protect DNN weights against PBFA. We organize weights that are interspersed in a layer into groups and employ a checksum-based algorithm on weights to derive a 2-bit signature for each group. At run time, the 2-bit signature is computed and compared with the securely stored golden signature to detect the bit-flip attacks in a group. After successful detection, we zero out all the weights in a group to mitigate the accuracy drop caused by malicious bit-flips. The proposed scheme is embedded in the inference computation stage. For the ResNet-18 ImageNet model, our method can detect 9.6 bit-flips out of 10 on average. For this model, the proposed accuracy recovery scheme can restore the accuracy from below 1% caused by 10 bit flips to above 69%. The proposed method has extremely low time and storage overhead. System-level simulation on gem5 shows that RADAR only adds < 1% to the inference time, making this scheme highly suitable for run-time attack detection and mitigation.

Original language	English (US)
Title of host publication	Proceedings of the 2021 Design, Automation and Test in Europe, DATE 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	790-795
Number of pages	6
ISBN (Electronic)	9783981926354
DOIs	https://doi.org/10.23919/DATE51398.2021.9474113
State	Published - Feb 1 2021
Event	2021 Design, Automation and Test in Europe Conference and Exhibition, DATE 2021 - Virtual, Online Duration: Feb 1 2021 → Feb 5 2021

Publication series

Name	Proceedings -Design, Automation and Test in Europe, DATE
Volume	2021-February
ISSN (Print)	1530-1591

Conference

Conference	2021 Design, Automation and Test in Europe Conference and Exhibition, DATE 2021
City	Virtual, Online
Period	2/1/21 → 2/5/21

Keywords

Neural networks
protection
run-time detection
weight attack

ASJC Scopus subject areas

General Engineering

Access to Document

10.23919/DATE51398.2021.9474113

Cite this

Li, J., Rakin, A. S., He, Z., Fan, D., & Chakrabarti, C. (2021). RADAR: Run-time Adversarial Weight Attack Detection and Accuracy Recovery. In Proceedings of the 2021 Design, Automation and Test in Europe, DATE 2021 (pp. 790-795). Article 9474113 (Proceedings -Design, Automation and Test in Europe, DATE; Vol. 2021-February). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.23919/DATE51398.2021.9474113

RADAR: Run-time Adversarial Weight Attack Detection and Accuracy Recovery. / Li, Jingtao; Rakin, Adnan Siraj; He, Zhezhi et al.
Proceedings of the 2021 Design, Automation and Test in Europe, DATE 2021. Institute of Electrical and Electronics Engineers Inc., 2021. p. 790-795 9474113 (Proceedings -Design, Automation and Test in Europe, DATE; Vol. 2021-February).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Li, J, Rakin, AS, He, Z, Fan, D & Chakrabarti, C 2021, RADAR: Run-time Adversarial Weight Attack Detection and Accuracy Recovery. in Proceedings of the 2021 Design, Automation and Test in Europe, DATE 2021., 9474113, Proceedings -Design, Automation and Test in Europe, DATE, vol. 2021-February, Institute of Electrical and Electronics Engineers Inc., pp. 790-795, 2021 Design, Automation and Test in Europe Conference and Exhibition, DATE 2021, Virtual, Online, 2/1/21. https://doi.org/10.23919/DATE51398.2021.9474113

Li J, Rakin AS, He Z, Fan D, Chakrabarti C. RADAR: Run-time Adversarial Weight Attack Detection and Accuracy Recovery. In Proceedings of the 2021 Design, Automation and Test in Europe, DATE 2021. Institute of Electrical and Electronics Engineers Inc. 2021. p. 790-795. 9474113. (Proceedings -Design, Automation and Test in Europe, DATE). doi: 10.23919/DATE51398.2021.9474113

@inproceedings{a08515ea81a64898ba8d31dcdc836698,

title = "RADAR: Run-time Adversarial Weight Attack Detection and Accuracy Recovery",

abstract = "Adversarial attacks on Neural Network weights, such as the progressive bit-flip attack (PBFA), can cause a catastrophic degradation in accuracy by flipping a very small number of bits. Furthermore, PBFA can be conducted at run time on the weights stored in DRAM main memory. In this work, we propose RADAR, a Run-time adversarial weight Attack Detection and Accuracy Recovery scheme to protect DNN weights against PBFA. We organize weights that are interspersed in a layer into groups and employ a checksum-based algorithm on weights to derive a 2-bit signature for each group. At run time, the 2-bit signature is computed and compared with the securely stored golden signature to detect the bit-flip attacks in a group. After successful detection, we zero out all the weights in a group to mitigate the accuracy drop caused by malicious bit-flips. The proposed scheme is embedded in the inference computation stage. For the ResNet-18 ImageNet model, our method can detect 9.6 bit-flips out of 10 on average. For this model, the proposed accuracy recovery scheme can restore the accuracy from below 1% caused by 10 bit flips to above 69%. The proposed method has extremely low time and storage overhead. System-level simulation on gem5 shows that RADAR only adds < 1% to the inference time, making this scheme highly suitable for run-time attack detection and mitigation.",

keywords = "Neural networks, protection, run-time detection, weight attack",

author = "Jingtao Li and Rakin, {Adnan Siraj} and Zhezhi He and Deliang Fan and Chaitali Chakrabarti",

note = "Publisher Copyright: {\textcopyright} 2021 EDAA.; 2021 Design, Automation and Test in Europe Conference and Exhibition, DATE 2021 ; Conference date: 01-02-2021 Through 05-02-2021",

year = "2021",

month = feb,

day = "1",

doi = "10.23919/DATE51398.2021.9474113",

language = "English (US)",

series = "Proceedings -Design, Automation and Test in Europe, DATE",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "790--795",

booktitle = "Proceedings of the 2021 Design, Automation and Test in Europe, DATE 2021",

}

TY - GEN

T1 - RADAR

T2 - 2021 Design, Automation and Test in Europe Conference and Exhibition, DATE 2021

AU - Li, Jingtao

AU - Rakin, Adnan Siraj

AU - He, Zhezhi

AU - Fan, Deliang

AU - Chakrabarti, Chaitali

PY - 2021/2/1

Y1 - 2021/2/1

N2 - Adversarial attacks on Neural Network weights, such as the progressive bit-flip attack (PBFA), can cause a catastrophic degradation in accuracy by flipping a very small number of bits. Furthermore, PBFA can be conducted at run time on the weights stored in DRAM main memory. In this work, we propose RADAR, a Run-time adversarial weight Attack Detection and Accuracy Recovery scheme to protect DNN weights against PBFA. We organize weights that are interspersed in a layer into groups and employ a checksum-based algorithm on weights to derive a 2-bit signature for each group. At run time, the 2-bit signature is computed and compared with the securely stored golden signature to detect the bit-flip attacks in a group. After successful detection, we zero out all the weights in a group to mitigate the accuracy drop caused by malicious bit-flips. The proposed scheme is embedded in the inference computation stage. For the ResNet-18 ImageNet model, our method can detect 9.6 bit-flips out of 10 on average. For this model, the proposed accuracy recovery scheme can restore the accuracy from below 1% caused by 10 bit flips to above 69%. The proposed method has extremely low time and storage overhead. System-level simulation on gem5 shows that RADAR only adds < 1% to the inference time, making this scheme highly suitable for run-time attack detection and mitigation.

AB - Adversarial attacks on Neural Network weights, such as the progressive bit-flip attack (PBFA), can cause a catastrophic degradation in accuracy by flipping a very small number of bits. Furthermore, PBFA can be conducted at run time on the weights stored in DRAM main memory. In this work, we propose RADAR, a Run-time adversarial weight Attack Detection and Accuracy Recovery scheme to protect DNN weights against PBFA. We organize weights that are interspersed in a layer into groups and employ a checksum-based algorithm on weights to derive a 2-bit signature for each group. At run time, the 2-bit signature is computed and compared with the securely stored golden signature to detect the bit-flip attacks in a group. After successful detection, we zero out all the weights in a group to mitigate the accuracy drop caused by malicious bit-flips. The proposed scheme is embedded in the inference computation stage. For the ResNet-18 ImageNet model, our method can detect 9.6 bit-flips out of 10 on average. For this model, the proposed accuracy recovery scheme can restore the accuracy from below 1% caused by 10 bit flips to above 69%. The proposed method has extremely low time and storage overhead. System-level simulation on gem5 shows that RADAR only adds < 1% to the inference time, making this scheme highly suitable for run-time attack detection and mitigation.

KW - Neural networks

KW - protection

KW - run-time detection

KW - weight attack

UR - http://www.scopus.com/inward/record.url?scp=85111011069&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85111011069&partnerID=8YFLogxK

U2 - 10.23919/DATE51398.2021.9474113

DO - 10.23919/DATE51398.2021.9474113

M3 - Conference contribution

AN - SCOPUS:85111011069

T3 - Proceedings -Design, Automation and Test in Europe, DATE

SP - 790

EP - 795

BT - Proceedings of the 2021 Design, Automation and Test in Europe, DATE 2021

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 1 February 2021 through 5 February 2021

ER -

RADAR: Run-time Adversarial Weight Attack Detection and Accuracy Recovery

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this