Improving Reliability of ReRAM-Based DNN Implementation through Novel Weight Distribution

Jingtao Li; Manqing Mao; Chaitali Chakrabarti

doi:10.1109/SiPS47522.2019.9020318


Ira A. Fulton Schools of Engineering (IAFSE)

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citation

Abstract

Binary deep neural networks implemented in resistive random access memory (ReRAM) for storage efficiency suffer from poor recognition performance in the presence of hardware errors. This paper addresses the problem by deriving a novel weight distribution and representation scheme that mitigates errors due to faulty ReRAM cells with minimal storage overhead. In the proposed scheme, the weight matrix is partitioned into grains, and each weight in a grain is represented by the sum of a multi-bit mean and a 1-bit deviation. The grain size and the mean-to-deviation ratio of the weights in a grain can be chosen so that the network is resilient to hardware errors. A hybrid processing-in-memory (PIM) architecture is proposed to support this scheme: the mean values are stored in a small SRAM and processed by a CMOS unit, while the deviations are stored and processed by the ReRAM unit. Whereas the baseline binary neural network fails in the presence of severe hardware errors, the proposed hybrid scheme shows only mild degradation in recognition performance. Simulation results show that the proposed scheme achieves 97.84% test accuracy (a 0.84% accuracy drop) on the MNIST dataset and 88.07% test accuracy (a 1.10% accuracy drop) on the CIFAR-10 dataset under 9.04% stuck-at-1 and 1.75% stuck-at-0 faults.
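
The mean-plus-deviation representation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not say how the mean or the deviation magnitude is computed or quantized, so the sketch assumes the grain mean is the arithmetic mean, the shared deviation magnitude is the mean absolute deviation, and the 1-bit deviation is a sign bit. The function names (encode_grain, inject_stuck_at, grain_dot) are hypothetical. Stuck-at faults are applied only to the deviation bits, on the assumption that only the ReRAM-resident bits are fault-prone.

```python
import random


def encode_grain(weights):
    """Split one grain into a shared mean, a shared deviation
    magnitude, and one sign bit per weight (assumed encoding)."""
    mean = sum(weights) / len(weights)
    # Assumption: magnitude is the mean absolute deviation of the grain.
    mag = sum(abs(w - mean) for w in weights) / len(weights)
    # Bit 1 means "mean + mag", bit 0 means "mean - mag".
    bits = [1 if w >= mean else 0 for w in weights]
    return mean, mag, bits


def inject_stuck_at(bits, p_sa1, p_sa0, rng):
    """Force each bit to 1 with probability p_sa1, to 0 with
    probability p_sa0, and leave it unchanged otherwise."""
    faulty = []
    for b in bits:
        r = rng.random()
        if r < p_sa1:
            faulty.append(1)
        elif r < p_sa1 + p_sa0:
            faulty.append(0)
        else:
            faulty.append(b)
    return faulty


def grain_dot(mean, mag, bits, x):
    """Dot product of one grain with inputs x, split into two paths."""
    # Mean path (SRAM + CMOS unit in the abstract): mean * sum of inputs.
    mean_part = mean * sum(x)
    # Deviation path (ReRAM unit in the abstract): signed sum of inputs.
    dev_part = mag * sum(xi if b else -xi for b, xi in zip(bits, x))
    return mean_part + dev_part


if __name__ == "__main__":
    grain = [0.75, 1.25, 0.75, 1.25]
    x = [1, 2, 3, 4]
    mean, mag, bits = encode_grain(grain)
    print(grain_dot(mean, mag, bits, x))  # fault-free result: 10.5

    # Fault rates quoted in the abstract; the seed is arbitrary.
    rng = random.Random(0)
    faulty_bits = inject_stuck_at(bits, 0.0904, 0.0175, rng)
    print(grain_dot(mean, mag, faulty_bits, x))
```

In this toy grain the mean is 1.0 and the deviation magnitude is 0.25, so even if every deviation bit is stuck at 1 the result moves from 10.5 to 12.5. A fault can only change the deviation term, which is why the size of the mean relative to the deviation matters for resilience in this sketch.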

Original language	English (US)
Title of host publication	2019 IEEE International Workshop on Signal Processing Systems, SiPS 2019
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	189-194
Number of pages	6
ISBN (Electronic)	9781728119274
DOIs	https://doi.org/10.1109/SiPS47522.2019.9020318
State	Published - Oct 2019
Event	33rd IEEE International Workshop on Signal Processing Systems, SiPS 2019 - Nanjing, China
Duration	Oct 20 2019 → Oct 23 2019

Publication series

Name	IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation
Volume	2019-October
ISSN (Print)	1520-6130

Conference

Conference	33rd IEEE International Workshop on Signal Processing Systems, SiPS 2019
Country/Territory	China
City	Nanjing
Period	10/20/19 → 10/23/19

Keywords

Neural networks
ReRAM
accuracy
hardware-centered training
reliability

ASJC Scopus subject areas

Electrical and Electronic Engineering
Signal Processing
Applied Mathematics
Hardware and Architecture


Cite this

Li, J., Mao, M., & Chakrabarti, C. (2019). Improving Reliability of ReRAM-Based DNN Implementation through Novel Weight Distribution. In 2019 IEEE International Workshop on Signal Processing Systems, SiPS 2019 (pp. 189-194). Article 9020318 (IEEE Workshop on Signal Processing Systems, SiPS: Design and Implementation; Vol. 2019-October). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/SiPS47522.2019.9020318

