TY - GEN
T1 - Deep Convolutional Neural Network Accelerator Featuring Conditional Computing and Low External Memory Access
AU - Kim, Minkyu
AU - Seo, Jae Sun
N1 - Funding Information:
This work was supported in part by NSF grant 1652866, Samsung Electronics, and C-BRIC, one of six centers in JUMP, an SRC program sponsored by DARPA.
Publisher Copyright:
© 2020 IEEE.
PY - 2020/3
Y1 - 2020/3
AB - This paper presents an ASIC accelerator for deep convolutional neural networks (DCNNs) featuring a novel conditional computing scheme that synergistically combines precision-cascading with zero-skipping. To reduce the many redundant convolution operations that feed max-pooling layers, we propose precision-cascading: the input features are divided into a number of low-precision groups, and approximate convolutions using only the most significant bits (MSBs) are performed first. Guided by this approximate computation, the full-precision convolution is performed only for the position that produces the maximum pooling output. In this way, the total number of bit-wise convolutions is reduced by 2× without affecting the output feature values and with <0.8% degradation in final ImageNet classification accuracy. Precision-cascading provides the added benefit of increased sparsity within each low-precision group, which we exploit with zero-skipping to eliminate the clock cycles and external memory accesses that involve zero inputs. By jointly optimizing the conditional computing scheme and the hardware architecture, the 40nm prototype chip demonstrates a peak energy efficiency of 8.85 TOPS/W at a 0.9V supply and low external memory access of 55.31 MB (0.0018 access/MAC) for ImageNet classification with the VGG-16 CNN.
KW - ASIC
KW - Deep convolutional neural networks (DCNNs)
KW - conditional computing
KW - deep learning
KW - energy-efficient accelerator
UR - http://www.scopus.com/inward/record.url?scp=85084506452&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85084506452&partnerID=8YFLogxK
U2 - 10.1109/CICC48029.2020.9075931
DO - 10.1109/CICC48029.2020.9075931
M3 - Conference contribution
AN - SCOPUS:85084506452
T3 - Proceedings of the Custom Integrated Circuits Conference
BT - 2020 IEEE Custom Integrated Circuits Conference, CICC 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 IEEE Custom Integrated Circuits Conference, CICC 2020
Y2 - 22 March 2020 through 25 March 2020
ER -
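
The abstract above describes an algorithmic idea (MSB-first approximate convolution to pick the max-pooling winner, full precision only for that winner, with zero-skipping in the MAC loop) that a small sketch can make concrete. The following Python is not the authors' implementation: the function names, the 4-bit MSB group width, the 3x3 kernel size, and the 2x2 pooling window are illustrative assumptions.

```python
# Sketch of precision-cascading combined with zero-skipping (illustrative
# only; not the paper's hardware). Assumptions: 8-bit unsigned activations
# split into a 4-bit MSB group and a 4-bit LSB group, a 2x2 max-pooling
# window, and per-position convolutions modeled as flat dot products.
import numpy as np

def dot_skip_zeros(x, w):
    """MAC loop that spends a 'cycle' only on nonzero inputs (zero-skipping)."""
    acc, cycles = 0, 0
    for xi, wi in zip(x, w):
        if xi == 0:          # zero input: no cycle, no memory access
            continue
        acc += int(xi) * int(wi)
        cycles += 1
    return acc, cycles

def precision_cascade_pool(windows, w):
    """Approximate each position's convolution with the 4-bit MSB group first,
    then run the full-precision convolution only for the winning position."""
    msb_scores, total_cycles = [], 0
    for x in windows:
        msb = x >> 4                      # most significant 4 bits only
        s, c = dot_skip_zeros(msb, w)     # MSB groups are sparser -> fewer cycles
        msb_scores.append(s << 4)         # rescale the approximate partial sum
        total_cycles += c
    winner = int(np.argmax(msb_scores))   # pooling decision from approximations
    full, c = dot_skip_zeros(windows[winner], w)  # full precision, winner only
    total_cycles += c
    return full, winner, total_cycles

rng = np.random.default_rng(0)
# Four candidate positions in a 2x2 pooling window, 9 inputs each (3x3 kernel).
windows = rng.integers(0, 256, size=(4, 9), dtype=np.uint16)
weights = rng.integers(-8, 8, size=9)
out, winner, cycles = precision_cascade_pool(windows, weights)
print(f"pool output {out} from position {winner} in {cycles} MAC cycles")
```

Under these assumptions, only one of the four pooling candidates ever runs at full precision, which is where the roughly 2× reduction in bit-wise convolutions comes from, and every zero input skipped in the MAC loop also avoids an external memory access.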