TY - GEN
T1 - A novel design of adaptive and hierarchical convolutional neural networks using partial reconfiguration on FPGA
AU - Farhadi, Mohammad
AU - Ghasemi, Mehdi
AU - Yang, Yezhou
N1 - Funding Information:
In this paper, we proposed a new approach to run heavy neural networks on FPGAs with constrained resources. We stacked various shallow and deep models, yielding an adaptive and hierarchical structure for quantized neural networks. We conducted experiments on CIFAR-10, CIFAR-100, and SVHN, and empirically validated that AH-CNN maintains an inference time as low as that of the shallow models while achieving the high recognition accuracy of the deep model on image classification tasks. The flexible nature of this hierarchical method makes it suitable for applications that need adaptive behavior toward dynamic priority changes over object categories, such as an agent with active perception.
Funding Information:
The National Science Foundation under the Robust Intelligence Program (1750082), and the IoT Innovation (I-square) fund provided by ASU Fulton Schools of Engineering are gratefully acknowledged. We also acknowledge NVIDIA and Xilinx for the donation of GPUs and FPGAs.
Publisher Copyright:
© 2019 IEEE.
PY - 2019/9
Y1 - 2019/9
N2 - Nowadays most research in visual recognition using Convolutional Neural Networks (CNNs) follows the 'deeper model with deeper confidence' belief to gain higher recognition accuracy. At the same time, a deeper model brings heavier computation. On the other hand, for a large portion of recognition challenges, a system can classify images correctly using simple models, or so-called shallow networks. Moreover, the implementation of CNNs faces size, weight, and energy constraints on embedded devices. In this paper, we implement adaptive switching between shallow and deep networks to reach the highest throughput on a resource-constrained MPSoC with a CPU and an FPGA. To this end, we develop and present a novel architecture for CNNs in which a gate decides whether using the deeper model is beneficial. Due to resource limitations on the FPGA, the idea of partial reconfiguration is used to accommodate deep CNNs within the FPGA resources. We report experimental results on the CIFAR-10, CIFAR-100, and SVHN datasets to validate our approach. Using a confidence metric as the decision-making factor, only 69.8%, 71.8%, and 43.8% of the computation of the deepest network is performed for CIFAR-10, CIFAR-100, and SVHN, respectively, while the desired accuracy is maintained with a throughput of around 400 images per second on the SVHN dataset. https://github.com/mfarhadi/AHCNN.
AB - Nowadays most research in visual recognition using Convolutional Neural Networks (CNNs) follows the 'deeper model with deeper confidence' belief to gain higher recognition accuracy. At the same time, a deeper model brings heavier computation. On the other hand, for a large portion of recognition challenges, a system can classify images correctly using simple models, or so-called shallow networks. Moreover, the implementation of CNNs faces size, weight, and energy constraints on embedded devices. In this paper, we implement adaptive switching between shallow and deep networks to reach the highest throughput on a resource-constrained MPSoC with a CPU and an FPGA. To this end, we develop and present a novel architecture for CNNs in which a gate decides whether using the deeper model is beneficial. Due to resource limitations on the FPGA, the idea of partial reconfiguration is used to accommodate deep CNNs within the FPGA resources. We report experimental results on the CIFAR-10, CIFAR-100, and SVHN datasets to validate our approach. Using a confidence metric as the decision-making factor, only 69.8%, 71.8%, and 43.8% of the computation of the deepest network is performed for CIFAR-10, CIFAR-100, and SVHN, respectively, while the desired accuracy is maintained with a throughput of around 400 images per second on the SVHN dataset. https://github.com/mfarhadi/AHCNN.
UR - http://www.scopus.com/inward/record.url?scp=85076684354&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85076684354&partnerID=8YFLogxK
U2 - 10.1109/HPEC.2019.8916237
DO - 10.1109/HPEC.2019.8916237
M3 - Conference contribution
AN - SCOPUS:85076684354
T3 - 2019 IEEE High Performance Extreme Computing Conference, HPEC 2019
BT - 2019 IEEE High Performance Extreme Computing Conference, HPEC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 IEEE High Performance Extreme Computing Conference, HPEC 2019
Y2 - 24 September 2019 through 26 September 2019
ER -