TY - GEN
T1 - Algorithm-hardware co-design of single shot detector for fast object detection on FPGAs
AU - Ma, Yufei
AU - Zheng, Tu
AU - Cao, Yu
AU - Vrudhula, Sarma
AU - Seo, Jae-sun
N1 - Funding Information:
This work was supported in part by the NSF I/UCRC Center for Embedded Systems through NSF grants 1230401, 1237856, 1701241, 1361926 and 1535669, NSF grants 1652866 and 1715443, Intel Labs, and C-BRIC, one of six centers in JUMP, a SRC program sponsored by DARPA.
Publisher Copyright:
© 2018 ACM.
PY - 2018/11/5
Y1 - 2018/11/5
N2 - The rapid improvement in computation capability has made convolutional neural networks (CNNs) a great success in recent years on image classification tasks, which has also fostered the development of object detection algorithms with significantly improved accuracy. However, during the deployment phase, many applications demand low-latency processing of a single image under strict power consumption requirements, which reduces the efficiency of GPUs and other general-purpose platforms and creates opportunities for dedicated acceleration hardware, e.g., FPGAs, where the digital circuit is customized for the inference algorithm. Therefore, this work proposes to customize the detection algorithm, e.g., SSD, to benefit its hardware implementation with low data precision at the cost of marginal accuracy degradation. The proposed FPGA-based deep learning inference accelerator is demonstrated on two Intel FPGAs for the SSD algorithm, achieving up to 2.18 TOPS throughput and up to 3.3X better energy efficiency compared to a GPU.
AB - The rapid improvement in computation capability has made convolutional neural networks (CNNs) a great success in recent years on image classification tasks, which has also fostered the development of object detection algorithms with significantly improved accuracy. However, during the deployment phase, many applications demand low-latency processing of a single image under strict power consumption requirements, which reduces the efficiency of GPUs and other general-purpose platforms and creates opportunities for dedicated acceleration hardware, e.g., FPGAs, where the digital circuit is customized for the inference algorithm. Therefore, this work proposes to customize the detection algorithm, e.g., SSD, to benefit its hardware implementation with low data precision at the cost of marginal accuracy degradation. The proposed FPGA-based deep learning inference accelerator is demonstrated on two Intel FPGAs for the SSD algorithm, achieving up to 2.18 TOPS throughput and up to 3.3X better energy efficiency compared to a GPU.
KW - FPGA
KW - HW/SW co-design
KW - hardware accelerator
KW - neural network
UR - http://www.scopus.com/inward/record.url?scp=85058172945&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85058172945&partnerID=8YFLogxK
U2 - 10.1145/3240765.3240775
DO - 10.1145/3240765.3240775
M3 - Conference contribution
AN - SCOPUS:85058172945
T3 - IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD
BT - 2018 IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2018 - Digest of Technical Papers
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 37th IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2018
Y2 - 5 November 2018 through 8 November 2018
ER -