TY - GEN
T1 - NeuroFabric
T2 - 40th IEEE International Conference on Computer Design, ICCD 2022
AU - Isakov, Mihailo
AU - Kinsy, Michel A.
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Sparse Deep Neural Networks (DNNs) offer large improvements in model storage requirements, execution latency, and execution throughput. DNN pruning is contingent on knowing model weights, so networks can be pruned only after training. A priori sparse neural networks have been proposed as a way to extend sparsity benefits to the training process as well. Selecting a topology a priori is also beneficial for hardware accelerator specialization, lowering power, chip area, and latency. We present NeuroFabric, a hardware-ML model co-design approach that jointly optimizes a sparse neural network topology and a hardware accelerator configuration. NeuroFabric replaces dense DNN layers with cascades of sparse layers with a specific topology. We present an efficient and data-agnostic method for sparse network topology optimization, and show that parallel butterfly networks with skip connections achieve the best accuracy independent of sparsity or depth. We also present a multi-objective optimization framework that finds a Pareto frontier of hardware-ML model configurations over six objectives: accuracy, parameter count, throughput, latency, power, and hardware area.
AB - Sparse Deep Neural Networks (DNNs) offer large improvements in model storage requirements, execution latency, and execution throughput. DNN pruning is contingent on knowing model weights, so networks can be pruned only after training. A priori sparse neural networks have been proposed as a way to extend sparsity benefits to the training process as well. Selecting a topology a priori is also beneficial for hardware accelerator specialization, lowering power, chip area, and latency. We present NeuroFabric, a hardware-ML model co-design approach that jointly optimizes a sparse neural network topology and a hardware accelerator configuration. NeuroFabric replaces dense DNN layers with cascades of sparse layers with a specific topology. We present an efficient and data-agnostic method for sparse network topology optimization, and show that parallel butterfly networks with skip connections achieve the best accuracy independent of sparsity or depth. We also present a multi-objective optimization framework that finds a Pareto frontier of hardware-ML model configurations over six objectives: accuracy, parameter count, throughput, latency, power, and hardware area.
KW - acceleration
KW - neural network
KW - sparsity
KW - topology
UR - http://www.scopus.com/inward/record.url?scp=85145876251&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85145876251&partnerID=8YFLogxK
U2 - 10.1109/ICCD56317.2022.00088
DO - 10.1109/ICCD56317.2022.00088
M3 - Conference contribution
AN - SCOPUS:85145876251
T3 - Proceedings - IEEE International Conference on Computer Design: VLSI in Computers and Processors
SP - 561
EP - 564
BT - Proceedings - 2022 IEEE 40th International Conference on Computer Design, ICCD 2022
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 23 October 2022 through 26 October 2022
ER -