TY - GEN
T1 - PARAG
T2 - 30th Annual IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2023
AU - Singh, Gian
AU - Kuppannagari, Sanmukh R.
AU - Vrudhula, Sarma
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Graph Convolutional Networks (GCNs) have successfully extended deep learning to graph-structured data for applications such as social network analysis and bioinformatics. The execution pattern of GCNs is a hybrid of graph processing and neural network computation, which poses unique and significant challenges for hardware implementation. Graph processing involves a large amount of irregular memory access with little computation, whereas neural network processing involves a large number of operations with regular memory access. Existing graph processing and neural network accelerators are therefore inefficient for computing GCNs. This paper presents PARAG, a processing-in-memory (PIM) architecture for GCN computation. It consists of customized logic with minuscule computing units, called Neural Processing Elements (NPEs), interfaced to each bank of the DRAM to support parallel graph processing and neural network computation. It exploits the massive internal parallelism of DRAM to accelerate GCN execution with high energy efficiency. Simulation results for GCN inference over standard datasets show latency and energy reductions of three orders of magnitude over a CPU implementation. Compared to a state-of-the-art PIM architecture, PARAG achieves an average 4x reduction in latency and a 4.23x reduction in energy-delay product (EDP).
AB - Graph Convolutional Networks (GCNs) have successfully extended deep learning to graph-structured data for applications such as social network analysis and bioinformatics. The execution pattern of GCNs is a hybrid of graph processing and neural network computation, which poses unique and significant challenges for hardware implementation. Graph processing involves a large amount of irregular memory access with little computation, whereas neural network processing involves a large number of operations with regular memory access. Existing graph processing and neural network accelerators are therefore inefficient for computing GCNs. This paper presents PARAG, a processing-in-memory (PIM) architecture for GCN computation. It consists of customized logic with minuscule computing units, called Neural Processing Elements (NPEs), interfaced to each bank of the DRAM to support parallel graph processing and neural network computation. It exploits the massive internal parallelism of DRAM to accelerate GCN execution with high energy efficiency. Simulation results for GCN inference over standard datasets show latency and energy reductions of three orders of magnitude over a CPU implementation. Compared to a state-of-the-art PIM architecture, PARAG achieves an average 4x reduction in latency and a 4.23x reduction in energy-delay product (EDP).
KW - DRAM
KW - Graph Convolutional Networks
KW - Memory Bottleneck
KW - Processing In-Memory
UR - http://www.scopus.com/inward/record.url?scp=85190579767&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85190579767&partnerID=8YFLogxK
U2 - 10.1109/HiPC58850.2023.00016
DO - 10.1109/HiPC58850.2023.00016
M3 - Conference contribution
AN - SCOPUS:85190579767
T3 - Proceedings - 2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics, HiPC 2023
SP - 11
EP - 20
BT - Proceedings - 2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics, HiPC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 18 December 2023 through 21 December 2023
ER -