Profile-Guided Parallel Task Extraction and Execution for Domain Specific Heterogeneous SoC

Liangliang Chang, Joshua Mack, Benjamin Willis, Xing Chen, John Brunhaver, Ali Akoglu, Chaitali Chakrabarti

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this study, we introduce a methodology for automatically transforming user applications in the radar and communication domain written in C C++ based on dynamic profiling to a parallel representation targeted for a heterogeneous SoC. We present our approach for instrumenting the user application binary during the compilation process with barrier synchronization primitives that enable runtime system schedule and execute independent tasks concurrently over the available compute resources. We demonstrate the capabilities of our integrated compile time and runtime flow through task-level parallel and functionally correct execution of real-life applications. We perform validation of our integrated system by executing four distinct applications each carrying various degrees of task level parallelism over the Xeon-based multi-core homogeneous processor. We use the proposed compilation and code transformation methodology to re-target each application for execution on a heterogeneous SoC composed of three ARM cores and one FFT accelerator that is emulated on the Xilinx Zynq Ultra Scale+ platform. We demonstrate our runtime's ability to process application binary, dispatch independent tasks over the available compute resources of the emulated SoC on the Zynq FPGA based on three different scheduling heuristics. Finally we demonstrate execution of each application individually with task level parallelism on the Zynq FPGA and execution of workload scenarios composed of multiple instances of the same application as well as mixture of two distinct applications to demonstrate ability to realize both application and task level parallel execution. Our integrated approach offers a path forward for application developers to take full advantage of the target SoC without requiring users to become hardware and parallel programming experts.

Original languageEnglish (US)
Title of host publicationProceedings - 20th IEEE International Symposium on Parallel and Distributed Processing with Applications, 12th IEEE International Conference on Big Data and Cloud Computing, 12th IEEE International Conference on Sustainable Computing and Communications and 15th IEEE International Conference on Social Computing and Networking, ISPA/BDCloud/SocialCom/SustainCom 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages913-920
Number of pages8
ISBN (Electronic)9781665464970
DOIs
StatePublished - 2022
Externally publishedYes
Event20th IEEE International Symposium on Parallel and Distributed Processing with Applications, 12th IEEE International Conference on Big Data and Cloud Computing, 12th IEEE International Conference on Sustainable Computing and Communications and 15th IEEE International Conference on Social Computing and Networking, ISPA/BDCloud/SocialCom/SustainCom 2022 - Melbourne, Australia
Duration: Dec 17 2022Dec 19 2022

Publication series

NameProceedings - 20th IEEE International Symposium on Parallel and Distributed Processing with Applications, 12th IEEE International Conference on Big Data and Cloud Computing, 12th IEEE International Conference on Sustainable Computing and Communications and 15th IEEE International Conference on Social Computing and Networking, ISPA/BDCloud/SocialCom/SustainCom 2022

Conference

Conference20th IEEE International Symposium on Parallel and Distributed Processing with Applications, 12th IEEE International Conference on Big Data and Cloud Computing, 12th IEEE International Conference on Sustainable Computing and Communications and 15th IEEE International Conference on Social Computing and Networking, ISPA/BDCloud/SocialCom/SustainCom 2022
Country/TerritoryAustralia
CityMelbourne
Period12/17/2212/19/22

Keywords

  • Task-level parallelism
  • dynamic profiling
  • heterogeneous SoC and runtime
  • parallelism detection

ASJC Scopus subject areas

  • Information Systems
  • Computer Networks and Communications
  • Computer Science Applications
  • Hardware and Architecture
  • Decision Sciences (miscellaneous)
  • Information Systems and Management
  • Renewable Energy, Sustainability and the Environment
  • Communication

Fingerprint

Dive into the research topics of 'Profile-Guided Parallel Task Extraction and Execution for Domain Specific Heterogeneous SoC'. Together they form a unique fingerprint.

Cite this