Compilation of stream programs onto scratchpad memory based embedded multicore processors through retiming

Weijia Che, Karam Chatha

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

12 Scopus citations


The prevalence of stream applications in the signal processing, multimedia, and network processing domains has driven a new trend in programming and architecture design. Several languages and multicore architectures have been developed to support streaming applications. In many of these multicore architectures, scratchpad memories (SPMs) have replaced caches because of their lower power consumption. Performance optimization on SPM-based architectures requires the programmer or compiler to manage the limited local memory efficiently. Our paper addresses the problem of compiling stream programs onto multicore architectures that incorporate SPMs. We propose a retiming technique that maximizes throughput under a memory constraint with a user-specified number of software pipeline stages. The technique explores the trade-offs between double buffering and code overlay in depth to achieve the best performance. We evaluated its efficiency by compiling several stream applications for the IBM Cell BE and comparing the results against existing approaches.

Original language: English (US)
Title of host publication: 2011 48th ACM/EDAC/IEEE Design Automation Conference, DAC 2011
Publisher: Institute of Electrical and Electronics Engineers Inc.
Number of pages: 6
ISBN (Print): 9781450306362
State: Published - 2011

Publication series

Name: Proceedings - Design Automation Conference
ISSN (Print): 0738-100X


Keywords

  • Compiler
  • Embedded
  • Multicore
  • Processors
  • Retiming
  • Scratchpad Memory
  • Stream

ASJC Scopus subject areas

  • Computer Science Applications
  • Control and Systems Engineering
  • Electrical and Electronic Engineering
  • Modeling and Simulation

