Skip to main navigation Skip to search Skip to main content

Re-thinking Memory-Bound Limitations in CGRAs

  • XIANGFENG LIU
  • , ZHE JIANG*
  • , ANZHEN ZHU
  • , XIAOMENG HAN
  • , MINGSONG LYU
  • , QINGXU DENG
  • , NAN GUAN
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Coarse-Grained Reconfigurable Arrays (CGRAs) are specialized accelerators commonly employed to boost performance in workloads with iterative structures. Existing research typically focuses on compiler or architecture optimizations aimed at improving CGRA performance, energy efficiency, flexibility, and area utilization, under the idealistic assumption that kernels can access all data from Scratchpad Memory (SPM). However, certain complex workloads–particularly in fields like graph analytics, irregular database operations, and specialized forms of high-performance computing (e.g., unstructured mesh simulations)–exhibit irregular memory access patterns that hinder CGRA utilization, sometimes dropping below 1.5%, making the CGRA memory-bound. To address this challenge, we conduct a thorough analysis of the underlying causes of performance degradation, then propose a redesigned memory subsystem and refine the memory model. With both microarchitectural and theoretical optimization, our solution can effectively manage irregular memory accesses through CGRA-specific runahead execution mechanism and cache reconfiguration techniques. Our results demonstrate that we can achieve performance comparable to the original SPM-only system while requiring only 1.27% of the storage size. The runahead execution mechanism achieves an average 3.04× speedup (up to 6.91×), with cache reconfiguration technique providing an additional 6.02% improvement, significantly enhancing CGRA performance for irregular memory access patterns.
© 2025 Copyright held by the owner/author(s).
Original languageEnglish
Article number105
Number of pages26
JournalACM Transactions on Embedded Computing Systems
Volume24
Issue number5s
Online published26 Sept 2025
DOIs
Publication statusPublished - Nov 2025

Bibliographical note

Research Unit(s) information for this publication is provided by the author(s) concerned.

Funding

We appreciate the reviewers for their insightful and helpful feedback. This work is supported by the National Key Research and Development Program (Grant No. 2024YFB4405600), the National Natural Science Foundation of China (Grant No. 62472086), the Basic Research Program of Jiangsu (Grant No. BK20243042), the Science and Technology Major Special Program of Jiangsu (Grants No. BG2024010), the Start-up Research Fund of Southeast University (Grant No. RF1028624005), and the Provincial Science and Technology Research Project (Grant No. 2024JH2/102400070 and No. 2023JH2/101700370).

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Research Keywords

  • Coarse-Grained Reconfigurable Array (CGRA)
  • irregular memory access
  • memory subsystem
  • runahead execution
  • cache reconfiguration

Fingerprint

Dive into the research topics of 'Re-thinking Memory-Bound Limitations in CGRAs'. Together they form a unique fingerprint.

Cite this