Design and Optimization of Coarse-Grained Reconfigurable Array (CGRA) Architecture for Efficient Processing-in-Memory (PIM) Systems

Anas A. Salameh; Kho Mei Ye

doi:10.31838/jvcs/07.01.02

Management Information Systems

University of Malaya

Research output: Contribution to journal › Article › peer-review

Abstract

This article presents a Coarse-Grained Reconfigurable Array (CGRA) architecture for efficient Processing-in-Memory (PIM) systems. PIM architectures, which incorporate computational capabilities directly into memory, present a promising solution to mitigate the memory wall issue. CGRA architecture optimization for PIM systems poses several difficulties, especially in striking a balance between area efficiency, power consumption, and performance. The proposed framework tackles these problems by examining crucial design components such as processing element (PE) architecture, memory hierarchy integration, and interconnect design. Using a design space exploration (DSE) methodology, we assess various CGRA configurations to find the optimal trade-offs between computation throughput, power consumption, and silicon area utilization. To assist in selecting effective architectures that meet different application workloads, the framework combines performance analysis and advanced modeling techniques. Based on test results, the optimized CGRA architecture for PIM achieves significant improvements in processing performance (20 percent increase in throughput), area reduction, and energy efficiency (up to 40 percent reduction in power consumption) compared to conventional PIM designs. Because these enhancements are achieved without compromising computational accuracy or scalability, our architecture is well suited for data-intensive applications such as machine learning and graph analytics.

Original language	English
Pages (from-to)	11-18
Number of pages	8
Journal	Journal of VLSI Circuits and Systems
Volume	7
Issue number	1
DOIs	https://doi.org/10.31838/jvcs/07.01.02
State	Published - 20 Jan 2025

Keywords

Coarse-Grained Reconfigurable Array (CGRA)
Design Space Exploration (DSE)
Energy efficiency
Processing-in-Memory (PIM)
memory wall

Cite this

@article{8f96ddceb40f4593a337f253b0f6317e,

title = "Design and Optimization of Coarse-Grained Reconfigurable Array (CGRA) Architecture for Efficient Processing-in-Memory (PIM) Systems",

abstract = "This article presents a Coarse-Grained Reconfigurable Array (CGRA) architecture for efficient Processing-in-Memory (PIM) systems. PIM architectures, which incorporate computational capabilities directly into memory, present a promising solution to mitigate the memory wall issue. CGRA architecture optimization for PIM systems poses several difficulties, especially in striking a balance between area efficiency, power consumption, and performance. The proposed framework tackles these problems by examining crucial design components such as processing element (PE) architecture, memory hierarchy integration, and interconnect design. Using a design space exploration (DSE) methodology, we assess various CGRA configurations to find the optimal trade-offs between computation throughput, power consumption, and silicon area utilization. To assist in selecting effective architectures that meet different application workloads, the framework combines performance analysis and advanced modeling techniques. Based on test results, the optimized CGRA architecture for PIM achieves significant improvements in processing performance (20 percent increase in throughput), area reduction, and energy efficiency (up to 40 percent reduction in power consumption) compared to conventional PIM designs. Because these enhancements are achieved without compromising computational accuracy or scalability, our architecture is well suited for data-intensive applications such as machine learning and graph analytics.",

keywords = "Coarse-Grained Reconfigurable Array (CGRA), Design Space Exploration (DSE), Energy efficiency, Processing-in-Memory (PIM), memory wall",

author = "Salameh, {Anas A.} and Ye, {Kho Mei}",

year = "2025",

month = jan,

day = "20",

doi = "10.31838/jvcs/07.01.02",

language = "English",

volume = "7",

pages = "11--18",

journal = "Journal of VLSI Circuits and Systems",

issn = "2582-1458",

publisher = "Society for Communication and Computer Technologies",

number = "1",

}

TY - JOUR

T1 - Design and Optimization of Coarse-Grained Reconfigurable Array (CGRA) Architecture for Efficient Processing-in-Memory (PIM) Systems

AU - Salameh, Anas A.

AU - Ye, Kho Mei

PY - 2025/1/20

Y1 - 2025/1/20

N2 - This article presents a Coarse-Grained Reconfigurable Array (CGRA) architecture for efficient Processing-in-Memory (PIM) systems. PIM architectures, which incorporate computational capabilities directly into memory, present a promising solution to mitigate the memory wall issue. CGRA architecture optimization for PIM systems poses several difficulties, especially in striking a balance between area efficiency, power consumption, and performance. The proposed framework tackles these problems by examining crucial design components such as processing element (PE) architecture, memory hierarchy integration, and interconnect design. Using a design space exploration (DSE) methodology, we assess various CGRA configurations to find the optimal trade-offs between computation throughput, power consumption, and silicon area utilization. To assist in selecting effective architectures that meet different application workloads, the framework combines performance analysis and advanced modeling techniques. Based on test results, the optimized CGRA architecture for PIM achieves significant improvements in processing performance (20 percent increase in throughput), area reduction, and energy efficiency (up to 40 percent reduction in power consumption) compared to conventional PIM designs. Because these enhancements are achieved without compromising computational accuracy or scalability, our architecture is well suited for data-intensive applications such as machine learning and graph analytics.

AB - This article presents a Coarse-Grained Reconfigurable Array (CGRA) architecture for efficient Processing-in-Memory (PIM) systems. PIM architectures, which incorporate computational capabilities directly into memory, present a promising solution to mitigate the memory wall issue. CGRA architecture optimization for PIM systems poses several difficulties, especially in striking a balance between area efficiency, power consumption, and performance. The proposed framework tackles these problems by examining crucial design components such as processing element (PE) architecture, memory hierarchy integration, and interconnect design. Using a design space exploration (DSE) methodology, we assess various CGRA configurations to find the optimal trade-offs between computation throughput, power consumption, and silicon area utilization. To assist in selecting effective architectures that meet different application workloads, the framework combines performance analysis and advanced modeling techniques. Based on test results, the optimized CGRA architecture for PIM achieves significant improvements in processing performance (20 percent increase in throughput), area reduction, and energy efficiency (up to 40 percent reduction in power consumption) compared to conventional PIM designs. Because these enhancements are achieved without compromising computational accuracy or scalability, our architecture is well suited for data-intensive applications such as machine learning and graph analytics.

KW - Coarse-Grained Reconfigurable Array (CGRA)

KW - Design Space Exploration (DSE)

KW - Energy efficiency

KW - Processing-in-Memory (PIM)

KW - memory wall

UR - http://www.scopus.com/inward/record.url?scp=105003132353&partnerID=8YFLogxK

U2 - 10.31838/jvcs/07.01.02

DO - 10.31838/jvcs/07.01.02

M3 - Article

AN - SCOPUS:105003132353

SN - 2582-1458

VL - 7

SP - 11

EP - 18

JO - Journal of VLSI Circuits and Systems

JF - Journal of VLSI Circuits and Systems

IS - 1

ER -
