TY - JOUR
T1 - Low-Complexity Scalable Architectures for Parallel Computation of Similarity Measures
AU - Kanan, Awos
AU - Gebali, Fayez
AU - Ibrahim, Atef
AU - Li, Kin Fun
N1 - Publisher Copyright:
© 2019 Awos Kanan et al.
PY - 2019
Y1 - 2019
N2 - Processor array architectures have been employed, as an accelerator, to compute similarity distance found in a variety of data mining algorithms. However, most of the proposed architectures in the existing literature are designed in an ad hoc manner without taking into consideration the size and dimensionality of the datasets. Furthermore, data dependencies have not been analyzed, and often, only one design choice is considered for the scheduling and mapping of computational tasks. In this work, we present a systematic methodology to design scalable and area-efficient linear (1-D) processor arrays for the computation of similarity distance matrices. Six possible design options are obtained and analyzed in terms of area and time complexities. The obtained architectures provide us with the flexibility to choose the one that meets hardware constraints for a specific problem size. Comparisons with the previously reported architectures demonstrate that one of the proposed architectures achieves less area and area-delay product besides its scalability to high-dimensional data.
AB - Processor array architectures have been employed, as an accelerator, to compute similarity distance found in a variety of data mining algorithms. However, most of the proposed architectures in the existing literature are designed in an ad hoc manner without taking into consideration the size and dimensionality of the datasets. Furthermore, data dependencies have not been analyzed, and often, only one design choice is considered for the scheduling and mapping of computational tasks. In this work, we present a systematic methodology to design scalable and area-efficient linear (1-D) processor arrays for the computation of similarity distance matrices. Six possible design options are obtained and analyzed in terms of area and time complexities. The obtained architectures provide us with the flexibility to choose the one that meets hardware constraints for a specific problem size. Comparisons with the previously reported architectures demonstrate that one of the proposed architectures achieves less area and area-delay product besides its scalability to high-dimensional data.
UR - https://www.scopus.com/pages/publications/85067106161
U2 - 10.1155/2019/3185137
DO - 10.1155/2019/3185137
M3 - Article
AN - SCOPUS:85067106161
SN - 1058-9244
VL - 2019
JO - Scientific Programming
JF - Scientific Programming
M1 - 3185137
ER -