Skip to main navigation Skip to search Skip to main content

A High-Performance FPGA Accelerator for CUR Decomposition

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

A matrix factorization is to decompose a matrix into a product of smaller matrices. It is widely used in machine learning algorithms. There are many matrix decomposition algorithms, and each has various applications. CUR matrix decomposition is a widely-used factorization tool that has been employed for dimension reduction and pattern recognition in many scientific and engineering applications, such as image processing, text mining, and wireless communications. In this paper we propose an efficient FPGA-based floating-point accelerator using high-level synthesis (HLS) for the CUR decomposition algorithm. Our experiment results demonstrate the better efficiency of our hardware design compared to the optimized CPU-based software solutions. The speedup of our FPGA-based architecture over the optimized software implementation ranges from 2.37 to 16.82 times for different dimensions of the data input matrix. We evaluated our design using large dimension matrices 1024 x 1024 and 2048 x 2048 and the experiment results demonstrated the efficiency of our design in terms of the utilized resources and latency. Finally, we have compared our design with other matrix decomposition algorithms such as SVD and QR decomposition, the experiment results demonstrated that CUR is more efficient than SVD and QR decomposition in terms of latency and required resources. © 2022 IEEE.
Original languageEnglish
Title of host publicationProceedings - 2022 32nd International Conference on Field-Programmable Logic and Applications
Subtitle of host publicationFPL 2022
PublisherIEEE
Pages294-299
ISBN (Electronic)9781665473903
ISBN (Print)978-1-6654-7391-0
DOIs
Publication statusPublished - Aug 2022
Event32nd International Conference on Field-Programmable Logic and Applications (FPL 2022) - Belfast, United Kingdom
Duration: 29 Aug 20222 Sept 2022
https://2022.fpl.org/

Publication series

NameProceedings - International Conference on Field-Programmable Logic and Applications, FPL
ISSN (Print)1946-147X
ISSN (Electronic)1946-1488

Conference

Conference32nd International Conference on Field-Programmable Logic and Applications (FPL 2022)
PlaceUnited Kingdom
CityBelfast
Period29/08/222/09/22
Internet address

Funding

This work is supported by Hong Kong Innovation and Technology Commission (InnoHK Project CIMDA), Hong Kong Research Grants Council (Project 11204821), and City University of Hong Kong (Project 9610460).

Research Keywords

  • CUR decomposition
  • high level synthesis
  • low-rank decomposition
  • SVD and QR decomposition

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'A High-Performance FPGA Accelerator for CUR Decomposition'. Together they form a unique fingerprint.

Cite this