Abstract
A matrix factorization is to decompose a matrix into a product of smaller matrices. It is widely used in machine learning algorithms. There are many matrix decomposition algorithms, and each has various applications. CUR matrix decomposition is a widely-used factorization tool that has been employed for dimension reduction and pattern recognition in many scientific and engineering applications, such as image processing, text mining, and wireless communications. In this paper we propose an efficient FPGA-based floating-point accelerator using high-level synthesis (HLS) for the CUR decomposition algorithm. Our experiment results demonstrate the better efficiency of our hardware design compared to the optimized CPU-based software solutions. The speedup of our FPGA-based architecture over the optimized software implementation ranges from 2.37 to 16.82 times for different dimensions of the data input matrix. We evaluated our design using large dimension matrices 1024 x 1024 and 2048 x 2048 and the experiment results demonstrated the efficiency of our design in terms of the utilized resources and latency. Finally, we have compared our design with other matrix decomposition algorithms such as SVD and QR decomposition, the experiment results demonstrated that CUR is more efficient than SVD and QR decomposition in terms of latency and required resources. © 2022 IEEE.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2022 32nd International Conference on Field-Programmable Logic and Applications |
| Subtitle of host publication | FPL 2022 |
| Publisher | IEEE |
| Pages | 294-299 |
| ISBN (Electronic) | 9781665473903 |
| ISBN (Print) | 978-1-6654-7391-0 |
| DOIs | |
| Publication status | Published - Aug 2022 |
| Event | 32nd International Conference on Field-Programmable Logic and Applications (FPL 2022) - Belfast, United Kingdom Duration: 29 Aug 2022 → 2 Sept 2022 https://2022.fpl.org/ |
Publication series
| Name | Proceedings - International Conference on Field-Programmable Logic and Applications, FPL |
|---|---|
| ISSN (Print) | 1946-147X |
| ISSN (Electronic) | 1946-1488 |
Conference
| Conference | 32nd International Conference on Field-Programmable Logic and Applications (FPL 2022) |
|---|---|
| Place | United Kingdom |
| City | Belfast |
| Period | 29/08/22 → 2/09/22 |
| Internet address |
Funding
This work is supported by Hong Kong Innovation and Technology Commission (InnoHK Project CIMDA), Hong Kong Research Grants Council (Project 11204821), and City University of Hong Kong (Project 9610460).
Research Keywords
- CUR decomposition
- high level synthesis
- low-rank decomposition
- SVD and QR decomposition
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'A High-Performance FPGA Accelerator for CUR Decomposition'. Together they form a unique fingerprint.Projects
- 1 Finished
-
GRF: Matching Large Feature Sets based on Hypergraph Models and Structurally Adaptive CUR Decompositions of Compatibility Tensors
YAN, H. (Principal Investigator / Project Coordinator)
1/01/22 → 3/06/26
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver