Abstract
Transformers have achieved great success in natural language processing and computer vision. The core and basic technique of transformers is the self-attention mechanism. The vanilla self-attention mechanism has quadratic complexity, which limits its applications to vision tasks. Most of the existing linear self-attention mechanisms will sacrifice performance to some extent to reduce complexity. In this paper, we propose a novel linear approximation of the vanilla self-attention mechanism named CURSA to achieve both high performance and low complexity at the same time. CURSA is based on the CUR decomposition to decompose the multiplication of large matrices into the multiplication of several small matrices to achieve almost linear complexity. Experiment results of CURSA in image classification tasks, semantic segmentation tasks, object detection tasks, and long-range arena show that it outperforms state-of-the-art self-attention mechanisms with better data efficiency, faster speed, and higher accuracy.
© 2025 IEEE
© 2025 IEEE
| Original language | English |
|---|---|
| Pages (from-to) | 4792-4809 |
| Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence |
| Volume | 48 |
| Issue number | 4 |
| Online published | 19 Dec 2025 |
| DOIs | |
| Publication status | Published - Apr 2026 |
Funding
This work is supported by the Hong Kong Innovation and Technology Commission (InnoHK Project CIMDA), the Institute of Digital Medicine, City University of Hong Kong (Projects 9229503 and 9610460), the National Natural Science Foundation of China (No. 12561095), and the Special Posts of Guizhou University (No. [2025]06).
Research Keywords
- Attention mechanism
- CUR decomposition
- linear approximation
- vision transformer
Fingerprint
Dive into the research topics of 'The CUR Decomposition of Self-Attention Matrices in Vision Transformers'. Together they form a unique fingerprint.Projects
- 1 Active
-
DON_RMG: IDM Cluster 2 : Health Data and Models - RMGS
YAN, J. (Principal Investigator / Project Coordinator) & ZHAO, X. (Co-Principal Investigator)
1/09/24 → …
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver